From 1ff5618b6a2df6febfe6fb976b42f42ecf0dffea Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 20:50:10 +0530
Subject: [PATCH 001/192] docs: add plugin system documentation to README

- CLI section: add plugin list/install/remove commands
- New ## Plugins section: install, official plugins table, usage examples,
  manage commands, write-your-own guide with SKILL.md explanation

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 README.md | 80 +++++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 80 insertions(+)
diff --git a/README.md b/README.md
index 6a4dc5b..c6efe63 100644
--- a/README.md
+++ b/README.md
@@ -23,6 +23,20 @@ Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-
 | Agent support | Any MCP client | OpenClaw only | Any MCP client | Claude only | **Any MCP client** |
 | Playwright API | Partial | No | Full | No | **Full** |
 
+## Your Credentials Stay Yours
+
+Every other approach asks you to hand over something: an API key, an OAuth token, stored passwords, session cookies in a config file. BrowserForce asks for none of it.
+
+**Why?** Because you're already logged in. BrowserForce talks to your running Chrome — it doesn't extract credentials, store cookies, or replay tokens. The browser handles auth exactly as it always has. Your agent inherits your sessions the same way a new Chrome tab does.
+
+What you never need to provide:
+- No passwords
+- No API keys
+- No OAuth tokens
+- No session cookies in env vars or config files
+
+It's a security win *and* a setup win — there are no secrets to rotate, leak, or manage. Your logins live in Chrome. They stay in Chrome.
+
 ## Setup
 
 ### 1. Install
@@ -153,10 +167,76 @@ browserforce snapshot [n]       # Accessibility tree of tab n
 browserforce screenshot [n]     # Screenshot tab n (PNG to stdout)
 browserforce navigate <url>     # Open URL in a new tab
 browserforce -e "<code>"        # Run Playwright JavaScript (one-shot)
+browserforce plugin list        # List installed plugins
+browserforce plugin install <n> # Install a plugin from the registry
+browserforce plugin remove <n>  # Remove an installed plugin
 ```
 
 Each `-e` command is one-shot — state does not persist between calls. For persistent state, use the MCP server.
 
+## Plugins
+
+Plugins add custom helpers directly into the `execute` tool scope. Install once — your agent calls them like built-in functions.
+
+### Install a plugin
+
+```bash
+browserforce plugin install highlight
+```
+
+That's it. Restart MCP (or Claude Desktop) and `highlight()` is available in every `execute` call.
+
+### Official plugins
+
+| Plugin | What it adds | Install |
+|--------|-------------|---------|
+| `highlight` | `highlight(selector, color?)` — outlines matching elements; `clearHighlights()` — removes them | `browserforce plugin install highlight` |
+
+### Use an installed plugin
+
+After installing `highlight`, your agent can call it directly:
+
+```javascript
+// Outline all buttons in blue
+await highlight('button', 'blue');
+
+// Highlight the specific element you're about to click
+await highlight('[data-testid="submit"]', 'red');
+return await screenshotWithAccessibilityLabels();
+```
+
+The helper receives the active page, context, and state automatically — no plumbing needed.
+
+### Manage plugins
+
+```bash
+browserforce plugin list        # See what's installed
+browserforce plugin remove highlight   # Uninstall
+```
+
+Plugins are stored at `~/.browserforce/plugins/`. Each one is a folder with an `index.js`.
+
+### Write your own
+
+```javascript
+// ~/.browserforce/plugins/my-plugin/index.js
+export default {
+  name: 'my-plugin',
+  helpers: {
+    async scrollToBottom(page, ctx, state) {
+      await page.evaluate(() => window.scrollTo(0, document.body.scrollHeight));
+    },
+    async countLinks(page, ctx, state) {
+      return page.evaluate(() => document.querySelectorAll('a').length);
+    },
+  },
+};
+```
+
+Drop it in `~/.browserforce/plugins/my-plugin/`, restart MCP, and call `await scrollToBottom()` or `await countLinks()` from any `execute` call.
+
+Add a `SKILL.md` file alongside `index.js` and its content is automatically appended to the `execute` tool's description — so your agent knows the helpers exist without you having to explain them every time.
+
 ### Any Playwright Script
 
 ```javascript

From 43f7d553273697c75b207b88e751a0af0994ba8b Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 20:50:46 +0530
Subject: [PATCH 002/192] chore: update package version to 1.0.9 and adjust
 repository URL format

- Bump version from 1.0.8 to 1.0.9
- Change repository URL format to use 'git+' prefix
- Update bin path for browserforce to remove './' prefix
---
 package.json | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/package.json b/package.json
index e4037bf..abc9d59 100644
--- a/package.json
+++ b/package.json
@@ -1,12 +1,12 @@
 {
   "name": "browserforce",
-  "version": "1.0.8",
+  "version": "1.0.9",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",
   "repository": {
     "type": "git",
-    "url": "https://github.com/ivalsaraj/browserforce.git"
+    "url": "git+https://github.com/ivalsaraj/browserforce.git"
   },
   "license": "MIT",
   "keywords": [
@@ -24,7 +24,7 @@
     "node": ">=18.3.0"
   },
   "bin": {
-    "browserforce": "./bin.js"
+    "browserforce": "bin.js"
   },
   "files": [
     "README.md",

From 6ea5ce7f2220e7c147e4be236d53c252f1ea3cf0 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 20:51:07 +0530
Subject: [PATCH 003/192] docs: add comprehensive documentation for
 BrowserForce plugin system

- Introduced a new PLUGINS.md file detailing the plugin architecture, installation process, and usage examples.
- Included sections on various plugin functionalities such as HAR capture, DOM diffing, E2E test recording, and more.
- Provided code snippets for minimal plugin creation and advanced use cases for both developers and automated agents.
---
 docs/PLUGINS.md | 446 ++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 446 insertions(+)
 create mode 100644 docs/PLUGINS.md

diff --git a/docs/PLUGINS.md b/docs/PLUGINS.md
new file mode 100644
index 0000000..2720475
--- /dev/null
+++ b/docs/PLUGINS.md
@@ -0,0 +1,446 @@
+# BrowserForce Plugins
+
+Extend BrowserForce with local JS files — no framework, no build step, no registry.
+
+Plugins live in `~/.browserforce/plugins/`. Each file exports a plain object. The MCP server loads them at startup and merges their helpers, tools, and hooks into the runtime.
+
+**Minimal plugin — 10 lines:**
+
+```js
+// ~/.browserforce/plugins/hello.js
+export default {
+  name: 'hello',
+  helpers: {
+    async greet(page) {
+      const title = await page.title();
+      return `Hello from: ${title}`;
+    }
+  }
+}
+```
+
+After installing, `greet(page)` is available as a global inside every `execute()` call.
+
+---
+
+## How to Install a Plugin
+
+1. Drop a `.js` file in `~/.browserforce/plugins/`
+2. Restart the MCP server
+3. Done — helpers are injected, tools are registered
+
+No config changes. No manifest edits. The directory is auto-scanned on startup.
+
+---
+
+## For Developers
+
+Use cases for people with browser UI access — debugging, testing, and development workflows.
+
+---
+
+### HAR / Network Capture
+
+Record every network request and response during a session. Discover the private APIs powering a site's UI. Debug form submissions that silently fail.
+
+```js
+await startCapture(page);
+await page.click('#submit');
+const har = await stopCapture();
+// har.entries → full request/response log with timings and bodies
+```
+
+---
+
+### DOM Diff
+
+Snapshot the page's DOM before and after an action, then diff them. Know exactly what changed after a form submit, a route transition, or an AJAX update — without guessing.
+
+```js
+await snapshotDOM('before');
+await page.click('#apply-filters');
+await waitForPageLoad();
+const diff = await diffDOM('before', 'after');
+// diff → added/removed/changed nodes
+```
+
+---
+
+### E2E Test Recorder
+
+Every agent action gets recorded as Playwright test code. The agent explores a workflow once — the plugin auto-generates a `.test.js` regression file. Agents leave behind test suites instead of tribal knowledge.
+
+```js
+await startRecording();
+await page.click('#checkout');
+await page.fill('#card-number', '4111111111111111');
+await stopRecording('~/tests/checkout.test.js');
+// checkout.test.js is written to disk, ready to run
+```
+
+---
+
+### Request Interceptor / API Mocker
+
+Return fake data for specific endpoints without touching the backend. Test error states, empty states, and edge cases against a live UI.
+
+```js
+await mockAPI(page, '**/api/products', {
+  status: 200,
+  body: { products: [] }   // empty state
+});
+await page.reload();
+// UI now renders the empty state — no backend change needed
+```
+
+---
+
+### Session State Snapshots
+
+Capture all cookies, localStorage, and sessionStorage under a named key. Restore any state instantly. Test workflows as different user roles without logging out.
+
+```js
+await saveState('admin-logged-in');
+// ... test admin workflows ...
+await restoreState('free-user');
+// now running as free user — zero re-authentication
+```
+
+---
+
+### PDF Export
+
+Export the current page as a PDF. Generate reports, invoices, or documentation directly from browser content — pixel-perfect, with real fonts and styles.
+
+```js
+const buffer = await printBuffer({ format: 'A4', printBackground: true });
+// or write directly to disk:
+await savePDF('~/exports/invoice-2024.pdf');
+```
+
+---
+
+### Clipboard Bridge
+
+Read and write the system clipboard. Bypass sites that block copy-paste. Agents can write extracted data directly to clipboard for the user to paste elsewhere.
+
+```js
+// Write a result to clipboard
+await writeClipboard('Order ID: 98431-B');
+
+// Read what the user copied
+const copied = await readClipboard();
+```
+
+---
+
+## For OpenClaw & Automated Agents
+
+Use cases for headless and non-interactive workflows — AI agents running autonomously, no browser UI required.
+
+---
+
+### Zero Credential Exposure
+
+BrowserForce agents inherit the user's real browser sessions — no passwords, no API keys, no OAuth tokens in config. An `extractBearerToken` helper watches live network traffic and plucks `Authorization` headers, giving agents API access through the existing session. Credentials never leave the browser.
+
+This is the core differentiator vs every other agent tool.
+
+```js
+// In an automated workflow — no credentials configured anywhere
+const token = await extractBearerToken(page, 'api.example.com');
+// token → "Bearer eyJ..." pulled from live browser traffic
+// now usable for direct API calls within the same agent run
+```
+
+---
+
+### Download Capture
+
+Run a callback (e.g. click "Export CSV"), intercept the file download, return the content directly — no temp files, no manual download folder management. Sites that only expose data via download buttons become fully automatable.
+
+```js
+const csv = await captureDownload(async () => {
+  await page.click('#export-csv');
+});
+const rows = csv.split('\n').map(r => r.split(','));
+// process rows directly — no file system involved
+```
+
+---
+
+### Page Monitor
+
+Watch a URL and fire when content changes. Price trackers, job boards, CI dashboards, stock alerts. Long-running monitoring without constant agent polling.
+
+```js
+// MCP tool: monitor_page
+// Or use the helper directly:
+await waitForContentChange(page, '.price-display', { timeout: 3_600_000 });
+// resolves when the element's text changes — up to 1 hour wait
+```
+
+---
+
+### Desktop & Webhook Notifications
+
+System notifications and webhook delivery for long-running agent tasks. "When the product restocks, notify me" becomes a one-liner.
+
+```js
+// Desktop notification
+await notify('Restock Alert', 'Nike Air Max 90 is back in stock');
+
+// MCP tool: send_webhook
+// Or call directly:
+await sendWebhook('https://hooks.slack.com/...', {
+  text: 'Job scrape complete — 47 new listings found'
+});
+```
+
+---
+
+### Multi-Tab Session Orchestration
+
+BrowserForce sees every open tab. Plugins can extract data from one authenticated tab and inject it into another. Cross-tab RPA that no headless tool supports — because headless tools can't access existing logged-in sessions.
+
+```js
+const pages = await context.pages();
+const dashboardPage = pages.find(p => p.url().includes('/dashboard'));
+const data = await dashboardPage.evaluate(() => window.__APP_STATE__);
+
+const reportPage = pages.find(p => p.url().includes('/reports'));
+await reportPage.evaluate((d) => window.loadExternalData(d), data);
+```
+
+---
+
+### File Upload Helper
+
+Handle file inputs cleanly — from disk or from memory. Automate workflows that require uploading documents, images, or generated data without writing temp files.
+
+```js
+// Upload from disk
+await uploadFromDisk(page, '#profile-photo', '~/photos/avatar.png');
+
+// Upload generated content directly from memory
+await uploadFromMemory(page, '#import-csv', csvString, 'import.csv');
+```
+
+---
+
+## Building Your Own Plugin
+
+Full plugin shape — all fields are optional except `name`:
+
+```js
+// ~/.browserforce/plugins/my-plugin.js
+export default {
+  // Required. Must be unique across all plugins.
+  name: 'my-plugin',
+
+  // Runs once when the MCP server starts.
+  // Use for initializing state, opening connections, reading config.
+  async setup({ browser }) {
+    // browser → Playwright Browser instance
+  },
+
+  // Functions injected as globals into every execute() call.
+  // Signature: async (page, ...args) → any
+  helpers: {
+    async myHelper(page, param) {
+      return await page.evaluate((p) => window.someAPI(p), param);
+    }
+  },
+
+  // Standalone MCP tools registered alongside execute/reset/screenshot_with_labels.
+  // Agents can call these directly by name.
+  tools: [{
+    name: 'my_tool',
+    description: 'What this tool does and when to use it.',
+    schema: {
+      param: { type: 'string', description: 'Input value' }
+    },
+    async handler({ param }, { browser, context }) {
+      // browser → Playwright Browser
+      // context → Playwright BrowserContext
+      return {
+        content: [{ type: 'text', text: `Result: ${param}` }]
+      };
+    }
+  }],
+
+  // Playwright browser lifecycle hooks.
+  // Fired automatically — no agent action required.
+  hooks: {
+    onPage:       async (page) => {},           // new page created
+    onNavigation: async (page, url) => {},      // page navigated
+    onRequest:    async (request, page) => {},  // network request fired
+    onResponse:   async (response, page) => {}, // network response received
+  }
+}
+```
+
+
+| Field     | Type                                              | When to use                                                     |
+| --------- | ------------------------------------------------- | --------------------------------------------------------------- |
+| `setup`   | `async ({ browser }) => void`                     | One-time init — open DB connections, load config, warm caches   |
+| `helpers` | `{ name: async (page, ...args) => any }`          | Reusable page utilities injected into `execute()` scope         |
+| `tools`   | `[{ name, description, schema, handler }]`        | Standalone agent-callable MCP tools with their own input schema |
+| `hooks`   | `{ onPage, onNavigation, onRequest, onResponse }` | Passive observers — monitoring, logging, request interception   |
+
+
+---
+
+## Plugin Ecosystem
+
+### Contributing a Plugin
+
+Plugins live in the BrowserForce repo under `plugins/`. To publish one:
+
+1. Fork the repo
+2. Create a folder: `plugins/community/my-plugin/`
+3. Add `index.js` (the plugin code) and `SKILL.md` (AI instructions) inside it
+4. Add an entry to `plugins/registry.json`
+5. Open a PR — official plugins get reviewed and merged to main
+
+That's it. No separate registry service. No npm publishing required.
+
+---
+
+### The Registry
+
+A single JSON file at `plugins/registry.json` in the repo is the source of truth. The Chrome extension and CLI fetch it directly from GitHub's raw content URL — no server, no API.
+
+**Registry entry shape:**
+
+```json
+{
+  "name": "network",
+  "displayName": "HAR / Network Capture",
+  "description": "Record all network requests and responses during a session.",
+  "author": "browserforce",
+  "official": true,
+  "version": "1.0.0",
+  "audience": ["developer"],
+  "capabilities": ["helpers", "hooks"],
+  "file": "plugins/official/network/index.js",
+  "skill": "plugins/official/network/SKILL.md"
+}
+```
+
+
+| Field          | Description                                                         |
+| -------------- | ------------------------------------------------------------------- |
+| `official`     | `true` for BrowserForce-maintained plugins, `false` for community   |
+| `audience`     | `"developer"`, `"headless"`, or both                                |
+| `capabilities` | Which plugin surfaces it uses: `helpers`, `tools`, `hooks`, `setup` |
+| `file`         | Path to `index.js` in the repo — fetched on install                 |
+| `skill`        | Path to `SKILL.md` — fetched on install, injected into AI context   |
+
+
+---
+
+### Chrome Extension — Plugin Directory
+
+The extension popup gains a **Plugins** tab (or opens as a fullscreen options page). It:
+
+1. Fetches `registry.json` from GitHub on open (cached for 10 minutes)
+2. Shows all plugins — official first, community below — with audience tags and capability badges
+3. Marks which ones are currently installed
+4. Install/remove buttons call the relay's plugin API (the extension can't write to disk directly)
+
+**Why the relay is the bridge:**
+Chrome extensions have no filesystem access. The relay runs at `127.0.0.1:19222` and can write to `~/.browserforce/plugins/`. The extension POSTs to the relay; the relay fetches the plugin file from GitHub and writes it to disk.
+
+```
+Extension UI
+    │  POST /plugins/install { name: "network" }
+    ▼
+Relay (127.0.0.1:19222)
+    │  fetches index.js + SKILL.md from GitHub
+    │  writes to ~/.browserforce/plugins/network/
+    ▼
+~/.browserforce/plugins/
+```
+
+**Relay plugin endpoints:**
+
+
+| Method   | Path               | Action                                       |
+| -------- | ------------------ | -------------------------------------------- |
+| `GET`    | `/plugins`         | List installed plugins + their metadata      |
+| `POST`   | `/plugins/install` | Download plugin from registry, write to disk |
+| `DELETE` | `/plugins/:name`   | Remove plugin file from disk                 |
+
+
+Plugins take effect on next MCP server restart (the extension shows a restart prompt).
+
+---
+
+### CLI — For Headless Users
+
+Users without browser UI access manage plugins through the CLI:
+
+```bash
+# List all available plugins from the registry
+browserforce plugin list
+
+# Install a plugin
+browserforce plugin install network
+
+# Install from a local file (for development)
+browserforce plugin install ./my-plugin.js
+
+# Remove a plugin
+browserforce plugin remove network
+
+# Show installed plugins
+browserforce plugin status
+```
+
+`plugin install` fetches the JS directly from GitHub's raw content URL and writes it to `~/.browserforce/plugins/`. Same outcome as the extension UI, different path.
+
+---
+
+### Plugin Directory Structure (in repo)
+
+```
+plugins/
+  registry.json           ← single source of truth
+  official/
+    network/
+      index.js            ← HAR capture plugin code
+      SKILL.md            ← AI instructions for this plugin
+    session/
+      index.js
+      SKILL.md
+    pdf/
+      index.js
+      SKILL.md
+  community/
+    salesforce/           ← community-contributed
+      index.js
+      SKILL.md
+    linear/
+      index.js
+      SKILL.md
+```
+
+Official plugins are maintained by the BrowserForce team. Community plugins are reviewed for safety (no `eval`, no network calls to external servers, no credential exfiltration) before merge.
+
+---
+
+### Security Model
+
+Plugins are arbitrary JS running in Node.js — they have full filesystem and network access. The safety contract is:
+
+- **Official plugins**: reviewed and maintained by BrowserForce
+- **Community plugins**: reviewed before merge (same bar as official)
+- **Local plugins**: `~/.browserforce/plugins/*.js` — user's own files, not from the registry, fully trusted
+
+The relay install endpoint only fetches from the known GitHub repo URL — no arbitrary URLs. The extension UI only shows registry plugins. Users who want to run untrusted code drop files manually into the plugins folder.
+
+No sandboxing beyond that. Plugins are as trusted as any npm package you install.
+
+---
+

From 33b06848c64ea2610b45040413fd2fe229342287 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 21:14:55 +0530
Subject: [PATCH 004/192] docs: add comprehensive guide for building
 BrowserForce plugins

- Introduced a new `BUILDING_PLUGINS.md` file detailing the process of creating, testing, and submitting plugins.
- Included step-by-step instructions for building a sample highlight plugin, along with code snippets for helper functions and usage examples.
- Explained different plugin surfaces and the importance of the SKILL.md companion file for plugin context.
---
 docs/BUILDING_PLUGINS.md | 477 +++++++++++++++++++++++++++++++++++++++
 1 file changed, 477 insertions(+)
 create mode 100644 docs/BUILDING_PLUGINS.md

diff --git a/docs/BUILDING_PLUGINS.md b/docs/BUILDING_PLUGINS.md
new file mode 100644
index 0000000..a8c5be4
--- /dev/null
+++ b/docs/BUILDING_PLUGINS.md
@@ -0,0 +1,477 @@
+# Building BrowserForce Plugins
+
+Adding a plugin extends BrowserForce for yourself or the whole community. Personal plugins stay in `~/.browserforce/plugins/` and are never shared unless you choose to. Public plugins get reviewed and merged into the repo, appearing in the plugin directory for anyone to install.
+
+This guide walks through everything: building, testing, and submitting a plugin.
+
+---
+
+## 1. Build Your First Plugin
+
+### Step 1 — Create the folder
+
+```bash
+mkdir -p ~/.browserforce/plugins/highlight
+touch ~/.browserforce/plugins/highlight/index.js
+touch ~/.browserforce/plugins/highlight/SKILL.md
+```
+
+### Step 2 — Write the export
+
+Start with just `name` and one helper. Here is a complete `highlight.js` plugin that visually highlights any element on the page:
+
+```js
+// ~/.browserforce/plugins/highlight/index.js
+
+export default {
+  name: 'highlight',
+
+  helpers: {
+    /**
+     * Visually highlight a DOM element by selector.
+     *
+     * @param {import('playwright').Page} page
+     * @param {string} selector  - CSS selector for the element to highlight
+     * @param {string} [color]   - CSS color value (default: '#ff0' — yellow)
+     * @param {number} [duration] - ms to hold the highlight (0 = permanent, default: 2000)
+     * @returns {Promise<{ found: boolean, selector: string }>}
+     */
+    async highlight(page, selector, color = '#ff0', duration = 2000) {
+      const found = await page.evaluate(
+        ({ sel, col, dur }) => {
+          const el = document.querySelector(sel);
+          if (!el) return false;
+
+          const prev = el.style.cssText;
+          el.style.outline = `3px solid ${col}`;
+          el.style.backgroundColor = col;
+          el.style.transition = 'outline 0.1s, background-color 0.1s';
+          el.scrollIntoView({ behavior: 'smooth', block: 'center' });
+
+          if (dur > 0) {
+            setTimeout(() => {
+              el.style.cssText = prev;
+            }, dur);
+          }
+
+          return true;
+        },
+        { sel: selector, col: color, dur: duration }
+      );
+
+      return { found, selector };
+    },
+
+    /**
+     * Clear all highlights applied by this plugin.
+     *
+     * @param {import('playwright').Page} page
+     */
+    async clearHighlights(page) {
+      await page.evaluate(() => {
+        document.querySelectorAll('[data-bf-highlighted]').forEach(el => {
+          el.removeAttribute('style');
+          el.removeAttribute('data-bf-highlighted');
+        });
+      });
+    }
+  }
+};
+```
+
+### Step 3 — Restart the MCP server
+
+Plugins are loaded at startup. Kill and restart the MCP server after dropping a new file:
+
+```bash
+# If using Claude Desktop, restart it.
+# If running manually:
+pnpm mcp
+```
+
+### Step 4 — Call the helper from execute
+
+Once loaded, `highlight` and `clearHighlights` are available as globals inside every `execute()` call:
+
+```js
+// In an execute() block:
+const result = await highlight(page, 'button[type="submit"]', '#f90', 3000);
+if (!result.found) return 'Element not found';
+return `Highlighted: ${result.selector}`;
+```
+
+```js
+// Highlight multiple elements:
+await highlight(page, 'h1', '#0ff', 0);          // permanent cyan on heading
+await highlight(page, '.price', '#f0f', 0);       // permanent magenta on price
+```
+
+### Step 5 — Write a SKILL.md companion
+
+See [Section 4](#4-the-skillmd-companion) for what to include.
+
+### Step 6 — Submit as a PR (optional)
+
+See [Section 8](#8-submitting-a-plugin-pr-checklist) for the full checklist.
+
+---
+
+## 2. Choosing the Right Surface
+
+Every plugin capability maps to one of four surfaces. Pick the one that matches how the capability will be used.
+
+### `helpers` — page utilities called from `execute()`
+
+Use when the capability needs to compose with other execute code inline — extracting data, manipulating the DOM, reading state. The agent writes a script that calls your helper as a function and uses the return value immediately.
+
+```js
+helpers: {
+  async extractTableData(page, tableSelector) {
+    return page.evaluate((sel) => {
+      const rows = [...document.querySelectorAll(`${sel} tr`)];
+      return rows.map(row =>
+        [...row.querySelectorAll('td,th')].map(cell => cell.innerText.trim())
+      );
+    }, tableSelector);
+  }
+}
+```
+
+Called from execute:
+```js
+const data = await extractTableData(page, '#results-table');
+return JSON.stringify(data);
+```
+
+### `tools` — standalone MCP tools with their own schema
+
+Use when the capability stands alone and the AI should invoke it directly by name, not compose it inside a script. PDF export, sending a notification, or fetching data from an external system are good fits. Tools return the MCP content format directly.
+
+```js
+tools: [{
+  name: 'export_pdf',
+  description: 'Export the current page as a PDF file. Use when the user wants to save or share a page.',
+  schema: {
+    path: { type: 'string', description: 'Output file path, e.g. ~/exports/report.pdf' }
+  },
+  async handler({ path }, { browser, context }) {
+    const pages = context.pages();
+    const page = pages[pages.length - 1];
+    const resolvedPath = path.replace('~', process.env.HOME);
+    await page.pdf({ path: resolvedPath, format: 'A4', printBackground: true });
+    return { content: [{ type: 'text', text: `PDF saved to ${resolvedPath}` }] };
+  }
+}]
+```
+
+### `hooks` — passive browser lifecycle observers
+
+Use when you need to react to browser events without any agent action triggering them. Logging all navigations, capturing every network request, or building a HAR store automatically are hook use cases.
+
+```js
+hooks: {
+  onNavigation: async (page, url) => {
+    console.error(`[nav] ${url}`);
+  },
+  onRequest: async (request, page) => {
+    // fires for every network request — keep processing minimal
+    if (request.url().includes('/api/')) {
+      store.push({ url: request.url(), method: request.method() });
+    }
+  }
+}
+```
+
+> `onRequest` and `onResponse` fire for every network event on every page. Keep hook handlers fast. Anything slow here slows the whole browser.
+
+### `setup` — one-time init at MCP server startup
+
+Use when multiple helpers share state that needs to be initialized before any of them run: opening a database connection, creating an in-memory HAR store, loading a config file.
+
+```js
+let harStore = null;
+
+export default {
+  name: 'network',
+  async setup({ browser }) {
+    harStore = { entries: [], startedAt: Date.now() };
+  },
+  helpers: {
+    async startCapture(page) { harStore.capturing = true; },
+    async stopCapture(page) {
+      harStore.capturing = false;
+      return harStore;
+    }
+  }
+};
+```
+
+---
+
+## 3. The SKILL.md Companion
+
+Every plugin should ship a `SKILL.md` alongside the `.js` file. This file is read by the AI agent at startup. It tells the agent when to use the plugin, when not to, and how to call it correctly. Without it, the agent has no context for the plugin's capabilities.
+
+**Required sections:**
+
+```markdown
+# highlight plugin
+
+Use `highlight(page, selector, color, duration)` / `clearHighlights(page)` when you need to:
+- Visually mark an element for debugging or demonstration
+- Show a user which element the agent is about to interact with
+- Annotate a screenshot for reporting
+
+## When NOT to use this
+- Don't highlight before taking a screenshot if you need the original unmodified view
+- Don't leave permanent highlights (duration: 0) unless intentional — they persist across agent turns
+
+## Parameters
+- `selector` — any valid CSS selector
+- `color` — any CSS color value: `'#f90'`, `'red'`, `'rgba(255,0,0,0.3)'`
+- `duration` — milliseconds to hold the highlight; `0` = permanent until `clearHighlights()`
+
+## Example
+\`\`\`js
+// Highlight the submit button in orange for 3 seconds
+const { found } = await highlight(page, 'button[type="submit"]', '#f90', 3000);
+if (!found) return 'Submit button not found on this page';
+\`\`\`
+
+## Common mistakes
+- Calling `highlight` on a selector that matches zero elements — always check `result.found`
+- Forgetting to `clearHighlights()` before capturing a clean screenshot
+```
+
+---
+
+## 4. Rules — What's Not Allowed
+
+The following will cause a PR to be rejected without review.
+
+### Code quality
+
+- No obfuscated code — all plugin code must be readable line by line
+- No minified code — even if it's a build output, submit the readable source
+- No transpiled-only output — submit the original source, not compiled JS
+- No code that requires a build step to understand or modify
+
+### Security
+
+- No network requests to external servers — plugins run locally and must stay local
+- No `eval()`, `new Function(string)`, or any dynamic execution of remotely sourced strings
+- No credential harvesting — never read, log, store, or transmit passwords, tokens, session cookies, or API keys to anything outside the browser context
+- No shell execution (`child_process.exec`, `execSync`, `spawn`) unless the plugin is explicitly a local system integration and the shell command is hardcoded and clearly documented
+- No writing to paths outside `~/.browserforce/` without explicit user configuration
+
+### Behavior
+
+- No modifying BrowserForce's own runtime state or files
+- No overriding built-in helpers: `snapshot`, `waitForPageLoad`, `getLogs`, `clearLogs`
+- No relying on undocumented BrowserForce internals — only use the API surfaces defined in this guide
+
+---
+
+## 5. Best Practices
+
+**Single responsibility.** One plugin, one concern. Don't bundle 10 unrelated helpers into one file. If it needs its own README section, it needs its own plugin.
+
+**Name helpers specifically.** Helper names become globals in `execute()`. Use descriptive names that won't collide with built-ins or other plugins:
+
+| Bad | Good |
+|-----|------|
+| `capture` | `captureHAR` |
+| `save` | `saveSessionState` |
+| `extract` | `extractTableData` |
+
+**Handle errors and return useful values.** Wrap page interactions in try/catch. Return a summary the agent can act on — don't return `undefined` when you could return `{ found: false, reason: 'selector matched 0 elements' }`.
+
+```js
+// Bad
+async highlight(page, selector) {
+  await page.evaluate((sel) => {
+    document.querySelector(sel).style.outline = '3px solid red';
+  }, selector);
+}
+
+// Good
+async highlight(page, selector) {
+  try {
+    const found = await page.evaluate((sel) => {
+      const el = document.querySelector(sel);
+      if (!el) return false;
+      el.style.outline = '3px solid red';
+      return true;
+    }, selector);
+    return { found, selector };
+  } catch (err) {
+    return { found: false, selector, error: err.message };
+  }
+}
+```
+
+**Write MCP tool descriptions the AI can understand.** Vague descriptions produce wrong tool calls.
+
+| Bad | Good |
+|-----|------|
+| `"Exports the page"` | `"Export the current page as a PDF. Use when the user asks to save or share the page as a document."` |
+
+**Keep hooks lightweight.** `onRequest` and `onResponse` fire for every network event. Do not run async calls, DOM access, or heavy processing inside them. Accumulate to an in-memory array; process it in a helper when the agent asks.
+
+**Use `setup()` for shared state.** If multiple helpers share a data store, initialize it once in `setup()` and close over it. Module-level mutable globals can leak state across tool invocations.
+
+**Test against a real browser.** Plugins interact with a live Chrome session. Integration test on real pages, not mocks.
+
+---
+
+## 6. Testing Your Plugin
+
+Three levels before submitting.
+
+### Level 1 — Smoke test (required)
+
+Install locally, restart the MCP server, run a minimal `execute()` call:
+
+```js
+// Minimal smoke test
+return await highlight(page, 'body', '#ff0', 1000);
+```
+
+Verify: no crash, no uncaught exception, return value looks correct.
+
+### Level 2 — Real-world test (required)
+
+Run against at least one real website for each helper and tool the plugin exposes. Document what you ran and what came back in your PR description under `## Test Results`.
+
+Example test results entry:
+
+```
+highlight(page, 'h1', '#f90', 2000) on https://example.com
+→ { found: true, selector: 'h1' }
+Element glowed orange for 2s, reverted cleanly.
+
+highlight(page, '.nonexistent', '#f90') on https://example.com
+→ { found: false, selector: '.nonexistent' }
+No crash, correct not-found response.
+```
+
+### Level 3 — Data correctness (for helpers that extract or transform data)
+
+If your plugin extracts or transforms data, verify 2-3 representative cases: input page state → expected helper output. These can be manual — you are not required to write automated tests, but you must have confirmed the output is correct and stable before submitting.
+
+---
+
+## 7. Submitting a Plugin (PR Checklist)
+
+Before opening a PR, verify all of the following:
+
+- [ ] Plugin folder created at `plugins/community/your-plugin/`
+- [ ] `index.js` and `SKILL.md` both present inside that folder
+- [ ] Code is readable — no minification, no obfuscation
+- [ ] `registry.json` entry added with all required fields (see format below)
+- [ ] Plugin tested against at least one real website per helper/tool
+- [ ] No external network calls
+- [ ] No `eval()` or dynamic code execution
+- [ ] No credentials or secrets in code or comments
+- [ ] Helper names are specific enough to avoid collisions
+- [ ] PR description includes a `## Test Results` section with actual output
+
+### registry.json entry format
+
+```json
+{
+  "name": "highlight",
+  "displayName": "Element Highlighter",
+  "description": "Visually highlight DOM elements with a colored outline. Useful for debugging, demonstration, and annotated screenshots.",
+  "author": "your-github-handle",
+  "official": false,
+  "version": "1.0.0",
+  "audience": ["developer"],
+  "capabilities": ["helpers"],
+  "file": "plugins/community/highlight.js",
+  "readme": "plugins/community/highlight.md"
+}
+```
+
+---
+
+## 8. Plugin Versioning
+
+The registry references versioned releases, not the `main` branch directly.
+
+When updating an existing plugin:
+
+1. Bump `version` in `registry.json` (follow semver)
+2. Existing installs do not auto-update — users re-install to get the new version
+3. Breaking changes to helper signatures (renamed params, changed return shape) warrant a **major version bump**
+4. Add a `## Migration` section to `SKILL.md` for any breaking change
+
+```markdown
+## Migration — v1 → v2
+
+`highlight(page, selector, color)` now returns `{ found, selector }` instead of a boolean.
+
+Before:
+\`\`\`js
+const ok = await highlight(page, 'h1', '#f90');
+\`\`\`
+
+After:
+\`\`\`js
+const { found } = await highlight(page, 'h1', '#f90');
+\`\`\`
+```
+
+---
+
+## Full Plugin Shape Reference
+
+```js
+// ~/.browserforce/plugins/my-plugin.js
+
+export default {
+  // Required. Unique across all plugins.
+  name: 'my-plugin',
+
+  // One-time init when the MCP server starts.
+  async setup({ browser }) {
+    // browser → Playwright Browser instance
+  },
+
+  // Page utilities injected as globals into every execute() call.
+  // First argument is always `page`. Return values are available to the agent.
+  helpers: {
+    async myHelper(page, param) {
+      return await page.evaluate((p) => window.someAPI(p), param);
+    }
+  },
+
+  // Standalone MCP tools. Agents call these by name, not from execute().
+  tools: [{
+    name: 'my_tool',
+    description: 'What this tool does and when the agent should call it.',
+    schema: {
+      param: { type: 'string', description: 'Input value' }
+    },
+    async handler({ param }, { browser, context }) {
+      // Must return MCP content format.
+      return { content: [{ type: 'text', text: `Result: ${param}` }] };
+    }
+  }],
+
+  // Passive browser lifecycle observers. No agent trigger required.
+  hooks: {
+    onPage:       async (page) => {},            // new page created
+    onNavigation: async (page, url) => {},       // page navigated
+    onRequest:    async (request, page) => {},   // network request sent
+    onResponse:   async (response, page) => {},  // network response received
+  }
+};
+```
+
+| Surface | Receives | Returns | When to use |
+|---------|----------|---------|-------------|
+| `setup` | `{ browser }` | void | One-time init: open connections, warm state |
+| `helpers` | `(page, ...args)` | any | Inline page utilities composed inside `execute()` |
+| `tools` | `(params, { browser, context })` | MCP content | Standalone agent-callable actions with own schema |
+| `hooks` | varies by hook | void | Passive observers — logging, monitoring, interception |

From 8a1b7a590ba5143f3255a174252896008f8358ba Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 22:12:00 +0530
Subject: [PATCH 005/192] fix(plugins): correct registry URL to point at
 ivalsaraj/browserforce
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The highlight plugin url/skill_url pointed to the non-existent org
browserforce/plugins. Updated to the actual file location:
  ivalsaraj/browserforce/main/plugins/official/highlight/

Also removed the now-dead BASE_RAW constant from both installers —
it was a leftover from before the schema change (entry.url replaced
the old BASE_RAW + entry.file pattern).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 mcp/src/plugin-installer.js    | 1 -
 plugins/registry.json          | 4 ++--
 relay/src/plugin-installer.cjs | 1 -
 3 files changed, 2 insertions(+), 4 deletions(-)

diff --git a/mcp/src/plugin-installer.js b/mcp/src/plugin-installer.js
index 517abc7..647f998 100644
--- a/mcp/src/plugin-installer.js
+++ b/mcp/src/plugin-installer.js
@@ -4,7 +4,6 @@ import { createHash } from 'node:crypto';
 import https from 'node:https';
 
 const REGISTRY_URL = 'https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/registry.json';
-const BASE_RAW = 'https://raw.githubusercontent.com/ivalsaraj/browserforce/main/';
 
 function httpsGetRaw(url) {
   return new Promise((resolve, reject) => {
diff --git a/plugins/registry.json b/plugins/registry.json
index 8370cc1..ec5a90e 100644
--- a/plugins/registry.json
+++ b/plugins/registry.json
@@ -5,8 +5,8 @@
       "name": "highlight",
       "description": "Highlight elements on the page using CSS outline",
       "version": "1.0.0",
-      "url": "https://raw.githubusercontent.com/browserforce/plugins/main/highlight/index.js",
-      "skill_url": "https://raw.githubusercontent.com/browserforce/plugins/main/highlight/SKILL.md",
+      "url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/official/highlight/index.js",
+      "skill_url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/official/highlight/SKILL.md",
       "sha256": "d302bd9a0f6e96bd0c7a8666b560e01ab88f9f9e4c4694f14d97019f4cc04424"
     }
   ]
diff --git a/relay/src/plugin-installer.cjs b/relay/src/plugin-installer.cjs
index 084a5b3..bd7d51d 100644
--- a/relay/src/plugin-installer.cjs
+++ b/relay/src/plugin-installer.cjs
@@ -5,7 +5,6 @@ const crypto = require('node:crypto');
 const https = require('node:https');
 
 const REGISTRY_URL = 'https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/registry.json';
-const BASE_RAW = 'https://raw.githubusercontent.com/ivalsaraj/browserforce/main/';
 
 function httpsGetRaw(url) {
   return new Promise((resolve, reject) => {

From 40efa360ccb4dbcd89fe69f8abdfddcd1c31f956 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 22:32:20 +0530
Subject: [PATCH 006/192] feat: add non-blocking update notifier to CLI

Checks registry.npmjs.org/browserforce/latest once per day (cached
in ~/.browserforce/update-check.json). Runs async in background
alongside the command; shows a one-liner to stderr after completion
if a newer version is available. Skipped for long-running serve/mcp
commands. Zero new dependencies.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 bin.js | 69 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 69 insertions(+)

diff --git a/bin.js b/bin.js
index c3be658..c2c00a6 100644
--- a/bin.js
+++ b/bin.js
@@ -3,6 +3,7 @@
 
 import { parseArgs } from 'node:util';
 import http from 'node:http';
+import https from 'node:https';
 
 const { values, positionals } = parseArgs({
   options: {
@@ -68,6 +69,61 @@ function httpFetch(method, url, body, authToken) {
   });
 }
 
+// ─── Update Notifier ────────────────────────────────────────────────────────
+
+function semverGt(a, b) {
+  const pa = a.split('.').map(Number);
+  const pb = b.split('.').map(Number);
+  for (let i = 0; i < 3; i++) {
+    if ((pa[i] || 0) > (pb[i] || 0)) return true;
+    if ((pa[i] || 0) < (pb[i] || 0)) return false;
+  }
+  return false;
+}
+
+async function checkForUpdate() {
+  try {
+    const { readFileSync, writeFileSync, mkdirSync } = await import('node:fs');
+    const { join } = await import('node:path');
+    const { homedir } = await import('node:os');
+
+    const current = JSON.parse(readFileSync(new URL('./package.json', import.meta.url).pathname, 'utf8')).version;
+    const cacheDir = join(homedir(), '.browserforce');
+    const cacheFile = join(cacheDir, 'update-check.json');
+
+    // Return cached result if fresh (< 24 h)
+    try {
+      const cached = JSON.parse(readFileSync(cacheFile, 'utf8'));
+      if (Date.now() - cached.checkedAt < 86_400_000) {
+        return semverGt(cached.latest, current) ? { current, latest: cached.latest } : null;
+      }
+    } catch { /* no cache yet */ }
+
+    // Fetch latest version from npm registry
+    const latest = await new Promise((resolve, reject) => {
+      const req = https.get(
+        'https://registry.npmjs.org/browserforce/latest',
+        { headers: { 'User-Agent': 'browserforce-cli' } },
+        (res) => {
+          if (res.statusCode !== 200) { res.resume(); return reject(new Error(`HTTP ${res.statusCode}`)); }
+          let data = '';
+          res.on('data', d => (data += d));
+          res.on('end', () => { try { resolve(JSON.parse(data).version); } catch { reject(new Error('parse error')); } });
+        },
+      );
+      req.on('error', reject);
+      req.setTimeout(5000, () => { req.destroy(); reject(new Error('timeout')); });
+    });
+
+    // Persist cache
+    try { mkdirSync(cacheDir, { recursive: true }); writeFileSync(cacheFile, JSON.stringify({ checkedAt: Date.now(), latest })); } catch { /* ignore */ }
+
+    return semverGt(latest, current) ? { current, latest } : null;
+  } catch {
+    return null;
+  }
+}
+
 async function connectBrowser() {
   const { getCdpUrl, ensureRelay } = await import('./mcp/src/exec-engine.js');
   await ensureRelay();
@@ -356,9 +412,22 @@ if (!handler) {
   process.exit(1);
 }
 
+// Start update check in background — skipped for long-running commands
+const updatePromise = (command !== 'serve' && command !== 'mcp')
+  ? checkForUpdate()
+  : null;
+
 try {
   await handler();
 } catch (err) {
   console.error(`Error: ${err.message}`);
   process.exit(1);
 }
+
+// Show update notice after command finishes (wait at most 500 ms)
+if (updatePromise) {
+  const update = await Promise.race([updatePromise, new Promise(r => setTimeout(r, 500, null))]);
+  if (update) {
+    process.stderr.write(`\n  Update available: ${update.current} → ${update.latest}\n  Run: npm install -g browserforce\n\n`);
+  }
+}

From 8685fd875af6bcbeff21173b9dd9f74269da397f Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 22:37:24 +0530
Subject: [PATCH 007/192] feat: agent-visible update notices via MCP +
 browserforce update command

Extract update check logic to mcp/src/update-check.js (shared module).

MCP server: fires update check at startup, injects a one-line notice
into the first execute tool response when a newer version is on npm.
The agent surfaces this to the user who can say "update browserforce".

CLI: add `browserforce update` command (runs npm install -g browserforce)
so the agent has a single command to trigger the upgrade.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 bin.js                  | 82 ++++++++++++-----------------------------
 mcp/src/index.js        | 15 ++++++++
 mcp/src/update-check.js | 65 ++++++++++++++++++++++++++++++++
 3 files changed, 103 insertions(+), 59 deletions(-)
 create mode 100644 mcp/src/update-check.js

diff --git a/bin.js b/bin.js
index c2c00a6..c9bb38a 100644
--- a/bin.js
+++ b/bin.js
@@ -3,7 +3,7 @@
 
 import { parseArgs } from 'node:util';
 import http from 'node:http';
-import https from 'node:https';
+import { checkForUpdate } from './mcp/src/update-check.js';
 
 const { values, positionals } = parseArgs({
   options: {
@@ -69,61 +69,6 @@ function httpFetch(method, url, body, authToken) {
   });
 }
 
-// ─── Update Notifier ────────────────────────────────────────────────────────
-
-function semverGt(a, b) {
-  const pa = a.split('.').map(Number);
-  const pb = b.split('.').map(Number);
-  for (let i = 0; i < 3; i++) {
-    if ((pa[i] || 0) > (pb[i] || 0)) return true;
-    if ((pa[i] || 0) < (pb[i] || 0)) return false;
-  }
-  return false;
-}
-
-async function checkForUpdate() {
-  try {
-    const { readFileSync, writeFileSync, mkdirSync } = await import('node:fs');
-    const { join } = await import('node:path');
-    const { homedir } = await import('node:os');
-
-    const current = JSON.parse(readFileSync(new URL('./package.json', import.meta.url).pathname, 'utf8')).version;
-    const cacheDir = join(homedir(), '.browserforce');
-    const cacheFile = join(cacheDir, 'update-check.json');
-
-    // Return cached result if fresh (< 24 h)
-    try {
-      const cached = JSON.parse(readFileSync(cacheFile, 'utf8'));
-      if (Date.now() - cached.checkedAt < 86_400_000) {
-        return semverGt(cached.latest, current) ? { current, latest: cached.latest } : null;
-      }
-    } catch { /* no cache yet */ }
-
-    // Fetch latest version from npm registry
-    const latest = await new Promise((resolve, reject) => {
-      const req = https.get(
-        'https://registry.npmjs.org/browserforce/latest',
-        { headers: { 'User-Agent': 'browserforce-cli' } },
-        (res) => {
-          if (res.statusCode !== 200) { res.resume(); return reject(new Error(`HTTP ${res.statusCode}`)); }
-          let data = '';
-          res.on('data', d => (data += d));
-          res.on('end', () => { try { resolve(JSON.parse(data).version); } catch { reject(new Error('parse error')); } });
-        },
-      );
-      req.on('error', reject);
-      req.setTimeout(5000, () => { req.destroy(); reject(new Error('timeout')); });
-    });
-
-    // Persist cache
-    try { mkdirSync(cacheDir, { recursive: true }); writeFileSync(cacheFile, JSON.stringify({ checkedAt: Date.now(), latest })); } catch { /* ignore */ }
-
-    return semverGt(latest, current) ? { current, latest } : null;
-  } catch {
-    return null;
-  }
-}
-
 async function connectBrowser() {
   const { getCdpUrl, ensureRelay } = await import('./mcp/src/exec-engine.js');
   await ensureRelay();
@@ -360,6 +305,23 @@ async function cmdPlugin() {
   process.exit(1);
 }
 
+async function cmdUpdate() {
+  const { spawnSync } = await import('node:child_process');
+  console.log('Checking for updates...');
+  const update = await checkForUpdate();
+  if (!update) {
+    console.log('Already up to date.');
+    return;
+  }
+  console.log(`Updating ${update.current} → ${update.latest}...`);
+  const result = spawnSync('npm', ['install', '-g', 'browserforce'], { stdio: 'inherit' });
+  if (result.status !== 0) {
+    console.error('Update failed. Run manually: npm install -g browserforce');
+    process.exit(1);
+  }
+  console.log(`Updated to ${update.latest}.`);
+}
+
 function cmdHelp() {
   console.log(`
   BrowserForce — Give AI agents your real Chrome browser
@@ -375,6 +337,7 @@ function cmdHelp() {
     browserforce plugin list        List installed plugins
     browserforce plugin install <n> Install a plugin from the registry
     browserforce plugin remove <n>  Remove an installed plugin
+    browserforce update             Update to the latest version
     browserforce -e "<code>"        Execute Playwright JavaScript (one-shot)
 
   Options:
@@ -387,6 +350,7 @@ function cmdHelp() {
     browserforce tabs
     browserforce plugin list
     browserforce plugin install highlight
+    browserforce update
     browserforce -e "return await snapshot()"
     browserforce -e "await page.goto('https://github.com'); return await snapshot()"
     browserforce screenshot 0 > page.png
@@ -402,7 +366,7 @@ function cmdHelp() {
 const commands = {
   serve: cmdServe, mcp: cmdMcp, status: cmdStatus, tabs: cmdTabs,
   screenshot: cmdScreenshot, snapshot: cmdSnapshot, navigate: cmdNavigate,
-  execute: cmdExecute, plugin: cmdPlugin, help: cmdHelp,
+  execute: cmdExecute, plugin: cmdPlugin, update: cmdUpdate, help: cmdHelp,
 };
 
 const handler = commands[command];
@@ -412,8 +376,8 @@ if (!handler) {
   process.exit(1);
 }
 
-// Start update check in background — skipped for long-running commands
-const updatePromise = (command !== 'serve' && command !== 'mcp')
+// Start update check in background — skipped for long-running or self-update commands
+const updatePromise = (command !== 'serve' && command !== 'mcp' && command !== 'update')
   ? checkForUpdate()
   : null;
 
diff --git a/mcp/src/index.js b/mcp/src/index.js
index 5bb01e0..ad06862 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -11,6 +11,7 @@ import {
 } from './exec-engine.js';
 import { screenshotWithLabels } from './a11y-labels.js';
 import { loadPlugins, buildPluginHelpers, buildPluginSkillAppendix } from './plugin-loader.js';
+import { checkForUpdate } from './update-check.js';
 
 // ─── Console Log Capture ─────────────────────────────────────────────────────
 
@@ -107,6 +108,12 @@ let userState = {};
 let plugins = [];
 let pluginHelpers = {};
 
+// ─── Update State ────────────────────────────────────────────────────────────
+// Checked once at startup; notice injected into first execute response only.
+
+let pendingUpdate = null;    // { current, latest } or null
+let updateNoticeSent = false;
+
 // ─── MCP Server ──────────────────────────────────────────────────────────────
 
 const server = new McpServer({
@@ -319,6 +326,11 @@ function registerExecuteTool(skillAppendix = '') {
       try {
         const result = await runCode(code, execCtx, timeout);
         const formatted = formatResult(result);
+        // Inject update notice into the first text response of the session (once only)
+        if (pendingUpdate && !updateNoticeSent && formatted.type === 'text') {
+          updateNoticeSent = true;
+          formatted.text += `\n\n[BrowserForce update available: ${pendingUpdate.current} → ${pendingUpdate.latest}]\n[Run: browserforce update   or: npm install -g browserforce]`;
+        }
         return { content: [formatted] };
       } catch (err) {
         const isTimeout = err instanceof CodeExecutionTimeoutError;
@@ -453,6 +465,9 @@ async function main() {
   await initPlugins();
   registerExecuteTool(buildPluginSkillAppendix(plugins));
 
+  // Fire update check in background — result stored in pendingUpdate for execute handler
+  checkForUpdate().then(info => { pendingUpdate = info; }).catch(() => {});
+
   try {
     await ensureBrowser();
     process.stderr.write('[bf-mcp] Connected to relay\n');
diff --git a/mcp/src/update-check.js b/mcp/src/update-check.js
new file mode 100644
index 0000000..2815f7d
--- /dev/null
+++ b/mcp/src/update-check.js
@@ -0,0 +1,65 @@
+import https from 'node:https';
+import { readFileSync, writeFileSync, mkdirSync } from 'node:fs';
+import { join } from 'node:path';
+import { homedir } from 'node:os';
+
+export function semverGt(a, b) {
+  const pa = a.split('.').map(Number);
+  const pb = b.split('.').map(Number);
+  for (let i = 0; i < 3; i++) {
+    if ((pa[i] || 0) > (pb[i] || 0)) return true;
+    if ((pa[i] || 0) < (pb[i] || 0)) return false;
+  }
+  return false;
+}
+
+/**
+ * Check npm registry for a newer version of browserforce.
+ * Result is cached for 24 h in ~/.browserforce/update-check.json.
+ * Returns { current, latest } if an update is available, otherwise null.
+ * Never throws — all errors resolve to null.
+ */
+export async function checkForUpdate() {
+  try {
+    // package.json is two levels up from mcp/src/
+    const pkgPath = new URL('../../package.json', import.meta.url).pathname;
+    const current = JSON.parse(readFileSync(pkgPath, 'utf8')).version;
+
+    const cacheDir = join(homedir(), '.browserforce');
+    const cacheFile = join(cacheDir, 'update-check.json');
+
+    // Return cached result if still fresh (< 24 h)
+    try {
+      const cached = JSON.parse(readFileSync(cacheFile, 'utf8'));
+      if (Date.now() - cached.checkedAt < 86_400_000) {
+        return semverGt(cached.latest, current) ? { current, latest: cached.latest } : null;
+      }
+    } catch { /* no cache yet, or invalid */ }
+
+    // Fetch latest from npm registry
+    const latest = await new Promise((resolve, reject) => {
+      const req = https.get(
+        'https://registry.npmjs.org/browserforce/latest',
+        { headers: { 'User-Agent': 'browserforce-cli' } },
+        (res) => {
+          if (res.statusCode !== 200) { res.resume(); return reject(new Error(`HTTP ${res.statusCode}`)); }
+          let data = '';
+          res.on('data', d => (data += d));
+          res.on('end', () => { try { resolve(JSON.parse(data).version); } catch { reject(new Error('parse error')); } });
+        },
+      );
+      req.on('error', reject);
+      req.setTimeout(5000, () => { req.destroy(); reject(new Error('timeout')); });
+    });
+
+    // Persist to cache
+    try {
+      mkdirSync(cacheDir, { recursive: true });
+      writeFileSync(cacheFile, JSON.stringify({ checkedAt: Date.now(), latest }));
+    } catch { /* ignore cache write errors */ }
+
+    return semverGt(latest, current) ? { current, latest } : null;
+  } catch {
+    return null;
+  }
+}

From 5b7cdd1d68aefb969a276a333237483631bbe180 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 22:55:04 +0530
Subject: [PATCH 008/192] fix: correct path handling, update-check errors, and
 plugin docs
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

- Use fileURLToPath() instead of .pathname on file: URLs in
  update-check.js and bin.js — fixes silent failures on paths with
  spaces or on Windows
- checkForUpdate() now propagates network/parse errors instead of
  swallowing them as null; cmdUpdate catches and reports failures
  instead of printing "Already up to date." on offline/DNS errors
- Execute tool update banner is now a second content[] item rather
  than mutating the first result's text — prevents corrupting
  structured output parsed by callers
- Fix BUILDING_PLUGINS.md registry example: replace stale file/readme
  fields with the actual installer contract (url, sha256, skill_url)
- Bump version to 1.0.10

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 bin.js                   | 11 +++++--
 docs/BUILDING_PLUGINS.md | 10 ++++--
 mcp/src/index.js         |  7 ++--
 mcp/src/update-check.js  | 71 +++++++++++++++++++---------------------
 package.json             |  2 +-
 5 files changed, 56 insertions(+), 45 deletions(-)

diff --git a/bin.js b/bin.js
index c9bb38a..6e723a0 100644
--- a/bin.js
+++ b/bin.js
@@ -3,6 +3,7 @@
 
 import { parseArgs } from 'node:util';
 import http from 'node:http';
+import { fileURLToPath } from 'node:url';
 import { checkForUpdate } from './mcp/src/update-check.js';
 
 const { values, positionals } = parseArgs({
@@ -75,7 +76,7 @@ async function connectBrowser() {
   // playwright-core lives in mcp/node_modules (pnpm workspace sub-package).
   // Use createRequire from the mcp package context to locate it, then dynamic-import.
   const { createRequire } = await import('node:module');
-  const mReq = createRequire(new URL('./mcp/src/exec-engine.js', import.meta.url).pathname);
+  const mReq = createRequire(fileURLToPath(new URL('./mcp/src/exec-engine.js', import.meta.url)));
   const pwPath = mReq.resolve('playwright-core');
   const { default: pw } = await import(pwPath);
   const { chromium } = pw;
@@ -308,7 +309,13 @@ async function cmdPlugin() {
 async function cmdUpdate() {
   const { spawnSync } = await import('node:child_process');
   console.log('Checking for updates...');
-  const update = await checkForUpdate();
+  let update;
+  try {
+    update = await checkForUpdate();
+  } catch (err) {
+    console.error(`Update check failed: ${err.message}`);
+    return;
+  }
   if (!update) {
     console.log('Already up to date.');
     return;
diff --git a/docs/BUILDING_PLUGINS.md b/docs/BUILDING_PLUGINS.md
index a8c5be4..34e6928 100644
--- a/docs/BUILDING_PLUGINS.md
+++ b/docs/BUILDING_PLUGINS.md
@@ -388,11 +388,17 @@ Before opening a PR, verify all of the following:
   "version": "1.0.0",
   "audience": ["developer"],
   "capabilities": ["helpers"],
-  "file": "plugins/community/highlight.js",
-  "readme": "plugins/community/highlight.md"
+  "url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/community/highlight/index.js",
+  "sha256": "abc123...",
+  "skill_url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/community/highlight/SKILL.md"
 }
 ```
 
+> **Field reference**
+> - `url` *(required)* — absolute URL to the plugin JS file; fetched and installed by the plugin installer
+> - `sha256` *(recommended)* — hex SHA-256 of the JS file for integrity verification; omit only in dev/test mode
+> - `skill_url` *(optional)* — absolute URL to a `SKILL.md` Claude skill file bundled with the plugin
+
 ---
 
 ## 8. Plugin Versioning
diff --git a/mcp/src/index.js b/mcp/src/index.js
index ad06862..0f61800 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -326,12 +326,13 @@ function registerExecuteTool(skillAppendix = '') {
       try {
         const result = await runCode(code, execCtx, timeout);
         const formatted = formatResult(result);
-        // Inject update notice into the first text response of the session (once only)
+        const content = [formatted];
+        // Append update notice as a separate content item (once only per session)
         if (pendingUpdate && !updateNoticeSent && formatted.type === 'text') {
           updateNoticeSent = true;
-          formatted.text += `\n\n[BrowserForce update available: ${pendingUpdate.current} → ${pendingUpdate.latest}]\n[Run: browserforce update   or: npm install -g browserforce]`;
+          content.push({ type: 'text', text: `[BrowserForce update available: ${pendingUpdate.current} → ${pendingUpdate.latest}]\n[Run: browserforce update   or: npm install -g browserforce]` });
         }
-        return { content: [formatted] };
+        return { content };
       } catch (err) {
         const isTimeout = err instanceof CodeExecutionTimeoutError;
         const hint = isTimeout ? '' : '\n\n[If connection lost, call reset tool to reconnect]';
diff --git a/mcp/src/update-check.js b/mcp/src/update-check.js
index 2815f7d..8e6aa60 100644
--- a/mcp/src/update-check.js
+++ b/mcp/src/update-check.js
@@ -2,6 +2,7 @@ import https from 'node:https';
 import { readFileSync, writeFileSync, mkdirSync } from 'node:fs';
 import { join } from 'node:path';
 import { homedir } from 'node:os';
+import { fileURLToPath } from 'node:url';
 
 export function semverGt(a, b) {
   const pa = a.split('.').map(Number);
@@ -20,46 +21,42 @@ export function semverGt(a, b) {
  * Never throws — all errors resolve to null.
  */
 export async function checkForUpdate() {
-  try {
-    // package.json is two levels up from mcp/src/
-    const pkgPath = new URL('../../package.json', import.meta.url).pathname;
-    const current = JSON.parse(readFileSync(pkgPath, 'utf8')).version;
+  // package.json is two levels up from mcp/src/
+  const pkgPath = fileURLToPath(new URL('../../package.json', import.meta.url));
+  const current = JSON.parse(readFileSync(pkgPath, 'utf8')).version;
 
-    const cacheDir = join(homedir(), '.browserforce');
-    const cacheFile = join(cacheDir, 'update-check.json');
+  const cacheDir = join(homedir(), '.browserforce');
+  const cacheFile = join(cacheDir, 'update-check.json');
 
-    // Return cached result if still fresh (< 24 h)
-    try {
-      const cached = JSON.parse(readFileSync(cacheFile, 'utf8'));
-      if (Date.now() - cached.checkedAt < 86_400_000) {
-        return semverGt(cached.latest, current) ? { current, latest: cached.latest } : null;
-      }
-    } catch { /* no cache yet, or invalid */ }
+  // Return cached result if still fresh (< 24 h)
+  try {
+    const cached = JSON.parse(readFileSync(cacheFile, 'utf8'));
+    if (Date.now() - cached.checkedAt < 86_400_000) {
+      return semverGt(cached.latest, current) ? { current, latest: cached.latest } : null;
+    }
+  } catch { /* no cache yet, or invalid */ }
 
-    // Fetch latest from npm registry
-    const latest = await new Promise((resolve, reject) => {
-      const req = https.get(
-        'https://registry.npmjs.org/browserforce/latest',
-        { headers: { 'User-Agent': 'browserforce-cli' } },
-        (res) => {
-          if (res.statusCode !== 200) { res.resume(); return reject(new Error(`HTTP ${res.statusCode}`)); }
-          let data = '';
-          res.on('data', d => (data += d));
-          res.on('end', () => { try { resolve(JSON.parse(data).version); } catch { reject(new Error('parse error')); } });
-        },
-      );
-      req.on('error', reject);
-      req.setTimeout(5000, () => { req.destroy(); reject(new Error('timeout')); });
-    });
+  // Fetch latest from npm registry — let errors propagate to caller
+  const latest = await new Promise((resolve, reject) => {
+    const req = https.get(
+      'https://registry.npmjs.org/browserforce/latest',
+      { headers: { 'User-Agent': 'browserforce-cli' } },
+      (res) => {
+        if (res.statusCode !== 200) { res.resume(); return reject(new Error(`HTTP ${res.statusCode}`)); }
+        let data = '';
+        res.on('data', d => (data += d));
+        res.on('end', () => { try { resolve(JSON.parse(data).version); } catch { reject(new Error('parse error')); } });
+      },
+    );
+    req.on('error', reject);
+    req.setTimeout(5000, () => { req.destroy(); reject(new Error('timeout')); });
+  });
 
-    // Persist to cache
-    try {
-      mkdirSync(cacheDir, { recursive: true });
-      writeFileSync(cacheFile, JSON.stringify({ checkedAt: Date.now(), latest }));
-    } catch { /* ignore cache write errors */ }
+  // Persist to cache
+  try {
+    mkdirSync(cacheDir, { recursive: true });
+    writeFileSync(cacheFile, JSON.stringify({ checkedAt: Date.now(), latest }));
+  } catch { /* ignore cache write errors */ }
 
-    return semverGt(latest, current) ? { current, latest } : null;
-  } catch {
-    return null;
-  }
+  return semverGt(latest, current) ? { current, latest } : null;
 }
diff --git a/package.json b/package.json
index abc9d59..7fbc4e0 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce",
-  "version": "1.0.9",
+  "version": "1.0.10",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",

From 67baea3e191dde03f6273871964b7fecc3eb60ac Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 23:23:46 +0530
Subject: [PATCH 009/192] fix: include extension/ in npm package files

The extension/ directory was previously omitted from the npm package,
preventing npm-installed users from accessing the Chrome extension files.
This change adds extension/ to the files array so the full extension
bundle (manifest, background.js, popup UI, and icons) ships with the
published package.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 package.json | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/package.json b/package.json
index 7fbc4e0..850577c 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce",
-  "version": "1.0.10",
+  "version": "1.0.11",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",
@@ -29,6 +29,7 @@
   "files": [
     "README.md",
     "bin.js",
+    "extension/",
     "relay/src/",
     "relay/package.json",
     "mcp/src/",

From b7a116f831710507d9e547574b72cb36fd1e23ce Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 23:25:34 +0530
Subject: [PATCH 010/192] feat: add install-extension command with VERSION
 sentinel

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 bin.js           | 42 ++++++++++++++++++++++++++++++++++++++++--
 test/cli.test.js | 39 +++++++++++++++++++++++++++++++++++++++
 2 files changed, 79 insertions(+), 2 deletions(-)

diff --git a/bin.js b/bin.js
index 6e723a0..9f19d44 100644
--- a/bin.js
+++ b/bin.js
@@ -329,6 +329,42 @@ async function cmdUpdate() {
   console.log(`Updated to ${update.latest}.`);
 }
 
+async function doInstallExtension(quiet) {
+  const { cpSync, mkdirSync, writeFileSync, readFileSync } = await import('node:fs');
+  const { join, dirname } = await import('node:path');
+  const { homedir } = await import('node:os');
+
+  const pkgDir = dirname(fileURLToPath(import.meta.url));
+  const src = join(pkgDir, 'extension');
+  const dest = process.env.BF_EXT_DIR || join(homedir(), '.browserforce', 'extension');
+
+  mkdirSync(dest, { recursive: true });
+  cpSync(src, dest, { recursive: true });
+
+  // VERSION sentinel — tracks npm package version, NOT manifest.json version (those are separate tracks)
+  const pkgVersion = JSON.parse(readFileSync(join(pkgDir, 'package.json'), 'utf8')).version;
+  writeFileSync(join(dest, 'VERSION'), pkgVersion);
+
+  if (!quiet) {
+    console.log(`Extension installed to: ${dest}`);
+    console.log('');
+    console.log('To load in Chrome:');
+    console.log('  1. Open chrome://extensions/');
+    console.log('  2. Enable Developer mode (toggle, top-right)');
+    console.log('  3. Click "Load unpacked" → select:');
+    console.log(`     ${dest}`);
+    console.log('');
+    console.log('❗ After any BrowserForce update, re-run: browserforce install-extension');
+    console.log('   Then reload the extension in chrome://extensions/ (click the ↺ icon).');
+  }
+
+  return { dest, pkgVersion };
+}
+
+async function cmdInstallExtension() {
+  await doInstallExtension(false);
+}
+
 function cmdHelp() {
   console.log(`
   BrowserForce — Give AI agents your real Chrome browser
@@ -345,6 +381,7 @@ function cmdHelp() {
     browserforce plugin install <n> Install a plugin from the registry
     browserforce plugin remove <n>  Remove an installed plugin
     browserforce update             Update to the latest version
+    browserforce install-extension  Copy extension to ~/.browserforce/extension/
     browserforce -e "<code>"        Execute Playwright JavaScript (one-shot)
 
   Options:
@@ -373,7 +410,8 @@ function cmdHelp() {
 const commands = {
   serve: cmdServe, mcp: cmdMcp, status: cmdStatus, tabs: cmdTabs,
   screenshot: cmdScreenshot, snapshot: cmdSnapshot, navigate: cmdNavigate,
-  execute: cmdExecute, plugin: cmdPlugin, update: cmdUpdate, help: cmdHelp,
+  execute: cmdExecute, plugin: cmdPlugin, update: cmdUpdate,
+  'install-extension': cmdInstallExtension, help: cmdHelp,
 };
 
 const handler = commands[command];
@@ -385,7 +423,7 @@ if (!handler) {
 
 // Start update check in background — skipped for long-running or self-update commands
 const updatePromise = (command !== 'serve' && command !== 'mcp' && command !== 'update')
-  ? checkForUpdate()
+  ? checkForUpdate().catch(() => null)
   : null;
 
 try {
diff --git a/test/cli.test.js b/test/cli.test.js
index b13b4ed..307d259 100644
--- a/test/cli.test.js
+++ b/test/cli.test.js
@@ -198,3 +198,42 @@ describe('CLI plugin commands', () => {
     }
   });
 });
+
+describe('CLI install-extension', () => {
+  let tmpExt;
+
+  before(() => {
+    tmpExt = join(tmpdir(), `bf-ext-${Math.random().toString(36).slice(2)}`);
+  });
+
+  after(() => {
+    rmSync(tmpExt, { recursive: true, force: true });
+  });
+
+  it('install-extension copies extension files and writes VERSION', async () => {
+    const { stdout } = await exec('node', ['bin.js', 'install-extension'], {
+      env: { ...process.env, BF_EXT_DIR: tmpExt },
+    });
+    // Check output
+    assert.ok(stdout.includes('Extension installed to:'));
+    assert.ok(stdout.includes(tmpExt));
+    assert.ok(stdout.includes('Load unpacked'));
+    assert.ok(stdout.includes('❗'));
+    assert.ok(stdout.includes('↺'));
+
+    // Check files were copied
+    const { existsSync, readFileSync } = await import('node:fs');
+    assert.ok(existsSync(join(tmpExt, 'manifest.json')));
+    assert.ok(existsSync(join(tmpExt, 'background.js')));
+
+    // Check VERSION sentinel
+    const version = readFileSync(join(tmpExt, 'VERSION'), 'utf8').trim();
+    const pkgVersion = JSON.parse(readFileSync('package.json', 'utf8')).version;
+    assert.equal(version, pkgVersion);
+  });
+
+  it('install-extension is listed in help', async () => {
+    const { stdout } = await exec('node', ['bin.js', 'help']);
+    assert.ok(stdout.includes('install-extension'));
+  });
+});

From 2109e009ff0587b9801c01e555dd96c5a9cf0cbc Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 23:27:12 +0530
Subject: [PATCH 011/192] feat: auto-sync extension after browserforce update

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 bin.js           | 15 +++++++++++++++
 test/cli.test.js | 15 +++++++++++++++
 2 files changed, 30 insertions(+)

diff --git a/bin.js b/bin.js
index 9f19d44..e061b80 100644
--- a/bin.js
+++ b/bin.js
@@ -327,6 +327,21 @@ async function cmdUpdate() {
     process.exit(1);
   }
   console.log(`Updated to ${update.latest}.`);
+
+  // Auto-sync extension if user has previously run install-extension
+  const { readFileSync: readFs } = await import('node:fs');
+  const { join: pathJoin } = await import('node:path');
+  const { homedir: osHomedir } = await import('node:os');
+  const extDir = process.env.BF_EXT_DIR || pathJoin(osHomedir(), '.browserforce', 'extension');
+  try {
+    readFs(pathJoin(extDir, 'VERSION'), 'utf8'); // existence check
+    const { dest } = await doInstallExtension(true);
+    console.log(`Extension updated in ${dest}`);
+    console.log('❗ Reload the extension in chrome://extensions/ (click the ↺ icon).');
+  } catch {
+    // No VERSION file — user hasn't run install-extension yet
+    console.log('Tip: run browserforce install-extension to set up the Chrome extension.');
+  }
 }
 
 async function doInstallExtension(quiet) {
diff --git a/test/cli.test.js b/test/cli.test.js
index 307d259..f5bf1d5 100644
--- a/test/cli.test.js
+++ b/test/cli.test.js
@@ -236,4 +236,19 @@ describe('CLI install-extension', () => {
     const { stdout } = await exec('node', ['bin.js', 'help']);
     assert.ok(stdout.includes('install-extension'));
   });
+
+  it('install-extension replaces stale VERSION with current package version', async () => {
+    const { mkdirSync, writeFileSync, readFileSync } = await import('node:fs');
+    // Simulate a previously-installed-but-stale extension
+    mkdirSync(tmpExt, { recursive: true });
+    writeFileSync(join(tmpExt, 'VERSION'), '0.0.1');
+
+    // Re-running install-extension should overwrite VERSION with current version
+    await exec('node', ['bin.js', 'install-extension'], {
+      env: { ...process.env, BF_EXT_DIR: tmpExt },
+    });
+    const version = readFileSync(join(tmpExt, 'VERSION'), 'utf8').trim();
+    const pkgVersion = JSON.parse(readFileSync('package.json', 'utf8')).version;
+    assert.equal(version, pkgVersion);
+  });
 });

From 08985dd8aec1d93e130b14349336722db7c55786 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 23:30:14 +0530
Subject: [PATCH 012/192] feat: warn in serve when installed extension is
 outdated

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 bin.js           | 16 +++++++++++++
 test/cli.test.js | 58 ++++++++++++++++++++++++++++++++++++++++++++++++
 2 files changed, 74 insertions(+)

diff --git a/bin.js b/bin.js
index e061b80..94d723a 100644
--- a/bin.js
+++ b/bin.js
@@ -231,6 +231,22 @@ async function cmdExecute() {
 }
 
 async function cmdServe() {
+  // Warn if installed extension is outdated vs current package
+  try {
+    const { readFileSync } = await import('node:fs');
+    const { join, dirname } = await import('node:path');
+    const { homedir } = await import('node:os');
+    const pkgDir = dirname(fileURLToPath(import.meta.url));
+    const pkgVersion = JSON.parse(readFileSync(join(pkgDir, 'package.json'), 'utf8')).version;
+    const extDir = process.env.BF_EXT_DIR || join(homedir(), '.browserforce', 'extension');
+    const installedVersion = readFileSync(join(extDir, 'VERSION'), 'utf8').trim();
+    if (installedVersion !== pkgVersion) {
+      process.stderr.write(`⚠  Extension is outdated (installed: ${installedVersion}, current: ${pkgVersion}).\n`);
+      process.stderr.write(`   Run: browserforce install-extension\n`);
+      process.stderr.write(`❗ Then reload the extension in chrome://extensions/ (click the ↺ icon).\n\n`);
+    }
+  } catch { /* no VERSION file — git clone or first install; skip */ }
+
   const { RelayServer } = await import('./relay/src/index.js');
   const port = parseInt(process.env.RELAY_PORT || positionals[1] || '19222', 10);
   const relay = new RelayServer(port);
diff --git a/test/cli.test.js b/test/cli.test.js
index f5bf1d5..fdddfc5 100644
--- a/test/cli.test.js
+++ b/test/cli.test.js
@@ -251,4 +251,62 @@ describe('CLI install-extension', () => {
     const pkgVersion = JSON.parse(readFileSync('package.json', 'utf8')).version;
     assert.equal(version, pkgVersion);
   });
+
+  it('serve warns when extension VERSION is outdated', async () => {
+    const { mkdirSync, writeFileSync } = await import('node:fs');
+    const staleDir = join(tmpdir(), `bf-ext-stale-${Math.random().toString(36).slice(2)}`);
+    mkdirSync(staleDir, { recursive: true });
+    writeFileSync(join(staleDir, 'VERSION'), '0.0.1'); // intentionally stale
+
+    const warning = await new Promise((resolve, reject) => {
+      const child = spawn('node', ['bin.js', 'serve'], {
+        env: { ...process.env, BF_EXT_DIR: staleDir },
+      });
+      let stderr = '';
+      const timer = setTimeout(() => {
+        child.kill('SIGKILL');
+        reject(new Error('serve timed out without producing stderr'));
+      }, 5000);
+      child.stderr.on('data', (chunk) => {
+        stderr += chunk.toString();
+        if (stderr.includes('❗')) {
+          clearTimeout(timer);
+          child.kill('SIGKILL');
+          resolve(stderr);
+        }
+      });
+      child.on('error', (err) => { clearTimeout(timer); reject(err); });
+    });
+
+    assert.ok(warning.includes('outdated'));
+    assert.ok(warning.includes('install-extension'));
+    assert.ok(warning.includes('❗'));
+
+    rmSync(staleDir, { recursive: true, force: true });
+  });
+
+  it('serve does NOT warn when VERSION matches current package', async () => {
+    const { mkdirSync, writeFileSync, readFileSync } = await import('node:fs');
+    const freshDir = join(tmpdir(), `bf-ext-fresh-${Math.random().toString(36).slice(2)}`);
+    mkdirSync(freshDir, { recursive: true });
+    const currentVersion = JSON.parse(readFileSync('package.json', 'utf8')).version;
+    writeFileSync(join(freshDir, 'VERSION'), currentVersion);
+
+    const result = await new Promise((resolve) => {
+      const child = spawn('node', ['bin.js', 'serve'], {
+        env: { ...process.env, BF_EXT_DIR: freshDir },
+      });
+      let stderr = '';
+      // Give it 1.5s to produce any warning, then declare "no warning"
+      setTimeout(() => {
+        child.kill('SIGKILL');
+        resolve(stderr);
+      }, 1500);
+      child.stderr.on('data', (chunk) => { stderr += chunk.toString(); });
+    });
+
+    assert.ok(!result.includes('outdated'), `Unexpected warning: ${result}`);
+
+    rmSync(freshDir, { recursive: true, force: true });
+  });
 });

From 67ddaade1ee23dfc64ad9e30c1b557dab22cd32c Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 23:31:12 +0530
Subject: [PATCH 013/192] docs: update setup instructions for npm
 install-extension flow

---
 README.md | 16 ++++++++++++++--
 1 file changed, 14 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index c6efe63..ec89a30 100644
--- a/README.md
+++ b/README.md
@@ -55,10 +55,22 @@ pnpm install
 
 ### 2. Load the Chrome extension
 
+**If you installed via npm:**
+
+1. Run: `browserforce install-extension`
+2. Open `chrome://extensions/` in Chrome
+3. Enable **Developer mode** (top-right toggle)
+4. Click **Load unpacked** → select the path printed in step 1
+
+❗ After every BrowserForce update, re-run `browserforce install-extension`, then reload the extension in `chrome://extensions/` (click the ↺ icon next to BrowserForce).
+
+**If you cloned the repo:**
+
 1. Open `chrome://extensions/` in Chrome
 2. Enable **Developer mode** (top-right toggle)
 3. Click **Load unpacked** → select the `extension/` folder
-4. Extension icon appears in your toolbar (gray = disconnected)
+
+After loading, the extension icon appears in your toolbar (gray = disconnected).
 
 ### 3. Done
 
@@ -79,7 +91,7 @@ Most OpenClaw users chat with their agent from Telegram or WhatsApp. BrowserForc
 **Quick setup** (copy-paste into your terminal):
 
 ```bash
-npm install -g browserforce && npx -y skills add ivalsaraj/browserforce
+npm install -g browserforce && browserforce install-extension && npx -y skills add ivalsaraj/browserforce
 ```
 
 Then start the relay (keep this running):

From 5b7cca9c625f288f6693178a7c08e4c17f1a9efd Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 23:32:36 +0530
Subject: [PATCH 014/192] chore: bump version to 1.0.12

---
 package.json | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/package.json b/package.json
index 850577c..51c77d4 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce",
-  "version": "1.0.11",
+  "version": "1.0.12",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",

From 93ccfefcec0f114afdd0bee07f908755cc20594e Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 23 Feb 2026 23:39:32 +0530
Subject: [PATCH 015/192] =?UTF-8?q?fix:=20review=20findings=20=E2=80=94=20?=
 =?UTF-8?q?stale-file=20purge,=20src=20guard,=20test=20port=20isolation,?=
 =?UTF-8?q?=20update=20error=20separation?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---
 README.md        |  2 ++
 bin.js           | 15 ++++++++++-----
 test/cli.test.js |  4 ++--
 3 files changed, 14 insertions(+), 7 deletions(-)

diff --git a/README.md b/README.md
index ec89a30..e071c6b 100644
--- a/README.md
+++ b/README.md
@@ -182,6 +182,8 @@ browserforce -e "<code>"        # Run Playwright JavaScript (one-shot)
 browserforce plugin list        # List installed plugins
 browserforce plugin install <n> # Install a plugin from the registry
 browserforce plugin remove <n>  # Remove an installed plugin
+browserforce update             # Update to the latest version
+browserforce install-extension  # Copy extension to ~/.browserforce/extension/
 ```
 
 Each `-e` command is one-shot — state does not persist between calls. For persistent state, use the MCP server.
diff --git a/bin.js b/bin.js
index 94d723a..dfdabfc 100644
--- a/bin.js
+++ b/bin.js
@@ -349,19 +349,19 @@ async function cmdUpdate() {
   const { join: pathJoin } = await import('node:path');
   const { homedir: osHomedir } = await import('node:os');
   const extDir = process.env.BF_EXT_DIR || pathJoin(osHomedir(), '.browserforce', 'extension');
-  try {
-    readFs(pathJoin(extDir, 'VERSION'), 'utf8'); // existence check
+  let hasVersion = false;
+  try { readFs(pathJoin(extDir, 'VERSION'), 'utf8'); hasVersion = true; } catch { /* not installed */ }
+  if (hasVersion) {
     const { dest } = await doInstallExtension(true);
     console.log(`Extension updated in ${dest}`);
     console.log('❗ Reload the extension in chrome://extensions/ (click the ↺ icon).');
-  } catch {
-    // No VERSION file — user hasn't run install-extension yet
+  } else {
     console.log('Tip: run browserforce install-extension to set up the Chrome extension.');
   }
 }
 
 async function doInstallExtension(quiet) {
-  const { cpSync, mkdirSync, writeFileSync, readFileSync } = await import('node:fs');
+  const { cpSync, mkdirSync, writeFileSync, readFileSync, existsSync, rmSync } = await import('node:fs');
   const { join, dirname } = await import('node:path');
   const { homedir } = await import('node:os');
 
@@ -369,6 +369,11 @@ async function doInstallExtension(quiet) {
   const src = join(pkgDir, 'extension');
   const dest = process.env.BF_EXT_DIR || join(homedir(), '.browserforce', 'extension');
 
+  if (!existsSync(src)) {
+    throw new Error(`Extension source not found at ${src}.\nIs browserforce installed via npm? Try: npm install -g browserforce`);
+  }
+
+  rmSync(dest, { recursive: true, force: true });
   mkdirSync(dest, { recursive: true });
   cpSync(src, dest, { recursive: true });
 
diff --git a/test/cli.test.js b/test/cli.test.js
index fdddfc5..e6c2ed9 100644
--- a/test/cli.test.js
+++ b/test/cli.test.js
@@ -260,7 +260,7 @@ describe('CLI install-extension', () => {
 
     const warning = await new Promise((resolve, reject) => {
       const child = spawn('node', ['bin.js', 'serve'], {
-        env: { ...process.env, BF_EXT_DIR: staleDir },
+        env: { ...process.env, BF_EXT_DIR: staleDir, RELAY_PORT: String(getRandomPort()) },
       });
       let stderr = '';
       const timer = setTimeout(() => {
@@ -294,7 +294,7 @@ describe('CLI install-extension', () => {
 
     const result = await new Promise((resolve) => {
       const child = spawn('node', ['bin.js', 'serve'], {
-        env: { ...process.env, BF_EXT_DIR: freshDir },
+        env: { ...process.env, BF_EXT_DIR: freshDir, RELAY_PORT: String(getRandomPort()) },
       });
       let stderr = '';
       // Give it 1.5s to produce any warning, then declare "no warning"

From bef3093086a371b7ec54a086a6b36e94f072dcec Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 15:53:39 +0530
Subject: [PATCH 016/192] feat: implement extension reload functionality via
 HTTP endpoint

- Added a new endpoint to the relay server for reloading the extension, which acknowledges the reload request and waits for an acknowledgment from the extension before confirming the reload status.
- Updated the installation command to attempt reloading the extension after installation, providing feedback based on whether the reload was successful.
- Enhanced the background script to handle reload messages from the relay server.
- Added tests for the new reload endpoint to ensure proper authentication and response handling.

This feature improves the user experience by automating the extension reload process after updates.
---
 bin.js                          |  41 ++++++++++--
 extension/background.js         |   8 +++
 relay/src/index.js              |  37 +++++++++++
 relay/test/relay-server.test.js | 107 ++++++++++++++++++++++++++++++++
 4 files changed, 188 insertions(+), 5 deletions(-)

diff --git a/bin.js b/bin.js
index dfdabfc..3a32ced 100644
--- a/bin.js
+++ b/bin.js
@@ -352,14 +352,39 @@ async function cmdUpdate() {
   let hasVersion = false;
   try { readFs(pathJoin(extDir, 'VERSION'), 'utf8'); hasVersion = true; } catch { /* not installed */ }
   if (hasVersion) {
-    const { dest } = await doInstallExtension(true);
+    const { dest, reloaded } = await doInstallExtension(true);
     console.log(`Extension updated in ${dest}`);
-    console.log('❗ Reload the extension in chrome://extensions/ (click the ↺ icon).');
+    if (reloaded) {
+      console.log('  Reloading extension... ✓');
+    } else {
+      console.log('❗ Reload the extension in chrome://extensions/ (click the ↺ icon).');
+    }
   } else {
     console.log('Tip: run browserforce install-extension to set up the Chrome extension.');
   }
 }
 
+async function attemptExtensionReload() {
+  const { readFileSync } = await import('node:fs');
+  const { join } = await import('node:path');
+  const { homedir } = await import('node:os');
+  const tokenFile = join(homedir(), '.browserforce', 'auth-token');
+  let authToken = '';
+  try { authToken = readFileSync(tokenFile, 'utf8').trim(); } catch { return false; }
+  if (!authToken) return false;
+
+  const { getRelayHttpUrl } = await import('./mcp/src/exec-engine.js');
+  let baseUrl;
+  try { baseUrl = getRelayHttpUrl(); } catch { baseUrl = 'http://127.0.0.1:19222'; }
+
+  try {
+    const { status, body } = await httpFetch('POST', `${baseUrl}/extension/reload`, {}, authToken);
+    return status === 200 && body?.reloaded === true;
+  } catch {
+    return false; // relay not running
+  }
+}
+
 async function doInstallExtension(quiet) {
   const { cpSync, mkdirSync, writeFileSync, readFileSync, existsSync, rmSync } = await import('node:fs');
   const { join, dirname } = await import('node:path');
@@ -381,6 +406,8 @@ async function doInstallExtension(quiet) {
   const pkgVersion = JSON.parse(readFileSync(join(pkgDir, 'package.json'), 'utf8')).version;
   writeFileSync(join(dest, 'VERSION'), pkgVersion);
 
+  const reloaded = await attemptExtensionReload();
+
   if (!quiet) {
     console.log(`Extension installed to: ${dest}`);
     console.log('');
@@ -390,11 +417,15 @@ async function doInstallExtension(quiet) {
     console.log('  3. Click "Load unpacked" → select:');
     console.log(`     ${dest}`);
     console.log('');
-    console.log('❗ After any BrowserForce update, re-run: browserforce install-extension');
-    console.log('   Then reload the extension in chrome://extensions/ (click the ↺ icon).');
+    if (reloaded) {
+      console.log('  Reloading extension... ✓');
+    } else {
+      console.log('❗ After any BrowserForce update, re-run: browserforce install-extension');
+      console.log('   Then reload the extension in chrome://extensions/ (click the ↺ icon).');
+    }
   }
 
-  return { dest, pkgVersion };
+  return { dest, pkgVersion, reloaded };
 }
 
 async function cmdInstallExtension() {
diff --git a/extension/background.js b/extension/background.js
index e32bf9c..efc8447 100644
--- a/extension/background.js
+++ b/extension/background.js
@@ -146,6 +146,14 @@ function handleRelayMessage(msg) {
     return;
   }
 
+  if (msg.method === 'reload') {
+    // Ack before restarting so relay knows the message was received
+    send({ method: 'reload-ack' });
+    // Yield so the send flushes before the service worker restarts
+    setTimeout(() => chrome.runtime.reload(), 0);
+    return;
+  }
+
   // Command from relay (has id)
   if (msg.id !== undefined) {
     executeCommand(msg)
diff --git a/relay/src/index.js b/relay/src/index.js
index 0fca042..5459f5e 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -148,6 +148,9 @@ class RelayServer {
     // State
     this.autoAttachEnabled = false;
     this.autoAttachParams = null;
+
+    // Pending extension reload ack resolver (at most one at a time)
+    this._extReloadResolve = null;
   }
 
   start({ writeCdpUrl = true } = {}) {
@@ -295,6 +298,35 @@ class RelayServer {
       return;
     }
 
+    if (url.pathname === '/extension/reload' && req.method === 'POST') {
+      if (!this._requireAuth(req, res)) return;
+      if (!this.ext || this.ext.ws.readyState !== WebSocket.OPEN) {
+        res.end(JSON.stringify({ reloaded: false, reason: 'not connected' }));
+        return;
+      }
+      // Await ack with 2.5s timeout; extension sends 'reload-ack' before restarting
+      const reloaded = await new Promise((resolve) => {
+        const timer = setTimeout(() => {
+          this._extReloadResolve = null;
+          resolve(false);
+        }, 2500);
+        this._extReloadResolve = () => {
+          clearTimeout(timer);
+          this._extReloadResolve = null;
+          resolve(true);
+        };
+        try {
+          this.ext.ws.send(JSON.stringify({ method: 'reload' }));
+        } catch {
+          clearTimeout(timer);
+          this._extReloadResolve = null;
+          resolve(false);
+        }
+      });
+      res.end(JSON.stringify({ reloaded }));
+      return;
+    }
+
     res.statusCode = 404;
     res.end(JSON.stringify({ error: 'Not found' }));
   }
@@ -439,6 +471,11 @@ class RelayServer {
     // Events from extension
     if (msg.method === 'pong') return;
 
+    if (msg.method === 'reload-ack') {
+      if (this._extReloadResolve) this._extReloadResolve();
+      return;
+    }
+
     if (msg.method === 'cdpEvent') {
       this._handleCdpEventFromExt(msg.params);
       return;
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index dc4ad66..593bc70 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -240,6 +240,113 @@ describe('Plugin API Endpoints', () => {
   });
 });
 
+// ─── Extension Reload Endpoint ───────────────────────────────────────────────
+
+describe('Extension Reload Endpoint', () => {
+  let relay, port;
+
+  function httpRequest(method, url, body, headers = {}) {
+    return new Promise((resolve, reject) => {
+      const opts = new URL(url);
+      const payload = body ? JSON.stringify(body) : undefined;
+      const req = http.request({
+        hostname: opts.hostname, port: opts.port,
+        path: opts.pathname, method,
+        headers: {
+          'Content-Type': 'application/json',
+          ...(payload ? { 'Content-Length': Buffer.byteLength(payload) } : {}),
+          ...headers,
+        },
+      }, (res) => {
+        let data = '';
+        res.on('data', c => { data += c; });
+        res.on('end', () => {
+          try { resolve({ status: res.statusCode, body: JSON.parse(data) }); }
+          catch { resolve({ status: res.statusCode, body: data }); }
+        });
+      });
+      req.on('error', reject);
+      if (payload) req.write(payload);
+      req.end();
+    });
+  }
+
+  before(async () => {
+    port = getRandomPort();
+    relay = new RelayServer(port);
+    relay.start({ writeCdpUrl: false });
+    await sleep(200);
+  });
+
+  after(() => relay.stop());
+
+  it('POST /extension/reload without token returns 401', async () => {
+    const { status, body } = await httpRequest('POST', `http://127.0.0.1:${port}/extension/reload`, {});
+    assert.equal(status, 401);
+    assert.ok(body.error.includes('Unauthorized'));
+  });
+
+  it('POST /extension/reload with invalid token returns 401', async () => {
+    const { status, body } = await httpRequest('POST', `http://127.0.0.1:${port}/extension/reload`, {}, {
+      Authorization: 'Bearer bad-token',
+    });
+    assert.equal(status, 401);
+    assert.ok(body.error.includes('Unauthorized'));
+  });
+
+  it('POST /extension/reload with valid token but no extension returns { reloaded: false }', async () => {
+    const { status, body } = await httpRequest('POST', `http://127.0.0.1:${port}/extension/reload`, {}, {
+      Authorization: `Bearer ${relay.authToken}`,
+    });
+    assert.equal(status, 200);
+    assert.equal(body.reloaded, false);
+  });
+
+  it('POST /extension/reload with extension connected and ack returns { reloaded: true }', async () => {
+    // Connect a mock extension that sends reload-ack
+    const extWs = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+
+    extWs.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'reload') {
+        extWs.send(JSON.stringify({ method: 'reload-ack' }));
+      }
+    });
+
+    const { status, body } = await httpRequest('POST', `http://127.0.0.1:${port}/extension/reload`, {}, {
+      Authorization: `Bearer ${relay.authToken}`,
+    });
+
+    extWs.close();
+    assert.equal(status, 200);
+    assert.equal(body.reloaded, true);
+  });
+
+  it('POST /extension/reload with extension connected but no ack times out to { reloaded: false }', async () => {
+    // Re-start relay to get a fresh extension slot (previous test's close may not have fully cleaned up)
+    relay.stop();
+    await sleep(100);
+    relay = new RelayServer(port);
+    relay.start({ writeCdpUrl: false });
+    await sleep(200);
+
+    // Connect a mock extension that does NOT send reload-ack
+    const extWs = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+
+    const { status, body } = await httpRequest('POST', `http://127.0.0.1:${port}/extension/reload`, {}, {
+      Authorization: `Bearer ${relay.authToken}`,
+    });
+
+    extWs.close();
+    assert.equal(status, 200);
+    assert.equal(body.reloaded, false);
+  });
+});
+
 // ─── WebSocket Security ──────────────────────────────────────────────────────
 
 describe('WebSocket Security', () => {

From 5884b6db042df837857a0c189745f823cf643b24 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 16:39:49 +0530
Subject: [PATCH 017/192] docs: align tool references with execute-only helper
 model

---
 GUIDE.md        | 5 ++---
 docs/PLUGINS.md | 3 +--
 2 files changed, 3 insertions(+), 5 deletions(-)

diff --git a/GUIDE.md b/GUIDE.md
index 4b5b2f9..966a2d2 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -212,12 +212,11 @@ browserforce navigate https://gmail.com
 
 ## MCP Tools Reference
 
-When connected via MCP (OpenClaw, Claude Desktop, Claude Code), the AI has three tools:
+When connected via MCP (OpenClaw, Claude Desktop, Claude Code), the AI has two tools:
 
 | Tool | What it does |
 |------|-------------|
-| `execute` | Run Playwright JavaScript in your real Chrome. Access `page`, `context`, `state`, `snapshot()`, `waitForPageLoad()`, `getLogs()`, and Node.js globals. |
-| `screenshot_with_labels` | Take a screenshot with Vimium-style accessibility labels overlaid on interactive elements. |
+| `execute` | Run Playwright JavaScript in your real Chrome. Access `page`, `context`, `state`, `snapshot()`, `waitForPageLoad()`, `getLogs()`, `screenshotWithAccessibilityLabels()`, `cleanHTML()`, `pageMarkdown()`, and Node.js globals. |
 | `reset` | Reconnect to the relay and clear state. Use when the connection drops. |
 
 The `execute` tool gives the agent full Playwright access — it can navigate, click, type, screenshot, read accessibility trees, and run JavaScript in the page context. All within your real browser session.
diff --git a/docs/PLUGINS.md b/docs/PLUGINS.md
index 2720475..0a0c362 100644
--- a/docs/PLUGINS.md
+++ b/docs/PLUGINS.md
@@ -252,7 +252,7 @@ export default {
     }
   },
 
-  // Standalone MCP tools registered alongside execute/reset/screenshot_with_labels.
+  // Standalone MCP tools registered alongside execute/reset.
   // Agents can call these directly by name.
   tools: [{
     name: 'my_tool',
@@ -443,4 +443,3 @@ The relay install endpoint only fetches from the known GitHub repo URL — no ar
 No sandboxing beyond that. Plugins are as trusted as any npm package you install.
 
 ---
-

From 477f8e372d766023041f4ea80caa6af0913fe8d1 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 16:40:03 +0530
Subject: [PATCH 018/192] feat(mcp): consolidate screenshot and content helpers
 into execute

---
 README.md                            |    3 +-
 mcp/src/clean-html.js                |  166 +++
 mcp/src/exec-engine.js               |   23 +
 mcp/src/index.js                     |   92 +-
 mcp/src/page-markdown.js             |  114 ++
 mcp/src/vendor/readability.bundle.js | 2064 ++++++++++++++++++++++++++
 mcp/test/exec-engine-plugins.test.js |   29 +-
 mcp/test/mcp-tools.test.js           |   34 +-
 8 files changed, 2420 insertions(+), 105 deletions(-)
 create mode 100644 mcp/src/clean-html.js
 create mode 100644 mcp/src/page-markdown.js
 create mode 100644 mcp/src/vendor/readability.bundle.js

diff --git a/README.md b/README.md
index e071c6b..74a8a59 100644
--- a/README.md
+++ b/README.md
@@ -303,8 +303,7 @@ state.results = await page.evaluate(() => document.title);
 
 | Tool | Description |
 |------|-------------|
-| `execute` | Run Playwright JavaScript in your real Chrome. Access `page`, `context`, `state`, `snapshot()`, `waitForPageLoad()`, `getLogs()`, and Node.js globals. |
-| `screenshot_with_labels` | Take a screenshot with Vimium-style accessibility labels overlaid on interactive elements. |
+| `execute` | Run Playwright JavaScript in your real Chrome. Access `page`, `context`, `state`, `snapshot()`, `waitForPageLoad()`, `getLogs()`, `screenshotWithAccessibilityLabels()`, `cleanHTML()`, `pageMarkdown()`, and Node.js globals. |
 | `reset` | Reconnect to the relay and clear state. Use when the connection drops. |
 
 ## Examples
diff --git a/mcp/src/clean-html.js b/mcp/src/clean-html.js
new file mode 100644
index 0000000..c47261b
--- /dev/null
+++ b/mcp/src/clean-html.js
@@ -0,0 +1,166 @@
+// Clean HTML extraction — runs entirely in the browser via page.evaluate().
+// Strips scripts, styles, decorative elements; keeps semantic attributes.
+
+/**
+ * Extracts cleaned HTML from a Playwright page or locator.
+ * All processing happens in-page via DOM manipulation — no server-side parsing deps.
+ *
+ * @param {import('playwright-core').Page} page
+ * @param {string} [selector] - CSS selector to scope extraction (default: document)
+ * @param {{ maxAttrLen?: number, maxContentLen?: number }} [opts]
+ * @returns {Promise<string>}
+ */
+export async function getCleanHTML(page, selector, opts = {}) {
+  const maxAttrLen = opts.maxAttrLen ?? 200;
+  const maxContentLen = opts.maxContentLen ?? 500;
+
+  const html = await page.evaluate(({ selector, maxAttrLen, maxContentLen }) => {
+    const TAGS_TO_REMOVE = new Set([
+      'script', 'style', 'link', 'meta', 'noscript',
+      'svg', 'head', 'iframe', 'object', 'embed',
+    ]);
+
+    const ATTRS_TO_KEEP = new Set([
+      'href', 'src', 'alt', 'title', 'name', 'value', 'checked',
+      'placeholder', 'type', 'role', 'target', 'label', 'for',
+      'aria-label', 'aria-placeholder', 'aria-valuetext',
+      'aria-roledescription', 'aria-hidden', 'aria-expanded',
+      'aria-checked', 'aria-selected', 'aria-disabled',
+      'aria-pressed', 'aria-required', 'aria-current',
+      'data-testid', 'data-test', 'data-cy', 'data-qa',
+    ]);
+
+    const SEMANTIC_TAGS = new Set([
+      'html', 'body', 'main', 'header', 'footer',
+      'nav', 'section', 'article', 'aside',
+    ]);
+
+    const FORM_TAGS = new Set(['input', 'select', 'textarea', 'button']);
+
+    function truncate(str, max) {
+      if (str.length <= max) return str;
+      return str.slice(0, max) + '...[' + (str.length - max) + ' more]';
+    }
+
+    function shouldKeepAttr(name) {
+      if (ATTRS_TO_KEEP.has(name)) return true;
+      if (name.startsWith('aria-')) return true;
+      if (name.startsWith('data-test') || name.startsWith('data-cy') || name.startsWith('data-qa')) return true;
+      return false;
+    }
+
+    function hasUsefulContent(el) {
+      if (el.nodeType === Node.TEXT_NODE) {
+        return el.textContent.trim().length > 0;
+      }
+      if (el.nodeType !== Node.ELEMENT_NODE) return false;
+
+      const tag = el.tagName.toLowerCase();
+      if (FORM_TAGS.has(tag)) return true;
+      if (tag === 'img' && el.getAttribute('alt')?.trim()) return true;
+      if (tag === 'a' && el.getAttribute('href')) return true;
+
+      for (const child of el.childNodes) {
+        if (hasUsefulContent(child)) return true;
+      }
+      return false;
+    }
+
+    function cleanNode(el) {
+      if (el.nodeType === Node.COMMENT_NODE) {
+        el.remove();
+        return;
+      }
+
+      if (el.nodeType === Node.TEXT_NODE) {
+        if (el.textContent.trim().length === 0) return;
+        el.textContent = truncate(el.textContent, maxContentLen);
+        return;
+      }
+
+      if (el.nodeType !== Node.ELEMENT_NODE) return;
+
+      const tag = el.tagName.toLowerCase();
+
+      if (TAGS_TO_REMOVE.has(tag)) {
+        el.remove();
+        return;
+      }
+
+      if (el.getAttribute('aria-hidden') === 'true') {
+        el.remove();
+        return;
+      }
+
+      // Strip non-semantic attributes
+      const attrsToRemove = [];
+      for (const attr of el.attributes) {
+        if (!shouldKeepAttr(attr.name)) {
+          attrsToRemove.push(attr.name);
+        }
+      }
+      for (const name of attrsToRemove) {
+        el.removeAttribute(name);
+      }
+
+      // Truncate long attribute values
+      for (const attr of el.attributes) {
+        if (attr.value.length > maxAttrLen) {
+          el.setAttribute(attr.name, truncate(attr.value, maxAttrLen));
+        }
+      }
+
+      // Recurse children (iterate in reverse since we may remove)
+      const children = Array.from(el.childNodes);
+      for (const child of children) {
+        cleanNode(child);
+      }
+
+      // After cleaning children: remove decorative elements (no text, no form elements)
+      if (!SEMANTIC_TAGS.has(tag) && !FORM_TAGS.has(tag) && !hasUsefulContent(el)) {
+        el.remove();
+        return;
+      }
+
+      // Unwrap unnecessary wrappers: single-child divs/spans with no attributes
+      if (el.attributes.length === 0 && el.children.length === 1 && el.childNodes.length === 1) {
+        const onlyChild = el.children[0];
+        if (onlyChild && onlyChild.nodeType === Node.ELEMENT_NODE) {
+          el.replaceWith(onlyChild);
+        }
+      }
+    }
+
+    // Determine root to clean
+    let root;
+    if (selector) {
+      const target = document.querySelector(selector);
+      if (!target) return '<empty />';
+      root = target.cloneNode(true);
+    } else {
+      root = document.documentElement.cloneNode(true);
+    }
+
+    cleanNode(root);
+
+    // Remove empty elements in multiple passes
+    let changed = true;
+    while (changed) {
+      changed = false;
+      for (const el of root.querySelectorAll('*')) {
+        if (
+          el.attributes.length === 0 &&
+          el.childNodes.length === 0 &&
+          !FORM_TAGS.has(el.tagName.toLowerCase())
+        ) {
+          el.remove();
+          changed = true;
+        }
+      }
+    }
+
+    return root.outerHTML || root.innerHTML || '';
+  }, { selector: selector || null, maxAttrLen, maxContentLen });
+
+  return html;
+}
diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index e59f0b7..96ca77c 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -10,6 +10,9 @@ import {
   TEST_ID_ATTRS,
   buildSnapshotText, parseSearchPattern, annotateStableAttrs,
 } from './snapshot.js';
+import { screenshotWithLabels } from './a11y-labels.js';
+import { getCleanHTML } from './clean-html.js';
+import { getPageMarkdown } from './page-markdown.js';
 
 // ─── Configuration ───────────────────────────────────────────────────────────
 
@@ -444,6 +447,19 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     if (consoleLogs) consoleLogs.set(activePage(), []);
   };
 
+  const screenshotWithAccessibilityLabels = async ({ selector, interactiveOnly = true } = {}) => {
+    const page = activePage();
+    const { screenshot, snapshot: snapText, labelCount } = await screenshotWithLabels(page, {
+      selector,
+      interactiveOnly,
+    });
+    return { _bf_type: 'labeled_screenshot', screenshot, snapshot: snapText, labelCount };
+  };
+
+  const cleanHTML = (selector, opts) => getCleanHTML(activePage(), selector, opts);
+
+  const pageMarkdown = () => getPageMarkdown(activePage());
+
   // Wrap plugin helpers to auto-inject (page, ctx, state) as first three args
   const wrappedPluginHelpers = {};
   for (const [name, fn] of Object.entries(pluginHelpers)) {
@@ -458,6 +474,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     ...wrappedPluginHelpers,           // plugin helpers spread first — built-ins always win
     page: defaultPage, context: ctx, state: userState,
     snapshot, waitForPageLoad, getLogs, clearLogs,
+    screenshotWithAccessibilityLabels, cleanHTML, pageMarkdown,
     fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout,
     TextEncoder, TextDecoder,
   };
@@ -481,6 +498,12 @@ export function formatResult(result) {
   if (result === undefined || result === null) {
     return { type: 'text', text: String(result) };
   }
+  if (result && typeof result === 'object' && result._bf_type === 'labeled_screenshot') {
+    return [
+      { type: 'image', data: result.screenshot.toString('base64'), mimeType: 'image/jpeg' },
+      { type: 'text', text: `Labels: ${result.labelCount} interactive elements\n\n${result.snapshot}` },
+    ];
+  }
   if (Buffer.isBuffer(result)) {
     return { type: 'image', data: result.toString('base64'), mimeType: 'image/png' };
   }
diff --git a/mcp/src/index.js b/mcp/src/index.js
index 0f61800..51edc83 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -1,5 +1,5 @@
 // BrowserForce — MCP Server
-// 3-tool architecture: execute (run Playwright code) + reset (reconnect) + screenshot_with_labels (visual a11y labels)
+// 2-tool architecture: execute (run Playwright code) + reset (reconnect)
 // Connects to the relay via Playwright's CDP client.
 
 import { McpServer } from '@modelcontextprotocol/sdk/server/mcp.js';
@@ -9,7 +9,6 @@ import { chromium } from 'playwright-core';
 import {
   getCdpUrl, ensureRelay, CodeExecutionTimeoutError, buildExecContext, runCode, formatResult,
 } from './exec-engine.js';
-import { screenshotWithLabels } from './a11y-labels.js';
 import { loadPlugins, buildPluginHelpers, buildPluginSkillAppendix } from './plugin-loader.js';
 import { checkForUpdate } from './update-check.js';
 
@@ -138,6 +137,16 @@ Helpers:
   waitForPageLoad({ timeout? })      Smart load detection (filters analytics/ads, polls readyState).
   getLogs({ count? })                Browser console logs captured for current page.
   clearLogs()                        Clear captured console logs.
+  screenshotWithAccessibilityLabels({ selector?, interactiveOnly? })
+                                     Vimium-style labeled screenshot + accessibility snapshot.
+                                     Returns image with color-coded element labels (e1, e2...) and
+                                     matching text snapshot. Use when visual layout matters.
+  cleanHTML(selector?, opts?)        Cleaned HTML — strips scripts, styles, decorative elements.
+                                     Keeps semantic attrs: href, src, role, aria-*, data-testid.
+                                     opts: { maxAttrLen?, maxContentLen? }
+  pageMarkdown()                     Article content via Mozilla Readability (Firefox Reader View).
+                                     Strips nav/ads/sidebars. Returns title + metadata + body text.
+                                     Falls back to raw body text for non-article pages.
 
 Globals: fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout, TextEncoder, TextDecoder
 
@@ -309,7 +318,7 @@ function registerExecuteTool(skillAppendix = '') {
     'execute',
     EXECUTE_PROMPT + skillAppendix,
     {
-      code: z.string().describe('JavaScript to run — page/context/state/snapshot/waitForPageLoad/getLogs in scope'),
+      code: z.string().describe('JavaScript to run — page/context/state/snapshot/waitForPageLoad/getLogs/cleanHTML/pageMarkdown in scope'),
       timeout: z.number().optional().describe('Max execution time in ms (default: 30000)'),
     },
     async ({ code, timeout = 30000 }) => {
@@ -326,9 +335,9 @@ function registerExecuteTool(skillAppendix = '') {
       try {
         const result = await runCode(code, execCtx, timeout);
         const formatted = formatResult(result);
-        const content = [formatted];
+        const content = Array.isArray(formatted) ? [...formatted] : [formatted];
         // Append update notice as a separate content item (once only per session)
-        if (pendingUpdate && !updateNoticeSent && formatted.type === 'text') {
+        if (pendingUpdate && !updateNoticeSent && content[0]?.type === 'text') {
           updateNoticeSent = true;
           content.push({ type: 'text', text: `[BrowserForce update available: ${pendingUpdate.current} → ${pendingUpdate.latest}]\n[Run: browserforce update   or: npm install -g browserforce]` });
         }
@@ -373,79 +382,6 @@ server.tool(
   }
 );
 
-// ─── Screenshot with Labels Tool ──────────────────────────────────────────────
-
-const SCREENSHOT_LABELS_PROMPT = `Take a screenshot with Vimium-style accessibility labels on interactive elements.
-
-Returns TWO content items:
-1. JPEG screenshot with color-coded labels (e1, e2, e3...) on buttons, links, inputs, etc.
-2. Text accessibility snapshot with matching refs and role/name locators
-
-Labels are color-coded by role:
-- Yellow: links
-- Orange: buttons, menu items, tabs
-- Red/pink: text inputs, search boxes
-- Green: checkboxes, radio buttons
-- Blue: sliders, spinbuttons, media
-- Purple: switches
-
-Use this tool when:
-- You need to understand the visual layout of a page
-- Text snapshot alone can't convey spatial relationships
-- You need to verify element positions (dashboards, grids, maps)
-- You need both visual context AND element refs for interaction
-
-After getting the screenshot, use the refs to interact via the execute tool:
-  await state.page.locator('role=button[name="Submit"]').click();
-
-Parameters:
-- selector: CSS selector to scope labels to part of the page (e.g., '#main', '.sidebar'). Main frame only.
-- interactiveOnly: Only label interactive elements like buttons/links/inputs (default: true)
-
-Limitations:
-- Main frame only — does not label elements inside cross-origin iframes
-- Locators are role/name based — no data-testid matching`;
-
-server.tool(
-  'screenshot_with_labels',
-  SCREENSHOT_LABELS_PROMPT,
-  {
-    selector: z.string().optional().describe('CSS selector to scope labels to a subtree of the main frame'),
-    interactiveOnly: z.boolean().optional().describe('Only label interactive elements (default: true)'),
-  },
-  async ({ selector, interactiveOnly = true }) => {
-    await ensureBrowser();
-    const ctx = getContext();
-    const page = (userState.page && !userState.page.isClosed())
-      ? userState.page
-      : ctx.pages()[0] || null;
-    if (!page) {
-      return {
-        content: [{ type: 'text', text: 'Error: No pages available. Open a tab first.' }],
-        isError: true,
-      };
-    }
-
-    try {
-      const { screenshot, snapshot, labelCount } = await screenshotWithLabels(page, {
-        selector,
-        interactiveOnly,
-      });
-      return {
-        content: [
-          { type: 'image', data: screenshot.toString('base64'), mimeType: 'image/jpeg' },
-          { type: 'text', text: `Labels: ${labelCount} interactive elements\n\n${snapshot}` },
-        ],
-      };
-    } catch (err) {
-      return {
-        content: [{ type: 'text', text: `Error: ${err.message}` }],
-        isError: true,
-      };
-    }
-  }
-);
-
 // ─── Plugin Init ─────────────────────────────────────────────────────────────
 
 async function initPlugins() {
diff --git a/mcp/src/page-markdown.js b/mcp/src/page-markdown.js
new file mode 100644
index 0000000..cd2ece9
--- /dev/null
+++ b/mcp/src/page-markdown.js
@@ -0,0 +1,114 @@
+// Page markdown extraction — uses Mozilla Readability (Firefox Reader View algorithm).
+// Injects a pre-bundled Readability IIFE into the page, then extracts article content.
+
+import { readFileSync } from 'node:fs';
+import { join, dirname } from 'node:path';
+import { fileURLToPath } from 'node:url';
+
+const __dirname = dirname(fileURLToPath(import.meta.url));
+
+let readabilityCode = null;
+
+function getReadabilityCode() {
+  if (readabilityCode) return readabilityCode;
+  const bundlePath = join(__dirname, 'vendor', 'readability.bundle.js');
+  readabilityCode = readFileSync(bundlePath, 'utf-8');
+  return readabilityCode;
+}
+
+/**
+ * Extracts page content as structured markdown using Mozilla Readability.
+ * Strips nav, ads, sidebars — returns article body with metadata.
+ *
+ * @param {import('playwright-core').Page} page
+ * @returns {Promise<string>}
+ */
+export async function getPageMarkdown(page) {
+  // Inject Readability if not already present
+  const hasReadability = await page.evaluate(() => !!globalThis.__readability);
+  if (!hasReadability) {
+    await page.evaluate(getReadabilityCode());
+  }
+
+  const result = await page.evaluate(() => {
+    const { Readability, isProbablyReaderable } = globalThis.__readability;
+
+    const documentClone = document.cloneNode(true);
+
+    if (!isProbablyReaderable(documentClone)) {
+      return {
+        content: document.body?.innerText || '',
+        title: document.title || null,
+        author: null,
+        excerpt: null,
+        siteName: null,
+        lang: document.documentElement?.lang || null,
+        publishedTime: null,
+        wordCount: (document.body?.innerText || '').split(/\s+/).filter(Boolean).length,
+        readable: false,
+      };
+    }
+
+    const article = new Readability(documentClone).parse();
+
+    if (!article) {
+      return {
+        content: document.body?.innerText || '',
+        title: document.title || null,
+        author: null,
+        excerpt: null,
+        siteName: null,
+        lang: document.documentElement?.lang || null,
+        publishedTime: null,
+        wordCount: (document.body?.innerText || '').split(/\s+/).filter(Boolean).length,
+        readable: false,
+      };
+    }
+
+    return {
+      content: article.textContent || '',
+      title: article.title || null,
+      author: article.byline || null,
+      excerpt: article.excerpt || null,
+      siteName: article.siteName || null,
+      lang: article.lang || null,
+      publishedTime: article.publishedTime || null,
+      wordCount: (article.textContent || '').split(/\s+/).filter(Boolean).length,
+      readable: true,
+    };
+  });
+
+  // Format output as structured markdown
+  const lines = [];
+
+  if (result.title) {
+    lines.push(`# ${result.title}`, '');
+  }
+
+  const metadata = [];
+  if (result.author) metadata.push(`Author: ${result.author}`);
+  if (result.siteName) metadata.push(`Site: ${result.siteName}`);
+  if (result.publishedTime) metadata.push(`Published: ${result.publishedTime}`);
+  if (metadata.length > 0) {
+    lines.push(metadata.join(' | '), '');
+  }
+
+  if (result.excerpt && result.content && result.excerpt !== result.content.slice(0, result.excerpt.length)) {
+    lines.push(`> ${result.excerpt}`, '');
+  }
+
+  lines.push(result.content);
+
+  if (!result.readable) {
+    lines.push('', '---', '_Note: Page was not recognized as an article. Returned raw body text._');
+  }
+
+  let markdown = lines.join('\n').trim();
+
+  // Sanitize unpaired surrogates that break JSON encoding
+  if (typeof markdown.toWellFormed === 'function') {
+    markdown = markdown.toWellFormed();
+  }
+
+  return markdown;
+}
diff --git a/mcp/src/vendor/readability.bundle.js b/mcp/src/vendor/readability.bundle.js
new file mode 100644
index 0000000..0288cf9
--- /dev/null
+++ b/mcp/src/vendor/readability.bundle.js
@@ -0,0 +1,2064 @@
+// Auto-generated by scripts/bundle-readability.js — do not edit
+(() => {
+  var __getOwnPropNames = Object.getOwnPropertyNames;
+  var __commonJS = (cb, mod) => function __require() {
+    return mod || (0, cb[__getOwnPropNames(cb)[0]])((mod = { exports: {} }).exports, mod), mod.exports;
+  };
+
+  // ../node_modules/.pnpm/@mozilla+readability@0.6.0/node_modules/@mozilla/readability/Readability.js
+  var require_Readability = __commonJS({
+    "../node_modules/.pnpm/@mozilla+readability@0.6.0/node_modules/@mozilla/readability/Readability.js"(exports, module) {
+      function Readability(doc, options) {
+        if (options && options.documentElement) {
+          doc = options;
+          options = arguments[2];
+        } else if (!doc || !doc.documentElement) {
+          throw new Error(
+            "First argument to Readability constructor should be a document object."
+          );
+        }
+        options = options || {};
+        this._doc = doc;
+        this._docJSDOMParser = this._doc.firstChild.__JSDOMParser__;
+        this._articleTitle = null;
+        this._articleByline = null;
+        this._articleDir = null;
+        this._articleSiteName = null;
+        this._attempts = [];
+        this._metadata = {};
+        this._debug = !!options.debug;
+        this._maxElemsToParse = options.maxElemsToParse || this.DEFAULT_MAX_ELEMS_TO_PARSE;
+        this._nbTopCandidates = options.nbTopCandidates || this.DEFAULT_N_TOP_CANDIDATES;
+        this._charThreshold = options.charThreshold || this.DEFAULT_CHAR_THRESHOLD;
+        this._classesToPreserve = this.CLASSES_TO_PRESERVE.concat(
+          options.classesToPreserve || []
+        );
+        this._keepClasses = !!options.keepClasses;
+        this._serializer = options.serializer || function(el) {
+          return el.innerHTML;
+        };
+        this._disableJSONLD = !!options.disableJSONLD;
+        this._allowedVideoRegex = options.allowedVideoRegex || this.REGEXPS.videos;
+        this._linkDensityModifier = options.linkDensityModifier || 0;
+        this._flags = this.FLAG_STRIP_UNLIKELYS | this.FLAG_WEIGHT_CLASSES | this.FLAG_CLEAN_CONDITIONALLY;
+        if (this._debug) {
+          let logNode = function(node) {
+            if (node.nodeType == node.TEXT_NODE) {
+              return `${node.nodeName} ("${node.textContent}")`;
+            }
+            let attrPairs = Array.from(node.attributes || [], function(attr) {
+              return `${attr.name}="${attr.value}"`;
+            }).join(" ");
+            return `<${node.localName} ${attrPairs}>`;
+          };
+          this.log = function() {
+            if (typeof console !== "undefined") {
+              let args = Array.from(arguments, (arg) => {
+                if (arg && arg.nodeType == this.ELEMENT_NODE) {
+                  return logNode(arg);
+                }
+                return arg;
+              });
+              args.unshift("Reader: (Readability)");
+              console.log(...args);
+            } else if (typeof dump !== "undefined") {
+              var msg = Array.prototype.map.call(arguments, function(x) {
+                return x && x.nodeName ? logNode(x) : x;
+              }).join(" ");
+              dump("Reader: (Readability) " + msg + "\n");
+            }
+          };
+        } else {
+          this.log = function() {
+          };
+        }
+      }
+      Readability.prototype = {
+        FLAG_STRIP_UNLIKELYS: 1,
+        FLAG_WEIGHT_CLASSES: 2,
+        FLAG_CLEAN_CONDITIONALLY: 4,
+        // https://developer.mozilla.org/en-US/docs/Web/API/Node/nodeType
+        ELEMENT_NODE: 1,
+        TEXT_NODE: 3,
+        // Max number of nodes supported by this parser. Default: 0 (no limit)
+        DEFAULT_MAX_ELEMS_TO_PARSE: 0,
+        // The number of top candidates to consider when analysing how
+        // tight the competition is among candidates.
+        DEFAULT_N_TOP_CANDIDATES: 5,
+        // Element tags to score by default.
+        DEFAULT_TAGS_TO_SCORE: "section,h2,h3,h4,h5,h6,p,td,pre".toUpperCase().split(","),
+        // The default number of chars an article must have in order to return a result
+        DEFAULT_CHAR_THRESHOLD: 500,
+        // All of the regular expressions in use within readability.
+        // Defined up here so we don't instantiate them repeatedly in loops.
+        REGEXPS: {
+          // NOTE: These two regular expressions are duplicated in
+          // Readability-readerable.js. Please keep both copies in sync.
+          unlikelyCandidates: /-ad-|ai2html|banner|breadcrumbs|combx|comment|community|cover-wrap|disqus|extra|footer|gdpr|header|legends|menu|related|remark|replies|rss|shoutbox|sidebar|skyscraper|social|sponsor|supplemental|ad-break|agegate|pagination|pager|popup|yom-remote/i,
+          okMaybeItsACandidate: /and|article|body|column|content|main|shadow/i,
+          positive: /article|body|content|entry|hentry|h-entry|main|page|pagination|post|text|blog|story/i,
+          negative: /-ad-|hidden|^hid$| hid$| hid |^hid |banner|combx|comment|com-|contact|footer|gdpr|masthead|media|meta|outbrain|promo|related|scroll|share|shoutbox|sidebar|skyscraper|sponsor|shopping|tags|widget/i,
+          extraneous: /print|archive|comment|discuss|e[\-]?mail|share|reply|all|login|sign|single|utility/i,
+          byline: /byline|author|dateline|writtenby|p-author/i,
+          replaceFonts: /<(\/?)font[^>]*>/gi,
+          normalize: /\s{2,}/g,
+          videos: /\/\/(www\.)?((dailymotion|youtube|youtube-nocookie|player\.vimeo|v\.qq)\.com|(archive|upload\.wikimedia)\.org|player\.twitch\.tv)/i,
+          shareElements: /(\b|_)(share|sharedaddy)(\b|_)/i,
+          nextLink: /(next|weiter|continue|>([^\|]|$)|»([^\|]|$))/i,
+          prevLink: /(prev|earl|old|new|<|«)/i,
+          tokenize: /\W+/g,
+          whitespace: /^\s*$/,
+          hasContent: /\S$/,
+          hashUrl: /^#.+/,
+          srcsetUrl: /(\S+)(\s+[\d.]+[xw])?(\s*(?:,|$))/g,
+          b64DataUrl: /^data:\s*([^\s;,]+)\s*;\s*base64\s*,/i,
+          // Commas as used in Latin, Sindhi, Chinese and various other scripts.
+          // see: https://en.wikipedia.org/wiki/Comma#Comma_variants
+          commas: /\u002C|\u060C|\uFE50|\uFE10|\uFE11|\u2E41|\u2E34|\u2E32|\uFF0C/g,
+          // See: https://schema.org/Article
+          jsonLdArticleTypes: /^Article|AdvertiserContentArticle|NewsArticle|AnalysisNewsArticle|AskPublicNewsArticle|BackgroundNewsArticle|OpinionNewsArticle|ReportageNewsArticle|ReviewNewsArticle|Report|SatiricalArticle|ScholarlyArticle|MedicalScholarlyArticle|SocialMediaPosting|BlogPosting|LiveBlogPosting|DiscussionForumPosting|TechArticle|APIReference$/,
+          // used to see if a node's content matches words commonly used for ad blocks or loading indicators
+          adWords: /^(ad(vertising|vertisement)?|pub(licité)?|werb(ung)?|广告|Реклама|Anuncio)$/iu,
+          loadingWords: /^((loading|正在加载|Загрузка|chargement|cargando)(…|\.\.\.)?)$/iu
+        },
+        UNLIKELY_ROLES: [
+          "menu",
+          "menubar",
+          "complementary",
+          "navigation",
+          "alert",
+          "alertdialog",
+          "dialog"
+        ],
+        DIV_TO_P_ELEMS: /* @__PURE__ */ new Set([
+          "BLOCKQUOTE",
+          "DL",
+          "DIV",
+          "IMG",
+          "OL",
+          "P",
+          "PRE",
+          "TABLE",
+          "UL"
+        ]),
+        ALTER_TO_DIV_EXCEPTIONS: ["DIV", "ARTICLE", "SECTION", "P", "OL", "UL"],
+        PRESENTATIONAL_ATTRIBUTES: [
+          "align",
+          "background",
+          "bgcolor",
+          "border",
+          "cellpadding",
+          "cellspacing",
+          "frame",
+          "hspace",
+          "rules",
+          "style",
+          "valign",
+          "vspace"
+        ],
+        DEPRECATED_SIZE_ATTRIBUTE_ELEMS: ["TABLE", "TH", "TD", "HR", "PRE"],
+        // The commented out elements qualify as phrasing content but tend to be
+        // removed by readability when put into paragraphs, so we ignore them here.
+        PHRASING_ELEMS: [
+          // "CANVAS", "IFRAME", "SVG", "VIDEO",
+          "ABBR",
+          "AUDIO",
+          "B",
+          "BDO",
+          "BR",
+          "BUTTON",
+          "CITE",
+          "CODE",
+          "DATA",
+          "DATALIST",
+          "DFN",
+          "EM",
+          "EMBED",
+          "I",
+          "IMG",
+          "INPUT",
+          "KBD",
+          "LABEL",
+          "MARK",
+          "MATH",
+          "METER",
+          "NOSCRIPT",
+          "OBJECT",
+          "OUTPUT",
+          "PROGRESS",
+          "Q",
+          "RUBY",
+          "SAMP",
+          "SCRIPT",
+          "SELECT",
+          "SMALL",
+          "SPAN",
+          "STRONG",
+          "SUB",
+          "SUP",
+          "TEXTAREA",
+          "TIME",
+          "VAR",
+          "WBR"
+        ],
+        // These are the classes that readability sets itself.
+        CLASSES_TO_PRESERVE: ["page"],
+        // These are the list of HTML entities that need to be escaped.
+        HTML_ESCAPE_MAP: {
+          lt: "<",
+          gt: ">",
+          amp: "&",
+          quot: '"',
+          apos: "'"
+        },
+        /**
+         * Run any post-process modifications to article content as necessary.
+         *
+         * @param Element
+         * @return void
+         **/
+        _postProcessContent(articleContent) {
+          this._fixRelativeUris(articleContent);
+          this._simplifyNestedElements(articleContent);
+          if (!this._keepClasses) {
+            this._cleanClasses(articleContent);
+          }
+        },
+        /**
+         * Iterates over a NodeList, calls `filterFn` for each node and removes node
+         * if function returned `true`.
+         *
+         * If function is not passed, removes all the nodes in node list.
+         *
+         * @param NodeList nodeList The nodes to operate on
+         * @param Function filterFn the function to use as a filter
+         * @return void
+         */
+        _removeNodes(nodeList, filterFn) {
+          if (this._docJSDOMParser && nodeList._isLiveNodeList) {
+            throw new Error("Do not pass live node lists to _removeNodes");
+          }
+          for (var i = nodeList.length - 1; i >= 0; i--) {
+            var node = nodeList[i];
+            var parentNode = node.parentNode;
+            if (parentNode) {
+              if (!filterFn || filterFn.call(this, node, i, nodeList)) {
+                parentNode.removeChild(node);
+              }
+            }
+          }
+        },
+        /**
+         * Iterates over a NodeList, and calls _setNodeTag for each node.
+         *
+         * @param NodeList nodeList The nodes to operate on
+         * @param String newTagName the new tag name to use
+         * @return void
+         */
+        _replaceNodeTags(nodeList, newTagName) {
+          if (this._docJSDOMParser && nodeList._isLiveNodeList) {
+            throw new Error("Do not pass live node lists to _replaceNodeTags");
+          }
+          for (const node of nodeList) {
+            this._setNodeTag(node, newTagName);
+          }
+        },
+        /**
+         * Iterate over a NodeList, which doesn't natively fully implement the Array
+         * interface.
+         *
+         * For convenience, the current object context is applied to the provided
+         * iterate function.
+         *
+         * @param  NodeList nodeList The NodeList.
+         * @param  Function fn       The iterate function.
+         * @return void
+         */
+        _forEachNode(nodeList, fn) {
+          Array.prototype.forEach.call(nodeList, fn, this);
+        },
+        /**
+         * Iterate over a NodeList, and return the first node that passes
+         * the supplied test function
+         *
+         * For convenience, the current object context is applied to the provided
+         * test function.
+         *
+         * @param  NodeList nodeList The NodeList.
+         * @param  Function fn       The test function.
+         * @return void
+         */
+        _findNode(nodeList, fn) {
+          return Array.prototype.find.call(nodeList, fn, this);
+        },
+        /**
+         * Iterate over a NodeList, return true if any of the provided iterate
+         * function calls returns true, false otherwise.
+         *
+         * For convenience, the current object context is applied to the
+         * provided iterate function.
+         *
+         * @param  NodeList nodeList The NodeList.
+         * @param  Function fn       The iterate function.
+         * @return Boolean
+         */
+        _someNode(nodeList, fn) {
+          return Array.prototype.some.call(nodeList, fn, this);
+        },
+        /**
+         * Iterate over a NodeList, return true if all of the provided iterate
+         * function calls return true, false otherwise.
+         *
+         * For convenience, the current object context is applied to the
+         * provided iterate function.
+         *
+         * @param  NodeList nodeList The NodeList.
+         * @param  Function fn       The iterate function.
+         * @return Boolean
+         */
+        _everyNode(nodeList, fn) {
+          return Array.prototype.every.call(nodeList, fn, this);
+        },
+        _getAllNodesWithTag(node, tagNames) {
+          if (node.querySelectorAll) {
+            return node.querySelectorAll(tagNames.join(","));
+          }
+          return [].concat.apply(
+            [],
+            tagNames.map(function(tag) {
+              var collection = node.getElementsByTagName(tag);
+              return Array.isArray(collection) ? collection : Array.from(collection);
+            })
+          );
+        },
+        /**
+         * Removes the class="" attribute from every element in the given
+         * subtree, except those that match CLASSES_TO_PRESERVE and
+         * the classesToPreserve array from the options object.
+         *
+         * @param Element
+         * @return void
+         */
+        _cleanClasses(node) {
+          var classesToPreserve = this._classesToPreserve;
+          var className = (node.getAttribute("class") || "").split(/\s+/).filter((cls) => classesToPreserve.includes(cls)).join(" ");
+          if (className) {
+            node.setAttribute("class", className);
+          } else {
+            node.removeAttribute("class");
+          }
+          for (node = node.firstElementChild; node; node = node.nextElementSibling) {
+            this._cleanClasses(node);
+          }
+        },
+        /**
+         * Tests whether a string is a URL or not.
+         *
+         * @param {string} str The string to test
+         * @return {boolean} true if str is a URL, false if not
+         */
+        _isUrl(str) {
+          try {
+            new URL(str);
+            return true;
+          } catch {
+            return false;
+          }
+        },
+        /**
+         * Converts each <a> and <img> uri in the given element to an absolute URI,
+         * ignoring #ref URIs.
+         *
+         * @param Element
+         * @return void
+         */
+        _fixRelativeUris(articleContent) {
+          var baseURI = this._doc.baseURI;
+          var documentURI = this._doc.documentURI;
+          function toAbsoluteURI(uri) {
+            if (baseURI == documentURI && uri.charAt(0) == "#") {
+              return uri;
+            }
+            try {
+              return new URL(uri, baseURI).href;
+            } catch (ex) {
+            }
+            return uri;
+          }
+          var links = this._getAllNodesWithTag(articleContent, ["a"]);
+          this._forEachNode(links, function(link) {
+            var href = link.getAttribute("href");
+            if (href) {
+              if (href.indexOf("javascript:") === 0) {
+                if (link.childNodes.length === 1 && link.childNodes[0].nodeType === this.TEXT_NODE) {
+                  var text = this._doc.createTextNode(link.textContent);
+                  link.parentNode.replaceChild(text, link);
+                } else {
+                  var container = this._doc.createElement("span");
+                  while (link.firstChild) {
+                    container.appendChild(link.firstChild);
+                  }
+                  link.parentNode.replaceChild(container, link);
+                }
+              } else {
+                link.setAttribute("href", toAbsoluteURI(href));
+              }
+            }
+          });
+          var medias = this._getAllNodesWithTag(articleContent, [
+            "img",
+            "picture",
+            "figure",
+            "video",
+            "audio",
+            "source"
+          ]);
+          this._forEachNode(medias, function(media) {
+            var src = media.getAttribute("src");
+            var poster = media.getAttribute("poster");
+            var srcset = media.getAttribute("srcset");
+            if (src) {
+              media.setAttribute("src", toAbsoluteURI(src));
+            }
+            if (poster) {
+              media.setAttribute("poster", toAbsoluteURI(poster));
+            }
+            if (srcset) {
+              var newSrcset = srcset.replace(
+                this.REGEXPS.srcsetUrl,
+                function(_, p1, p2, p3) {
+                  return toAbsoluteURI(p1) + (p2 || "") + p3;
+                }
+              );
+              media.setAttribute("srcset", newSrcset);
+            }
+          });
+        },
+        _simplifyNestedElements(articleContent) {
+          var node = articleContent;
+          while (node) {
+            if (node.parentNode && ["DIV", "SECTION"].includes(node.tagName) && !(node.id && node.id.startsWith("readability"))) {
+              if (this._isElementWithoutContent(node)) {
+                node = this._removeAndGetNext(node);
+                continue;
+              } else if (this._hasSingleTagInsideElement(node, "DIV") || this._hasSingleTagInsideElement(node, "SECTION")) {
+                var child = node.children[0];
+                for (var i = 0; i < node.attributes.length; i++) {
+                  child.setAttributeNode(node.attributes[i].cloneNode());
+                }
+                node.parentNode.replaceChild(child, node);
+                node = child;
+                continue;
+              }
+            }
+            node = this._getNextNode(node);
+          }
+        },
+        /**
+         * Get the article title as an H1.
+         *
+         * @return string
+         **/
+        _getArticleTitle() {
+          var doc = this._doc;
+          var curTitle = "";
+          var origTitle = "";
+          try {
+            curTitle = origTitle = doc.title.trim();
+            if (typeof curTitle !== "string") {
+              curTitle = origTitle = this._getInnerText(
+                doc.getElementsByTagName("title")[0]
+              );
+            }
+          } catch (e) {
+          }
+          var titleHadHierarchicalSeparators = false;
+          function wordCount(str) {
+            return str.split(/\s+/).length;
+          }
+          if (/ [\|\-\\\/>»] /.test(curTitle)) {
+            titleHadHierarchicalSeparators = / [\\\/>»] /.test(curTitle);
+            let allSeparators = Array.from(origTitle.matchAll(/ [\|\-\\\/>»] /gi));
+            curTitle = origTitle.substring(0, allSeparators.pop().index);
+            if (wordCount(curTitle) < 3) {
+              curTitle = origTitle.replace(/^[^\|\-\\\/>»]*[\|\-\\\/>»]/gi, "");
+            }
+          } else if (curTitle.includes(": ")) {
+            var headings = this._getAllNodesWithTag(doc, ["h1", "h2"]);
+            var trimmedTitle = curTitle.trim();
+            var match = this._someNode(headings, function(heading) {
+              return heading.textContent.trim() === trimmedTitle;
+            });
+            if (!match) {
+              curTitle = origTitle.substring(origTitle.lastIndexOf(":") + 1);
+              if (wordCount(curTitle) < 3) {
+                curTitle = origTitle.substring(origTitle.indexOf(":") + 1);
+              } else if (wordCount(origTitle.substr(0, origTitle.indexOf(":"))) > 5) {
+                curTitle = origTitle;
+              }
+            }
+          } else if (curTitle.length > 150 || curTitle.length < 15) {
+            var hOnes = doc.getElementsByTagName("h1");
+            if (hOnes.length === 1) {
+              curTitle = this._getInnerText(hOnes[0]);
+            }
+          }
+          curTitle = curTitle.trim().replace(this.REGEXPS.normalize, " ");
+          var curTitleWordCount = wordCount(curTitle);
+          if (curTitleWordCount <= 4 && (!titleHadHierarchicalSeparators || curTitleWordCount != wordCount(origTitle.replace(/[\|\-\\\/>»]+/g, "")) - 1)) {
+            curTitle = origTitle;
+          }
+          return curTitle;
+        },
+        /**
+         * Prepare the HTML document for readability to scrape it.
+         * This includes things like stripping javascript, CSS, and handling terrible markup.
+         *
+         * @return void
+         **/
+        _prepDocument() {
+          var doc = this._doc;
+          this._removeNodes(this._getAllNodesWithTag(doc, ["style"]));
+          if (doc.body) {
+            this._replaceBrs(doc.body);
+          }
+          this._replaceNodeTags(this._getAllNodesWithTag(doc, ["font"]), "SPAN");
+        },
+        /**
+         * Finds the next node, starting from the given node, and ignoring
+         * whitespace in between. If the given node is an element, the same node is
+         * returned.
+         */
+        _nextNode(node) {
+          var next = node;
+          while (next && next.nodeType != this.ELEMENT_NODE && this.REGEXPS.whitespace.test(next.textContent)) {
+            next = next.nextSibling;
+          }
+          return next;
+        },
+        /**
+         * Replaces 2 or more successive <br> elements with a single <p>.
+         * Whitespace between <br> elements are ignored. For example:
+         *   <div>foo<br>bar<br> <br><br>abc</div>
+         * will become:
+         *   <div>foo<br>bar<p>abc</p></div>
+         */
+        _replaceBrs(elem) {
+          this._forEachNode(this._getAllNodesWithTag(elem, ["br"]), function(br) {
+            var next = br.nextSibling;
+            var replaced = false;
+            while ((next = this._nextNode(next)) && next.tagName == "BR") {
+              replaced = true;
+              var brSibling = next.nextSibling;
+              next.remove();
+              next = brSibling;
+            }
+            if (replaced) {
+              var p = this._doc.createElement("p");
+              br.parentNode.replaceChild(p, br);
+              next = p.nextSibling;
+              while (next) {
+                if (next.tagName == "BR") {
+                  var nextElem = this._nextNode(next.nextSibling);
+                  if (nextElem && nextElem.tagName == "BR") {
+                    break;
+                  }
+                }
+                if (!this._isPhrasingContent(next)) {
+                  break;
+                }
+                var sibling = next.nextSibling;
+                p.appendChild(next);
+                next = sibling;
+              }
+              while (p.lastChild && this._isWhitespace(p.lastChild)) {
+                p.lastChild.remove();
+              }
+              if (p.parentNode.tagName === "P") {
+                this._setNodeTag(p.parentNode, "DIV");
+              }
+            }
+          });
+        },
+        _setNodeTag(node, tag) {
+          this.log("_setNodeTag", node, tag);
+          if (this._docJSDOMParser) {
+            node.localName = tag.toLowerCase();
+            node.tagName = tag.toUpperCase();
+            return node;
+          }
+          var replacement = node.ownerDocument.createElement(tag);
+          while (node.firstChild) {
+            replacement.appendChild(node.firstChild);
+          }
+          node.parentNode.replaceChild(replacement, node);
+          if (node.readability) {
+            replacement.readability = node.readability;
+          }
+          for (var i = 0; i < node.attributes.length; i++) {
+            replacement.setAttributeNode(node.attributes[i].cloneNode());
+          }
+          return replacement;
+        },
+        /**
+         * Prepare the article node for display. Clean out any inline styles,
+         * iframes, forms, strip extraneous <p> tags, etc.
+         *
+         * @param Element
+         * @return void
+         **/
+        _prepArticle(articleContent) {
+          this._cleanStyles(articleContent);
+          this._markDataTables(articleContent);
+          this._fixLazyImages(articleContent);
+          this._cleanConditionally(articleContent, "form");
+          this._cleanConditionally(articleContent, "fieldset");
+          this._clean(articleContent, "object");
+          this._clean(articleContent, "embed");
+          this._clean(articleContent, "footer");
+          this._clean(articleContent, "link");
+          this._clean(articleContent, "aside");
+          var shareElementThreshold = this.DEFAULT_CHAR_THRESHOLD;
+          this._forEachNode(articleContent.children, function(topCandidate) {
+            this._cleanMatchedNodes(topCandidate, function(node, matchString) {
+              return this.REGEXPS.shareElements.test(matchString) && node.textContent.length < shareElementThreshold;
+            });
+          });
+          this._clean(articleContent, "iframe");
+          this._clean(articleContent, "input");
+          this._clean(articleContent, "textarea");
+          this._clean(articleContent, "select");
+          this._clean(articleContent, "button");
+          this._cleanHeaders(articleContent);
+          this._cleanConditionally(articleContent, "table");
+          this._cleanConditionally(articleContent, "ul");
+          this._cleanConditionally(articleContent, "div");
+          this._replaceNodeTags(
+            this._getAllNodesWithTag(articleContent, ["h1"]),
+            "h2"
+          );
+          this._removeNodes(
+            this._getAllNodesWithTag(articleContent, ["p"]),
+            function(paragraph) {
+              var contentElementCount = this._getAllNodesWithTag(paragraph, [
+                "img",
+                "embed",
+                "object",
+                "iframe"
+              ]).length;
+              return contentElementCount === 0 && !this._getInnerText(paragraph, false);
+            }
+          );
+          this._forEachNode(
+            this._getAllNodesWithTag(articleContent, ["br"]),
+            function(br) {
+              var next = this._nextNode(br.nextSibling);
+              if (next && next.tagName == "P") {
+                br.remove();
+              }
+            }
+          );
+          this._forEachNode(
+            this._getAllNodesWithTag(articleContent, ["table"]),
+            function(table) {
+              var tbody = this._hasSingleTagInsideElement(table, "TBODY") ? table.firstElementChild : table;
+              if (this._hasSingleTagInsideElement(tbody, "TR")) {
+                var row = tbody.firstElementChild;
+                if (this._hasSingleTagInsideElement(row, "TD")) {
+                  var cell = row.firstElementChild;
+                  cell = this._setNodeTag(
+                    cell,
+                    this._everyNode(cell.childNodes, this._isPhrasingContent) ? "P" : "DIV"
+                  );
+                  table.parentNode.replaceChild(cell, table);
+                }
+              }
+            }
+          );
+        },
+        /**
+         * Initialize a node with the readability object. Also checks the
+         * className/id for special names to add to its score.
+         *
+         * @param Element
+         * @return void
+         **/
+        _initializeNode(node) {
+          node.readability = { contentScore: 0 };
+          switch (node.tagName) {
+            case "DIV":
+              node.readability.contentScore += 5;
+              break;
+            case "PRE":
+            case "TD":
+            case "BLOCKQUOTE":
+              node.readability.contentScore += 3;
+              break;
+            case "ADDRESS":
+            case "OL":
+            case "UL":
+            case "DL":
+            case "DD":
+            case "DT":
+            case "LI":
+            case "FORM":
+              node.readability.contentScore -= 3;
+              break;
+            case "H1":
+            case "H2":
+            case "H3":
+            case "H4":
+            case "H5":
+            case "H6":
+            case "TH":
+              node.readability.contentScore -= 5;
+              break;
+          }
+          node.readability.contentScore += this._getClassWeight(node);
+        },
+        _removeAndGetNext(node) {
+          var nextNode = this._getNextNode(node, true);
+          node.remove();
+          return nextNode;
+        },
+        /**
+         * Traverse the DOM from node to node, starting at the node passed in.
+         * Pass true for the second parameter to indicate this node itself
+         * (and its kids) are going away, and we want the next node over.
+         *
+         * Calling this in a loop will traverse the DOM depth-first.
+         *
+         * @param {Element} node
+         * @param {boolean} ignoreSelfAndKids
+         * @return {Element}
+         */
+        _getNextNode(node, ignoreSelfAndKids) {
+          if (!ignoreSelfAndKids && node.firstElementChild) {
+            return node.firstElementChild;
+          }
+          if (node.nextElementSibling) {
+            return node.nextElementSibling;
+          }
+          do {
+            node = node.parentNode;
+          } while (node && !node.nextElementSibling);
+          return node && node.nextElementSibling;
+        },
+        // compares second text to first one
+        // 1 = same text, 0 = completely different text
+        // works the way that it splits both texts into words and then finds words that are unique in second text
+        // the result is given by the lower length of unique parts
+        _textSimilarity(textA, textB) {
+          var tokensA = textA.toLowerCase().split(this.REGEXPS.tokenize).filter(Boolean);
+          var tokensB = textB.toLowerCase().split(this.REGEXPS.tokenize).filter(Boolean);
+          if (!tokensA.length || !tokensB.length) {
+            return 0;
+          }
+          var uniqTokensB = tokensB.filter((token) => !tokensA.includes(token));
+          var distanceB = uniqTokensB.join(" ").length / tokensB.join(" ").length;
+          return 1 - distanceB;
+        },
+        /**
+         * Checks whether an element node contains a valid byline
+         *
+         * @param node {Element}
+         * @param matchString {string}
+         * @return boolean
+         */
+        _isValidByline(node, matchString) {
+          var rel = node.getAttribute("rel");
+          var itemprop = node.getAttribute("itemprop");
+          var bylineLength = node.textContent.trim().length;
+          return (rel === "author" || itemprop && itemprop.includes("author") || this.REGEXPS.byline.test(matchString)) && !!bylineLength && bylineLength < 100;
+        },
+        _getNodeAncestors(node, maxDepth) {
+          maxDepth = maxDepth || 0;
+          var i = 0, ancestors = [];
+          while (node.parentNode) {
+            ancestors.push(node.parentNode);
+            if (maxDepth && ++i === maxDepth) {
+              break;
+            }
+            node = node.parentNode;
+          }
+          return ancestors;
+        },
+        /***
+         * grabArticle - Using a variety of metrics (content score, classname, element types), find the content that is
+         *         most likely to be the stuff a user wants to read. Then return it wrapped up in a div.
+         *
+         * @param page a document to run upon. Needs to be a full document, complete with body.
+         * @return Element
+         **/
+        /* eslint-disable-next-line complexity */
+        _grabArticle(page) {
+          this.log("**** grabArticle ****");
+          var doc = this._doc;
+          var isPaging = page !== null;
+          page = page ? page : this._doc.body;
+          if (!page) {
+            this.log("No body found in document. Abort.");
+            return null;
+          }
+          var pageCacheHtml = page.innerHTML;
+          while (true) {
+            this.log("Starting grabArticle loop");
+            var stripUnlikelyCandidates = this._flagIsActive(
+              this.FLAG_STRIP_UNLIKELYS
+            );
+            var elementsToScore = [];
+            var node = this._doc.documentElement;
+            let shouldRemoveTitleHeader = true;
+            while (node) {
+              if (node.tagName === "HTML") {
+                this._articleLang = node.getAttribute("lang");
+              }
+              var matchString = node.className + " " + node.id;
+              if (!this._isProbablyVisible(node)) {
+                this.log("Removing hidden node - " + matchString);
+                node = this._removeAndGetNext(node);
+                continue;
+              }
+              if (node.getAttribute("aria-modal") == "true" && node.getAttribute("role") == "dialog") {
+                node = this._removeAndGetNext(node);
+                continue;
+              }
+              if (!this._articleByline && !this._metadata.byline && this._isValidByline(node, matchString)) {
+                var endOfSearchMarkerNode = this._getNextNode(node, true);
+                var next = this._getNextNode(node);
+                var itemPropNameNode = null;
+                while (next && next != endOfSearchMarkerNode) {
+                  var itemprop = next.getAttribute("itemprop");
+                  if (itemprop && itemprop.includes("name")) {
+                    itemPropNameNode = next;
+                    break;
+                  } else {
+                    next = this._getNextNode(next);
+                  }
+                }
+                this._articleByline = (itemPropNameNode ?? node).textContent.trim();
+                node = this._removeAndGetNext(node);
+                continue;
+              }
+              if (shouldRemoveTitleHeader && this._headerDuplicatesTitle(node)) {
+                this.log(
+                  "Removing header: ",
+                  node.textContent.trim(),
+                  this._articleTitle.trim()
+                );
+                shouldRemoveTitleHeader = false;
+                node = this._removeAndGetNext(node);
+                continue;
+              }
+              if (stripUnlikelyCandidates) {
+                if (this.REGEXPS.unlikelyCandidates.test(matchString) && !this.REGEXPS.okMaybeItsACandidate.test(matchString) && !this._hasAncestorTag(node, "table") && !this._hasAncestorTag(node, "code") && node.tagName !== "BODY" && node.tagName !== "A") {
+                  this.log("Removing unlikely candidate - " + matchString);
+                  node = this._removeAndGetNext(node);
+                  continue;
+                }
+                if (this.UNLIKELY_ROLES.includes(node.getAttribute("role"))) {
+                  this.log(
+                    "Removing content with role " + node.getAttribute("role") + " - " + matchString
+                  );
+                  node = this._removeAndGetNext(node);
+                  continue;
+                }
+              }
+              if ((node.tagName === "DIV" || node.tagName === "SECTION" || node.tagName === "HEADER" || node.tagName === "H1" || node.tagName === "H2" || node.tagName === "H3" || node.tagName === "H4" || node.tagName === "H5" || node.tagName === "H6") && this._isElementWithoutContent(node)) {
+                node = this._removeAndGetNext(node);
+                continue;
+              }
+              if (this.DEFAULT_TAGS_TO_SCORE.includes(node.tagName)) {
+                elementsToScore.push(node);
+              }
+              if (node.tagName === "DIV") {
+                var p = null;
+                var childNode = node.firstChild;
+                while (childNode) {
+                  var nextSibling = childNode.nextSibling;
+                  if (this._isPhrasingContent(childNode)) {
+                    if (p !== null) {
+                      p.appendChild(childNode);
+                    } else if (!this._isWhitespace(childNode)) {
+                      p = doc.createElement("p");
+                      node.replaceChild(p, childNode);
+                      p.appendChild(childNode);
+                    }
+                  } else if (p !== null) {
+                    while (p.lastChild && this._isWhitespace(p.lastChild)) {
+                      p.lastChild.remove();
+                    }
+                    p = null;
+                  }
+                  childNode = nextSibling;
+                }
+                if (this._hasSingleTagInsideElement(node, "P") && this._getLinkDensity(node) < 0.25) {
+                  var newNode = node.children[0];
+                  node.parentNode.replaceChild(newNode, node);
+                  node = newNode;
+                  elementsToScore.push(node);
+                } else if (!this._hasChildBlockElement(node)) {
+                  node = this._setNodeTag(node, "P");
+                  elementsToScore.push(node);
+                }
+              }
+              node = this._getNextNode(node);
+            }
+            var candidates = [];
+            this._forEachNode(elementsToScore, function(elementToScore) {
+              if (!elementToScore.parentNode || typeof elementToScore.parentNode.tagName === "undefined") {
+                return;
+              }
+              var innerText = this._getInnerText(elementToScore);
+              if (innerText.length < 25) {
+                return;
+              }
+              var ancestors2 = this._getNodeAncestors(elementToScore, 5);
+              if (ancestors2.length === 0) {
+                return;
+              }
+              var contentScore = 0;
+              contentScore += 1;
+              contentScore += innerText.split(this.REGEXPS.commas).length;
+              contentScore += Math.min(Math.floor(innerText.length / 100), 3);
+              this._forEachNode(ancestors2, function(ancestor, level) {
+                if (!ancestor.tagName || !ancestor.parentNode || typeof ancestor.parentNode.tagName === "undefined") {
+                  return;
+                }
+                if (typeof ancestor.readability === "undefined") {
+                  this._initializeNode(ancestor);
+                  candidates.push(ancestor);
+                }
+                if (level === 0) {
+                  var scoreDivider = 1;
+                } else if (level === 1) {
+                  scoreDivider = 2;
+                } else {
+                  scoreDivider = level * 3;
+                }
+                ancestor.readability.contentScore += contentScore / scoreDivider;
+              });
+            });
+            var topCandidates = [];
+            for (var c = 0, cl = candidates.length; c < cl; c += 1) {
+              var candidate = candidates[c];
+              var candidateScore = candidate.readability.contentScore * (1 - this._getLinkDensity(candidate));
+              candidate.readability.contentScore = candidateScore;
+              this.log("Candidate:", candidate, "with score " + candidateScore);
+              for (var t = 0; t < this._nbTopCandidates; t++) {
+                var aTopCandidate = topCandidates[t];
+                if (!aTopCandidate || candidateScore > aTopCandidate.readability.contentScore) {
+                  topCandidates.splice(t, 0, candidate);
+                  if (topCandidates.length > this._nbTopCandidates) {
+                    topCandidates.pop();
+                  }
+                  break;
+                }
+              }
+            }
+            var topCandidate = topCandidates[0] || null;
+            var neededToCreateTopCandidate = false;
+            var parentOfTopCandidate;
+            if (topCandidate === null || topCandidate.tagName === "BODY") {
+              topCandidate = doc.createElement("DIV");
+              neededToCreateTopCandidate = true;
+              while (page.firstChild) {
+                this.log("Moving child out:", page.firstChild);
+                topCandidate.appendChild(page.firstChild);
+              }
+              page.appendChild(topCandidate);
+              this._initializeNode(topCandidate);
+            } else if (topCandidate) {
+              var alternativeCandidateAncestors = [];
+              for (var i = 1; i < topCandidates.length; i++) {
+                if (topCandidates[i].readability.contentScore / topCandidate.readability.contentScore >= 0.75) {
+                  alternativeCandidateAncestors.push(
+                    this._getNodeAncestors(topCandidates[i])
+                  );
+                }
+              }
+              var MINIMUM_TOPCANDIDATES = 3;
+              if (alternativeCandidateAncestors.length >= MINIMUM_TOPCANDIDATES) {
+                parentOfTopCandidate = topCandidate.parentNode;
+                while (parentOfTopCandidate.tagName !== "BODY") {
+                  var listsContainingThisAncestor = 0;
+                  for (var ancestorIndex = 0; ancestorIndex < alternativeCandidateAncestors.length && listsContainingThisAncestor < MINIMUM_TOPCANDIDATES; ancestorIndex++) {
+                    listsContainingThisAncestor += Number(
+                      alternativeCandidateAncestors[ancestorIndex].includes(
+                        parentOfTopCandidate
+                      )
+                    );
+                  }
+                  if (listsContainingThisAncestor >= MINIMUM_TOPCANDIDATES) {
+                    topCandidate = parentOfTopCandidate;
+                    break;
+                  }
+                  parentOfTopCandidate = parentOfTopCandidate.parentNode;
+                }
+              }
+              if (!topCandidate.readability) {
+                this._initializeNode(topCandidate);
+              }
+              parentOfTopCandidate = topCandidate.parentNode;
+              var lastScore = topCandidate.readability.contentScore;
+              var scoreThreshold = lastScore / 3;
+              while (parentOfTopCandidate.tagName !== "BODY") {
+                if (!parentOfTopCandidate.readability) {
+                  parentOfTopCandidate = parentOfTopCandidate.parentNode;
+                  continue;
+                }
+                var parentScore = parentOfTopCandidate.readability.contentScore;
+                if (parentScore < scoreThreshold) {
+                  break;
+                }
+                if (parentScore > lastScore) {
+                  topCandidate = parentOfTopCandidate;
+                  break;
+                }
+                lastScore = parentOfTopCandidate.readability.contentScore;
+                parentOfTopCandidate = parentOfTopCandidate.parentNode;
+              }
+              parentOfTopCandidate = topCandidate.parentNode;
+              while (parentOfTopCandidate.tagName != "BODY" && parentOfTopCandidate.children.length == 1) {
+                topCandidate = parentOfTopCandidate;
+                parentOfTopCandidate = topCandidate.parentNode;
+              }
+              if (!topCandidate.readability) {
+                this._initializeNode(topCandidate);
+              }
+            }
+            var articleContent = doc.createElement("DIV");
+            if (isPaging) {
+              articleContent.id = "readability-content";
+            }
+            var siblingScoreThreshold = Math.max(
+              10,
+              topCandidate.readability.contentScore * 0.2
+            );
+            parentOfTopCandidate = topCandidate.parentNode;
+            var siblings = parentOfTopCandidate.children;
+            for (var s = 0, sl = siblings.length; s < sl; s++) {
+              var sibling = siblings[s];
+              var append = false;
+              this.log(
+                "Looking at sibling node:",
+                sibling,
+                sibling.readability ? "with score " + sibling.readability.contentScore : ""
+              );
+              this.log(
+                "Sibling has score",
+                sibling.readability ? sibling.readability.contentScore : "Unknown"
+              );
+              if (sibling === topCandidate) {
+                append = true;
+              } else {
+                var contentBonus = 0;
+                if (sibling.className === topCandidate.className && topCandidate.className !== "") {
+                  contentBonus += topCandidate.readability.contentScore * 0.2;
+                }
+                if (sibling.readability && sibling.readability.contentScore + contentBonus >= siblingScoreThreshold) {
+                  append = true;
+                } else if (sibling.nodeName === "P") {
+                  var linkDensity = this._getLinkDensity(sibling);
+                  var nodeContent = this._getInnerText(sibling);
+                  var nodeLength = nodeContent.length;
+                  if (nodeLength > 80 && linkDensity < 0.25) {
+                    append = true;
+                  } else if (nodeLength < 80 && nodeLength > 0 && linkDensity === 0 && nodeContent.search(/\.( |$)/) !== -1) {
+                    append = true;
+                  }
+                }
+              }
+              if (append) {
+                this.log("Appending node:", sibling);
+                if (!this.ALTER_TO_DIV_EXCEPTIONS.includes(sibling.nodeName)) {
+                  this.log("Altering sibling:", sibling, "to div.");
+                  sibling = this._setNodeTag(sibling, "DIV");
+                }
+                articleContent.appendChild(sibling);
+                siblings = parentOfTopCandidate.children;
+                s -= 1;
+                sl -= 1;
+              }
+            }
+            if (this._debug) {
+              this.log("Article content pre-prep: " + articleContent.innerHTML);
+            }
+            this._prepArticle(articleContent);
+            if (this._debug) {
+              this.log("Article content post-prep: " + articleContent.innerHTML);
+            }
+            if (neededToCreateTopCandidate) {
+              topCandidate.id = "readability-page-1";
+              topCandidate.className = "page";
+            } else {
+              var div = doc.createElement("DIV");
+              div.id = "readability-page-1";
+              div.className = "page";
+              while (articleContent.firstChild) {
+                div.appendChild(articleContent.firstChild);
+              }
+              articleContent.appendChild(div);
+            }
+            if (this._debug) {
+              this.log("Article content after paging: " + articleContent.innerHTML);
+            }
+            var parseSuccessful = true;
+            var textLength = this._getInnerText(articleContent, true).length;
+            if (textLength < this._charThreshold) {
+              parseSuccessful = false;
+              page.innerHTML = pageCacheHtml;
+              this._attempts.push({
+                articleContent,
+                textLength
+              });
+              if (this._flagIsActive(this.FLAG_STRIP_UNLIKELYS)) {
+                this._removeFlag(this.FLAG_STRIP_UNLIKELYS);
+              } else if (this._flagIsActive(this.FLAG_WEIGHT_CLASSES)) {
+                this._removeFlag(this.FLAG_WEIGHT_CLASSES);
+              } else if (this._flagIsActive(this.FLAG_CLEAN_CONDITIONALLY)) {
+                this._removeFlag(this.FLAG_CLEAN_CONDITIONALLY);
+              } else {
+                this._attempts.sort(function(a, b) {
+                  return b.textLength - a.textLength;
+                });
+                if (!this._attempts[0].textLength) {
+                  return null;
+                }
+                articleContent = this._attempts[0].articleContent;
+                parseSuccessful = true;
+              }
+            }
+            if (parseSuccessful) {
+              var ancestors = [parentOfTopCandidate, topCandidate].concat(
+                this._getNodeAncestors(parentOfTopCandidate)
+              );
+              this._someNode(ancestors, function(ancestor) {
+                if (!ancestor.tagName) {
+                  return false;
+                }
+                var articleDir = ancestor.getAttribute("dir");
+                if (articleDir) {
+                  this._articleDir = articleDir;
+                  return true;
+                }
+                return false;
+              });
+              return articleContent;
+            }
+          }
+        },
+        /**
+         * Converts some of the common HTML entities in string to their corresponding characters.
+         *
+         * @param str {string} - a string to unescape.
+         * @return string without HTML entity.
+         */
+        _unescapeHtmlEntities(str) {
+          if (!str) {
+            return str;
+          }
+          var htmlEscapeMap = this.HTML_ESCAPE_MAP;
+          return str.replace(/&(quot|amp|apos|lt|gt);/g, function(_, tag) {
+            return htmlEscapeMap[tag];
+          }).replace(/&#(?:x([0-9a-f]+)|([0-9]+));/gi, function(_, hex, numStr) {
+            var num = parseInt(hex || numStr, hex ? 16 : 10);
+            if (num == 0 || num > 1114111 || num >= 55296 && num <= 57343) {
+              num = 65533;
+            }
+            return String.fromCodePoint(num);
+          });
+        },
+        /**
+         * Try to extract metadata from JSON-LD object.
+         * For now, only Schema.org objects of type Article or its subtypes are supported.
+         * @return Object with any metadata that could be extracted (possibly none)
+         */
+        _getJSONLD(doc) {
+          var scripts = this._getAllNodesWithTag(doc, ["script"]);
+          var metadata;
+          this._forEachNode(scripts, function(jsonLdElement) {
+            if (!metadata && jsonLdElement.getAttribute("type") === "application/ld+json") {
+              try {
+                var content = jsonLdElement.textContent.replace(
+                  /^\s*<!\[CDATA\[|\]\]>\s*$/g,
+                  ""
+                );
+                var parsed = JSON.parse(content);
+                if (Array.isArray(parsed)) {
+                  parsed = parsed.find((it) => {
+                    return it["@type"] && it["@type"].match(this.REGEXPS.jsonLdArticleTypes);
+                  });
+                  if (!parsed) {
+                    return;
+                  }
+                }
+                var schemaDotOrgRegex = /^https?\:\/\/schema\.org\/?$/;
+                var matches = typeof parsed["@context"] === "string" && parsed["@context"].match(schemaDotOrgRegex) || typeof parsed["@context"] === "object" && typeof parsed["@context"]["@vocab"] == "string" && parsed["@context"]["@vocab"].match(schemaDotOrgRegex);
+                if (!matches) {
+                  return;
+                }
+                if (!parsed["@type"] && Array.isArray(parsed["@graph"])) {
+                  parsed = parsed["@graph"].find((it) => {
+                    return (it["@type"] || "").match(this.REGEXPS.jsonLdArticleTypes);
+                  });
+                }
+                if (!parsed || !parsed["@type"] || !parsed["@type"].match(this.REGEXPS.jsonLdArticleTypes)) {
+                  return;
+                }
+                metadata = {};
+                if (typeof parsed.name === "string" && typeof parsed.headline === "string" && parsed.name !== parsed.headline) {
+                  var title = this._getArticleTitle();
+                  var nameMatches = this._textSimilarity(parsed.name, title) > 0.75;
+                  var headlineMatches = this._textSimilarity(parsed.headline, title) > 0.75;
+                  if (headlineMatches && !nameMatches) {
+                    metadata.title = parsed.headline;
+                  } else {
+                    metadata.title = parsed.name;
+                  }
+                } else if (typeof parsed.name === "string") {
+                  metadata.title = parsed.name.trim();
+                } else if (typeof parsed.headline === "string") {
+                  metadata.title = parsed.headline.trim();
+                }
+                if (parsed.author) {
+                  if (typeof parsed.author.name === "string") {
+                    metadata.byline = parsed.author.name.trim();
+                  } else if (Array.isArray(parsed.author) && parsed.author[0] && typeof parsed.author[0].name === "string") {
+                    metadata.byline = parsed.author.filter(function(author) {
+                      return author && typeof author.name === "string";
+                    }).map(function(author) {
+                      return author.name.trim();
+                    }).join(", ");
+                  }
+                }
+                if (typeof parsed.description === "string") {
+                  metadata.excerpt = parsed.description.trim();
+                }
+                if (parsed.publisher && typeof parsed.publisher.name === "string") {
+                  metadata.siteName = parsed.publisher.name.trim();
+                }
+                if (typeof parsed.datePublished === "string") {
+                  metadata.datePublished = parsed.datePublished.trim();
+                }
+              } catch (err) {
+                this.log(err.message);
+              }
+            }
+          });
+          return metadata ? metadata : {};
+        },
+        /**
+         * Attempts to get excerpt and byline metadata for the article.
+         *
+         * @param {Object} jsonld — object containing any metadata that
+         * could be extracted from JSON-LD object.
+         *
+         * @return Object with optional "excerpt" and "byline" properties
+         */
+        _getArticleMetadata(jsonld) {
+          var metadata = {};
+          var values = {};
+          var metaElements = this._doc.getElementsByTagName("meta");
+          var propertyPattern = /\s*(article|dc|dcterm|og|twitter)\s*:\s*(author|creator|description|published_time|title|site_name)\s*/gi;
+          var namePattern = /^\s*(?:(dc|dcterm|og|twitter|parsely|weibo:(article|webpage))\s*[-\.:]\s*)?(author|creator|pub-date|description|title|site_name)\s*$/i;
+          this._forEachNode(metaElements, function(element) {
+            var elementName = element.getAttribute("name");
+            var elementProperty = element.getAttribute("property");
+            var content = element.getAttribute("content");
+            if (!content) {
+              return;
+            }
+            var matches = null;
+            var name = null;
+            if (elementProperty) {
+              matches = elementProperty.match(propertyPattern);
+              if (matches) {
+                name = matches[0].toLowerCase().replace(/\s/g, "");
+                values[name] = content.trim();
+              }
+            }
+            if (!matches && elementName && namePattern.test(elementName)) {
+              name = elementName;
+              if (content) {
+                name = name.toLowerCase().replace(/\s/g, "").replace(/\./g, ":");
+                values[name] = content.trim();
+              }
+            }
+          });
+          metadata.title = jsonld.title || values["dc:title"] || values["dcterm:title"] || values["og:title"] || values["weibo:article:title"] || values["weibo:webpage:title"] || values.title || values["twitter:title"] || values["parsely-title"];
+          if (!metadata.title) {
+            metadata.title = this._getArticleTitle();
+          }
+          const articleAuthor = typeof values["article:author"] === "string" && !this._isUrl(values["article:author"]) ? values["article:author"] : void 0;
+          metadata.byline = jsonld.byline || values["dc:creator"] || values["dcterm:creator"] || values.author || values["parsely-author"] || articleAuthor;
+          metadata.excerpt = jsonld.excerpt || values["dc:description"] || values["dcterm:description"] || values["og:description"] || values["weibo:article:description"] || values["weibo:webpage:description"] || values.description || values["twitter:description"];
+          metadata.siteName = jsonld.siteName || values["og:site_name"];
+          metadata.publishedTime = jsonld.datePublished || values["article:published_time"] || values["parsely-pub-date"] || null;
+          metadata.title = this._unescapeHtmlEntities(metadata.title);
+          metadata.byline = this._unescapeHtmlEntities(metadata.byline);
+          metadata.excerpt = this._unescapeHtmlEntities(metadata.excerpt);
+          metadata.siteName = this._unescapeHtmlEntities(metadata.siteName);
+          metadata.publishedTime = this._unescapeHtmlEntities(metadata.publishedTime);
+          return metadata;
+        },
+        /**
+         * Check if node is image, or if node contains exactly only one image
+         * whether as a direct child or as its descendants.
+         *
+         * @param Element
+         **/
+        _isSingleImage(node) {
+          while (node) {
+            if (node.tagName === "IMG") {
+              return true;
+            }
+            if (node.children.length !== 1 || node.textContent.trim() !== "") {
+              return false;
+            }
+            node = node.children[0];
+          }
+          return false;
+        },
+        /**
+         * Find all <noscript> that are located after <img> nodes, and which contain only one
+         * <img> element. Replace the first image with the image from inside the <noscript> tag,
+         * and remove the <noscript> tag. This improves the quality of the images we use on
+         * some sites (e.g. Medium).
+         *
+         * @param Element
+         **/
+        _unwrapNoscriptImages(doc) {
+          var imgs = Array.from(doc.getElementsByTagName("img"));
+          this._forEachNode(imgs, function(img) {
+            for (var i = 0; i < img.attributes.length; i++) {
+              var attr = img.attributes[i];
+              switch (attr.name) {
+                case "src":
+                case "srcset":
+                case "data-src":
+                case "data-srcset":
+                  return;
+              }
+              if (/\.(jpg|jpeg|png|webp)/i.test(attr.value)) {
+                return;
+              }
+            }
+            img.remove();
+          });
+          var noscripts = Array.from(doc.getElementsByTagName("noscript"));
+          this._forEachNode(noscripts, function(noscript) {
+            if (!this._isSingleImage(noscript)) {
+              return;
+            }
+            var tmp = doc.createElement("div");
+            tmp.innerHTML = noscript.innerHTML;
+            var prevElement = noscript.previousElementSibling;
+            if (prevElement && this._isSingleImage(prevElement)) {
+              var prevImg = prevElement;
+              if (prevImg.tagName !== "IMG") {
+                prevImg = prevElement.getElementsByTagName("img")[0];
+              }
+              var newImg = tmp.getElementsByTagName("img")[0];
+              for (var i = 0; i < prevImg.attributes.length; i++) {
+                var attr = prevImg.attributes[i];
+                if (attr.value === "") {
+                  continue;
+                }
+                if (attr.name === "src" || attr.name === "srcset" || /\.(jpg|jpeg|png|webp)/i.test(attr.value)) {
+                  if (newImg.getAttribute(attr.name) === attr.value) {
+                    continue;
+                  }
+                  var attrName = attr.name;
+                  if (newImg.hasAttribute(attrName)) {
+                    attrName = "data-old-" + attrName;
+                  }
+                  newImg.setAttribute(attrName, attr.value);
+                }
+              }
+              noscript.parentNode.replaceChild(tmp.firstElementChild, prevElement);
+            }
+          });
+        },
+        /**
+         * Removes script tags from the document.
+         *
+         * @param Element
+         **/
+        _removeScripts(doc) {
+          this._removeNodes(this._getAllNodesWithTag(doc, ["script", "noscript"]));
+        },
+        /**
+         * Check if this node has only whitespace and a single element with given tag
+         * Returns false if the DIV node contains non-empty text nodes
+         * or if it contains no element with given tag or more than 1 element.
+         *
+         * @param Element
+         * @param string tag of child element
+         **/
+        _hasSingleTagInsideElement(element, tag) {
+          if (element.children.length != 1 || element.children[0].tagName !== tag) {
+            return false;
+          }
+          return !this._someNode(element.childNodes, function(node) {
+            return node.nodeType === this.TEXT_NODE && this.REGEXPS.hasContent.test(node.textContent);
+          });
+        },
+        _isElementWithoutContent(node) {
+          return node.nodeType === this.ELEMENT_NODE && !node.textContent.trim().length && (!node.children.length || node.children.length == node.getElementsByTagName("br").length + node.getElementsByTagName("hr").length);
+        },
+        /**
+         * Determine whether element has any children block level elements.
+         *
+         * @param Element
+         */
+        _hasChildBlockElement(element) {
+          return this._someNode(element.childNodes, function(node) {
+            return this.DIV_TO_P_ELEMS.has(node.tagName) || this._hasChildBlockElement(node);
+          });
+        },
+        /***
+         * Determine if a node qualifies as phrasing content.
+         * https://developer.mozilla.org/en-US/docs/Web/Guide/HTML/Content_categories#Phrasing_content
+         **/
+        _isPhrasingContent(node) {
+          return node.nodeType === this.TEXT_NODE || this.PHRASING_ELEMS.includes(node.tagName) || (node.tagName === "A" || node.tagName === "DEL" || node.tagName === "INS") && this._everyNode(node.childNodes, this._isPhrasingContent);
+        },
+        _isWhitespace(node) {
+          return node.nodeType === this.TEXT_NODE && node.textContent.trim().length === 0 || node.nodeType === this.ELEMENT_NODE && node.tagName === "BR";
+        },
+        /**
+         * Get the inner text of a node - cross browser compatibly.
+         * This also strips out any excess whitespace to be found.
+         *
+         * @param Element
+         * @param Boolean normalizeSpaces (default: true)
+         * @return string
+         **/
+        _getInnerText(e, normalizeSpaces) {
+          normalizeSpaces = typeof normalizeSpaces === "undefined" ? true : normalizeSpaces;
+          var textContent = e.textContent.trim();
+          if (normalizeSpaces) {
+            return textContent.replace(this.REGEXPS.normalize, " ");
+          }
+          return textContent;
+        },
+        /**
+         * Get the number of times a string s appears in the node e.
+         *
+         * @param Element
+         * @param string - what to split on. Default is ","
+         * @return number (integer)
+         **/
+        _getCharCount(e, s) {
+          s = s || ",";
+          return this._getInnerText(e).split(s).length - 1;
+        },
+        /**
+         * Remove the style attribute on every e and under.
+         * TODO: Test if getElementsByTagName(*) is faster.
+         *
+         * @param Element
+         * @return void
+         **/
+        _cleanStyles(e) {
+          if (!e || e.tagName.toLowerCase() === "svg") {
+            return;
+          }
+          for (var i = 0; i < this.PRESENTATIONAL_ATTRIBUTES.length; i++) {
+            e.removeAttribute(this.PRESENTATIONAL_ATTRIBUTES[i]);
+          }
+          if (this.DEPRECATED_SIZE_ATTRIBUTE_ELEMS.includes(e.tagName)) {
+            e.removeAttribute("width");
+            e.removeAttribute("height");
+          }
+          var cur = e.firstElementChild;
+          while (cur !== null) {
+            this._cleanStyles(cur);
+            cur = cur.nextElementSibling;
+          }
+        },
+        /**
+         * Get the density of links as a percentage of the content
+         * This is the amount of text that is inside a link divided by the total text in the node.
+         *
+         * @param Element
+         * @return number (float)
+         **/
+        _getLinkDensity(element) {
+          var textLength = this._getInnerText(element).length;
+          if (textLength === 0) {
+            return 0;
+          }
+          var linkLength = 0;
+          this._forEachNode(element.getElementsByTagName("a"), function(linkNode) {
+            var href = linkNode.getAttribute("href");
+            var coefficient = href && this.REGEXPS.hashUrl.test(href) ? 0.3 : 1;
+            linkLength += this._getInnerText(linkNode).length * coefficient;
+          });
+          return linkLength / textLength;
+        },
+        /**
+         * Get an elements class/id weight. Uses regular expressions to tell if this
+         * element looks good or bad.
+         *
+         * @param Element
+         * @return number (Integer)
+         **/
+        _getClassWeight(e) {
+          if (!this._flagIsActive(this.FLAG_WEIGHT_CLASSES)) {
+            return 0;
+          }
+          var weight = 0;
+          if (typeof e.className === "string" && e.className !== "") {
+            if (this.REGEXPS.negative.test(e.className)) {
+              weight -= 25;
+            }
+            if (this.REGEXPS.positive.test(e.className)) {
+              weight += 25;
+            }
+          }
+          if (typeof e.id === "string" && e.id !== "") {
+            if (this.REGEXPS.negative.test(e.id)) {
+              weight -= 25;
+            }
+            if (this.REGEXPS.positive.test(e.id)) {
+              weight += 25;
+            }
+          }
+          return weight;
+        },
+        /**
+         * Clean a node of all elements of type "tag".
+         * (Unless it's a youtube/vimeo video. People love movies.)
+         *
+         * @param Element
+         * @param string tag to clean
+         * @return void
+         **/
+        _clean(e, tag) {
+          var isEmbed = ["object", "embed", "iframe"].includes(tag);
+          this._removeNodes(this._getAllNodesWithTag(e, [tag]), function(element) {
+            if (isEmbed) {
+              for (var i = 0; i < element.attributes.length; i++) {
+                if (this._allowedVideoRegex.test(element.attributes[i].value)) {
+                  return false;
+                }
+              }
+              if (element.tagName === "object" && this._allowedVideoRegex.test(element.innerHTML)) {
+                return false;
+              }
+            }
+            return true;
+          });
+        },
+        /**
+         * Check if a given node has one of its ancestor tag name matching the
+         * provided one.
+         * @param  HTMLElement node
+         * @param  String      tagName
+         * @param  Number      maxDepth
+         * @param  Function    filterFn a filter to invoke to determine whether this node 'counts'
+         * @return Boolean
+         */
+        _hasAncestorTag(node, tagName, maxDepth, filterFn) {
+          maxDepth = maxDepth || 3;
+          tagName = tagName.toUpperCase();
+          var depth = 0;
+          while (node.parentNode) {
+            if (maxDepth > 0 && depth > maxDepth) {
+              return false;
+            }
+            if (node.parentNode.tagName === tagName && (!filterFn || filterFn(node.parentNode))) {
+              return true;
+            }
+            node = node.parentNode;
+            depth++;
+          }
+          return false;
+        },
+        /**
+         * Return an object indicating how many rows and columns this table has.
+         */
+        _getRowAndColumnCount(table) {
+          var rows = 0;
+          var columns = 0;
+          var trs = table.getElementsByTagName("tr");
+          for (var i = 0; i < trs.length; i++) {
+            var rowspan = trs[i].getAttribute("rowspan") || 0;
+            if (rowspan) {
+              rowspan = parseInt(rowspan, 10);
+            }
+            rows += rowspan || 1;
+            var columnsInThisRow = 0;
+            var cells = trs[i].getElementsByTagName("td");
+            for (var j = 0; j < cells.length; j++) {
+              var colspan = cells[j].getAttribute("colspan") || 0;
+              if (colspan) {
+                colspan = parseInt(colspan, 10);
+              }
+              columnsInThisRow += colspan || 1;
+            }
+            columns = Math.max(columns, columnsInThisRow);
+          }
+          return { rows, columns };
+        },
+        /**
+         * Look for 'data' (as opposed to 'layout') tables, for which we use
+         * similar checks as
+         * https://searchfox.org/mozilla-central/rev/f82d5c549f046cb64ce5602bfd894b7ae807c8f8/accessible/generic/TableAccessible.cpp#19
+         */
+        _markDataTables(root) {
+          var tables = root.getElementsByTagName("table");
+          for (var i = 0; i < tables.length; i++) {
+            var table = tables[i];
+            var role = table.getAttribute("role");
+            if (role == "presentation") {
+              table._readabilityDataTable = false;
+              continue;
+            }
+            var datatable = table.getAttribute("datatable");
+            if (datatable == "0") {
+              table._readabilityDataTable = false;
+              continue;
+            }
+            var summary = table.getAttribute("summary");
+            if (summary) {
+              table._readabilityDataTable = true;
+              continue;
+            }
+            var caption = table.getElementsByTagName("caption")[0];
+            if (caption && caption.childNodes.length) {
+              table._readabilityDataTable = true;
+              continue;
+            }
+            var dataTableDescendants = ["col", "colgroup", "tfoot", "thead", "th"];
+            var descendantExists = function(tag) {
+              return !!table.getElementsByTagName(tag)[0];
+            };
+            if (dataTableDescendants.some(descendantExists)) {
+              this.log("Data table because found data-y descendant");
+              table._readabilityDataTable = true;
+              continue;
+            }
+            if (table.getElementsByTagName("table")[0]) {
+              table._readabilityDataTable = false;
+              continue;
+            }
+            var sizeInfo = this._getRowAndColumnCount(table);
+            if (sizeInfo.columns == 1 || sizeInfo.rows == 1) {
+              table._readabilityDataTable = false;
+              continue;
+            }
+            if (sizeInfo.rows >= 10 || sizeInfo.columns > 4) {
+              table._readabilityDataTable = true;
+              continue;
+            }
+            table._readabilityDataTable = sizeInfo.rows * sizeInfo.columns > 10;
+          }
+        },
+        /* convert images and figures that have properties like data-src into images that can be loaded without JS */
+        _fixLazyImages(root) {
+          this._forEachNode(
+            this._getAllNodesWithTag(root, ["img", "picture", "figure"]),
+            function(elem) {
+              if (elem.src && this.REGEXPS.b64DataUrl.test(elem.src)) {
+                var parts = this.REGEXPS.b64DataUrl.exec(elem.src);
+                if (parts[1] === "image/svg+xml") {
+                  return;
+                }
+                var srcCouldBeRemoved = false;
+                for (var i = 0; i < elem.attributes.length; i++) {
+                  var attr = elem.attributes[i];
+                  if (attr.name === "src") {
+                    continue;
+                  }
+                  if (/\.(jpg|jpeg|png|webp)/i.test(attr.value)) {
+                    srcCouldBeRemoved = true;
+                    break;
+                  }
+                }
+                if (srcCouldBeRemoved) {
+                  var b64starts = parts[0].length;
+                  var b64length = elem.src.length - b64starts;
+                  if (b64length < 133) {
+                    elem.removeAttribute("src");
+                  }
+                }
+              }
+              if ((elem.src || elem.srcset && elem.srcset != "null") && !elem.className.toLowerCase().includes("lazy")) {
+                return;
+              }
+              for (var j = 0; j < elem.attributes.length; j++) {
+                attr = elem.attributes[j];
+                if (attr.name === "src" || attr.name === "srcset" || attr.name === "alt") {
+                  continue;
+                }
+                var copyTo = null;
+                if (/\.(jpg|jpeg|png|webp)\s+\d/.test(attr.value)) {
+                  copyTo = "srcset";
+                } else if (/^\s*\S+\.(jpg|jpeg|png|webp)\S*\s*$/.test(attr.value)) {
+                  copyTo = "src";
+                }
+                if (copyTo) {
+                  if (elem.tagName === "IMG" || elem.tagName === "PICTURE") {
+                    elem.setAttribute(copyTo, attr.value);
+                  } else if (elem.tagName === "FIGURE" && !this._getAllNodesWithTag(elem, ["img", "picture"]).length) {
+                    var img = this._doc.createElement("img");
+                    img.setAttribute(copyTo, attr.value);
+                    elem.appendChild(img);
+                  }
+                }
+              }
+            }
+          );
+        },
+        _getTextDensity(e, tags) {
+          var textLength = this._getInnerText(e, true).length;
+          if (textLength === 0) {
+            return 0;
+          }
+          var childrenLength = 0;
+          var children = this._getAllNodesWithTag(e, tags);
+          this._forEachNode(
+            children,
+            (child) => childrenLength += this._getInnerText(child, true).length
+          );
+          return childrenLength / textLength;
+        },
+        /**
+         * Clean an element of all tags of type "tag" if they look fishy.
+         * "Fishy" is an algorithm based on content length, classnames, link density, number of images & embeds, etc.
+         *
+         * @return void
+         **/
+        _cleanConditionally(e, tag) {
+          if (!this._flagIsActive(this.FLAG_CLEAN_CONDITIONALLY)) {
+            return;
+          }
+          this._removeNodes(this._getAllNodesWithTag(e, [tag]), function(node) {
+            var isDataTable = function(t) {
+              return t._readabilityDataTable;
+            };
+            var isList = tag === "ul" || tag === "ol";
+            if (!isList) {
+              var listLength = 0;
+              var listNodes = this._getAllNodesWithTag(node, ["ul", "ol"]);
+              this._forEachNode(
+                listNodes,
+                (list) => listLength += this._getInnerText(list).length
+              );
+              isList = listLength / this._getInnerText(node).length > 0.9;
+            }
+            if (tag === "table" && isDataTable(node)) {
+              return false;
+            }
+            if (this._hasAncestorTag(node, "table", -1, isDataTable)) {
+              return false;
+            }
+            if (this._hasAncestorTag(node, "code")) {
+              return false;
+            }
+            if ([...node.getElementsByTagName("table")].some(
+              (tbl) => tbl._readabilityDataTable
+            )) {
+              return false;
+            }
+            var weight = this._getClassWeight(node);
+            this.log("Cleaning Conditionally", node);
+            var contentScore = 0;
+            if (weight + contentScore < 0) {
+              return true;
+            }
+            if (this._getCharCount(node, ",") < 10) {
+              var p = node.getElementsByTagName("p").length;
+              var img = node.getElementsByTagName("img").length;
+              var li = node.getElementsByTagName("li").length - 100;
+              var input = node.getElementsByTagName("input").length;
+              var headingDensity = this._getTextDensity(node, [
+                "h1",
+                "h2",
+                "h3",
+                "h4",
+                "h5",
+                "h6"
+              ]);
+              var embedCount = 0;
+              var embeds = this._getAllNodesWithTag(node, [
+                "object",
+                "embed",
+                "iframe"
+              ]);
+              for (var i = 0; i < embeds.length; i++) {
+                for (var j = 0; j < embeds[i].attributes.length; j++) {
+                  if (this._allowedVideoRegex.test(embeds[i].attributes[j].value)) {
+                    return false;
+                  }
+                }
+                if (embeds[i].tagName === "object" && this._allowedVideoRegex.test(embeds[i].innerHTML)) {
+                  return false;
+                }
+                embedCount++;
+              }
+              var innerText = this._getInnerText(node);
+              if (this.REGEXPS.adWords.test(innerText) || this.REGEXPS.loadingWords.test(innerText)) {
+                return true;
+              }
+              var contentLength = innerText.length;
+              var linkDensity = this._getLinkDensity(node);
+              var textishTags = ["SPAN", "LI", "TD"].concat(
+                Array.from(this.DIV_TO_P_ELEMS)
+              );
+              var textDensity = this._getTextDensity(node, textishTags);
+              var isFigureChild = this._hasAncestorTag(node, "figure");
+              const shouldRemoveNode = () => {
+                const errs = [];
+                if (!isFigureChild && img > 1 && p / img < 0.5) {
+                  errs.push(`Bad p to img ratio (img=${img}, p=${p})`);
+                }
+                if (!isList && li > p) {
+                  errs.push(`Too many li's outside of a list. (li=${li} > p=${p})`);
+                }
+                if (input > Math.floor(p / 3)) {
+                  errs.push(`Too many inputs per p. (input=${input}, p=${p})`);
+                }
+                if (!isList && !isFigureChild && headingDensity < 0.9 && contentLength < 25 && (img === 0 || img > 2) && linkDensity > 0) {
+                  errs.push(
+                    `Suspiciously short. (headingDensity=${headingDensity}, img=${img}, linkDensity=${linkDensity})`
+                  );
+                }
+                if (!isList && weight < 25 && linkDensity > 0.2 + this._linkDensityModifier) {
+                  errs.push(
+                    `Low weight and a little linky. (linkDensity=${linkDensity})`
+                  );
+                }
+                if (weight >= 25 && linkDensity > 0.5 + this._linkDensityModifier) {
+                  errs.push(
+                    `High weight and mostly links. (linkDensity=${linkDensity})`
+                  );
+                }
+                if (embedCount === 1 && contentLength < 75 || embedCount > 1) {
+                  errs.push(
+                    `Suspicious embed. (embedCount=${embedCount}, contentLength=${contentLength})`
+                  );
+                }
+                if (img === 0 && textDensity === 0) {
+                  errs.push(
+                    `No useful content. (img=${img}, textDensity=${textDensity})`
+                  );
+                }
+                if (errs.length) {
+                  this.log("Checks failed", errs);
+                  return true;
+                }
+                return false;
+              };
+              var haveToRemove = shouldRemoveNode();
+              if (isList && haveToRemove) {
+                for (var x = 0; x < node.children.length; x++) {
+                  let child = node.children[x];
+                  if (child.children.length > 1) {
+                    return haveToRemove;
+                  }
+                }
+                let li_count = node.getElementsByTagName("li").length;
+                if (img == li_count) {
+                  return false;
+                }
+              }
+              return haveToRemove;
+            }
+            return false;
+          });
+        },
+        /**
+         * Clean out elements that match the specified conditions
+         *
+         * @param Element
+         * @param Function determines whether a node should be removed
+         * @return void
+         **/
+        _cleanMatchedNodes(e, filter) {
+          var endOfSearchMarkerNode = this._getNextNode(e, true);
+          var next = this._getNextNode(e);
+          while (next && next != endOfSearchMarkerNode) {
+            if (filter.call(this, next, next.className + " " + next.id)) {
+              next = this._removeAndGetNext(next);
+            } else {
+              next = this._getNextNode(next);
+            }
+          }
+        },
+        /**
+         * Clean out spurious headers from an Element.
+         *
+         * @param Element
+         * @return void
+         **/
+        _cleanHeaders(e) {
+          let headingNodes = this._getAllNodesWithTag(e, ["h1", "h2"]);
+          this._removeNodes(headingNodes, function(node) {
+            let shouldRemove = this._getClassWeight(node) < 0;
+            if (shouldRemove) {
+              this.log("Removing header with low class weight:", node);
+            }
+            return shouldRemove;
+          });
+        },
+        /**
+         * Check if this node is an H1 or H2 element whose content is mostly
+         * the same as the article title.
+         *
+         * @param Element  the node to check.
+         * @return boolean indicating whether this is a title-like header.
+         */
+        _headerDuplicatesTitle(node) {
+          if (node.tagName != "H1" && node.tagName != "H2") {
+            return false;
+          }
+          var heading = this._getInnerText(node, false);
+          this.log("Evaluating similarity of header:", heading, this._articleTitle);
+          return this._textSimilarity(this._articleTitle, heading) > 0.75;
+        },
+        _flagIsActive(flag) {
+          return (this._flags & flag) > 0;
+        },
+        _removeFlag(flag) {
+          this._flags = this._flags & ~flag;
+        },
+        _isProbablyVisible(node) {
+          return (!node.style || node.style.display != "none") && (!node.style || node.style.visibility != "hidden") && !node.hasAttribute("hidden") && //check for "fallback-image" so that wikimedia math images are displayed
+          (!node.hasAttribute("aria-hidden") || node.getAttribute("aria-hidden") != "true" || node.className && node.className.includes && node.className.includes("fallback-image"));
+        },
+        /**
+         * Runs readability.
+         *
+         * Workflow:
+         *  1. Prep the document by removing script tags, css, etc.
+         *  2. Build readability's DOM tree.
+         *  3. Grab the article content from the current dom tree.
+         *  4. Replace the current DOM tree with the new one.
+         *  5. Read peacefully.
+         *
+         * @return void
+         **/
+        parse() {
+          if (this._maxElemsToParse > 0) {
+            var numTags = this._doc.getElementsByTagName("*").length;
+            if (numTags > this._maxElemsToParse) {
+              throw new Error(
+                "Aborting parsing document; " + numTags + " elements found"
+              );
+            }
+          }
+          this._unwrapNoscriptImages(this._doc);
+          var jsonLd = this._disableJSONLD ? {} : this._getJSONLD(this._doc);
+          this._removeScripts(this._doc);
+          this._prepDocument();
+          var metadata = this._getArticleMetadata(jsonLd);
+          this._metadata = metadata;
+          this._articleTitle = metadata.title;
+          var articleContent = this._grabArticle();
+          if (!articleContent) {
+            return null;
+          }
+          this.log("Grabbed: " + articleContent.innerHTML);
+          this._postProcessContent(articleContent);
+          if (!metadata.excerpt) {
+            var paragraphs = articleContent.getElementsByTagName("p");
+            if (paragraphs.length) {
+              metadata.excerpt = paragraphs[0].textContent.trim();
+            }
+          }
+          var textContent = articleContent.textContent;
+          return {
+            title: this._articleTitle,
+            byline: metadata.byline || this._articleByline,
+            dir: this._articleDir,
+            lang: this._articleLang,
+            content: this._serializer(articleContent),
+            textContent,
+            length: textContent.length,
+            excerpt: metadata.excerpt,
+            siteName: metadata.siteName || this._articleSiteName,
+            publishedTime: metadata.publishedTime
+          };
+        }
+      };
+      if (typeof module === "object") {
+        module.exports = Readability;
+      }
+    }
+  });
+
+  // ../node_modules/.pnpm/@mozilla+readability@0.6.0/node_modules/@mozilla/readability/Readability-readerable.js
+  var require_Readability_readerable = __commonJS({
+    "../node_modules/.pnpm/@mozilla+readability@0.6.0/node_modules/@mozilla/readability/Readability-readerable.js"(exports, module) {
+      var REGEXPS = {
+        // NOTE: These two regular expressions are duplicated in
+        // Readability.js. Please keep both copies in sync.
+        unlikelyCandidates: /-ad-|ai2html|banner|breadcrumbs|combx|comment|community|cover-wrap|disqus|extra|footer|gdpr|header|legends|menu|related|remark|replies|rss|shoutbox|sidebar|skyscraper|social|sponsor|supplemental|ad-break|agegate|pagination|pager|popup|yom-remote/i,
+        okMaybeItsACandidate: /and|article|body|column|content|main|shadow/i
+      };
+      function isNodeVisible(node) {
+        return (!node.style || node.style.display != "none") && !node.hasAttribute("hidden") && //check for "fallback-image" so that wikimedia math images are displayed
+        (!node.hasAttribute("aria-hidden") || node.getAttribute("aria-hidden") != "true" || node.className && node.className.includes && node.className.includes("fallback-image"));
+      }
+      function isProbablyReaderable(doc, options = {}) {
+        if (typeof options == "function") {
+          options = { visibilityChecker: options };
+        }
+        var defaultOptions = {
+          minScore: 20,
+          minContentLength: 140,
+          visibilityChecker: isNodeVisible
+        };
+        options = Object.assign(defaultOptions, options);
+        var nodes = doc.querySelectorAll("p, pre, article");
+        var brNodes = doc.querySelectorAll("div > br");
+        if (brNodes.length) {
+          var set = new Set(nodes);
+          [].forEach.call(brNodes, function(node) {
+            set.add(node.parentNode);
+          });
+          nodes = Array.from(set);
+        }
+        var score = 0;
+        return [].some.call(nodes, function(node) {
+          if (!options.visibilityChecker(node)) {
+            return false;
+          }
+          var matchString = node.className + " " + node.id;
+          if (REGEXPS.unlikelyCandidates.test(matchString) && !REGEXPS.okMaybeItsACandidate.test(matchString)) {
+            return false;
+          }
+          if (node.matches("li p")) {
+            return false;
+          }
+          var textContentLength = node.textContent.trim().length;
+          if (textContentLength < options.minContentLength) {
+            return false;
+          }
+          score += Math.sqrt(textContentLength - options.minContentLength);
+          if (score > options.minScore) {
+            return true;
+          }
+          return false;
+        });
+      }
+      if (typeof module === "object") {
+        module.exports = isProbablyReaderable;
+      }
+    }
+  });
+
+  // src/vendor/_entry.cjs
+  var require_entry = __commonJS({
+    "src/vendor/_entry.cjs"() {
+      var { Readability } = require_Readability();
+      var { isProbablyReaderable } = require_Readability_readerable();
+      globalThis.__readability = { Readability, isProbablyReaderable };
+    }
+  });
+  require_entry();
+})();
diff --git a/mcp/test/exec-engine-plugins.test.js b/mcp/test/exec-engine-plugins.test.js
index c5b3adf..05e3907 100644
--- a/mcp/test/exec-engine-plugins.test.js
+++ b/mcp/test/exec-engine-plugins.test.js
@@ -1,6 +1,6 @@
 import { test } from 'node:test';
 import assert from 'node:assert/strict';
-import { buildExecContext, runCode } from '../src/exec-engine.js';
+import { buildExecContext, runCode, formatResult } from '../src/exec-engine.js';
 
 const mockPage = { isClosed: () => false, url: () => 'about:blank', title: async () => 'Test' };
 const mockCtx = { pages: () => [mockPage] };
@@ -37,3 +37,30 @@ test('plugin helper receives null page gracefully when no page open', async () =
   const result = await runCode('return await safeHelper()', ctx, 5000);
   assert.equal(result, 'no-page');
 });
+
+test('buildExecContext exposes screenshot and content helpers in execute scope', () => {
+  const ctx = buildExecContext(mockPage, mockCtx, {}, {}, {});
+  assert.equal(typeof ctx.screenshotWithAccessibilityLabels, 'function');
+  assert.equal(typeof ctx.cleanHTML, 'function');
+  assert.equal(typeof ctx.pageMarkdown, 'function');
+});
+
+test('formatResult returns multi-content for labeled screenshot sentinel', () => {
+  const fakeBuffer = Buffer.from('fake-jpeg-data');
+  const formatted = formatResult({
+    _bf_type: 'labeled_screenshot',
+    screenshot: fakeBuffer,
+    snapshot: '- button "Submit" [ref=e1]',
+    labelCount: 1,
+  });
+
+  assert.ok(Array.isArray(formatted));
+  assert.equal(formatted.length, 2);
+  assert.deepEqual(formatted[0], {
+    type: 'image',
+    data: fakeBuffer.toString('base64'),
+    mimeType: 'image/jpeg',
+  });
+  assert.equal(formatted[1].type, 'text');
+  assert.ok(formatted[1].text.includes('Labels: 1 interactive elements'));
+});
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index c25037c..4bb507d 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -56,7 +56,7 @@ describe('Tool Definitions', () => {
     assert.equal(result, '');
   });
 
-  it('registers exactly 3 tools: execute, reset, screenshot_with_labels', () => {
+  it('registers exactly 2 tools: execute, reset', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),
       'utf8'
@@ -69,8 +69,8 @@ describe('Tool Definitions', () => {
       toolNames.push(match[1]);
     }
 
-    assert.equal(toolNames.length, 3, `Should have exactly 3 tools, found ${toolNames.length}: ${toolNames.join(', ')}`);
-    assert.deepEqual(toolNames.sort(), ['execute', 'reset', 'screenshot_with_labels']);
+    assert.equal(toolNames.length, 2, `Should have exactly 2 tools, found ${toolNames.length}: ${toolNames.join(', ')}`);
+    assert.deepEqual(toolNames.sort(), ['execute', 'reset']);
   });
 
   it('tools have non-empty descriptions', () => {
@@ -105,6 +105,9 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('state.page'), 'should mention state.page for page management');
     assert.ok(promptBlock.includes('snapshot'), 'should mention snapshot-first approach');
     assert.ok(promptBlock.includes('waitForPageLoad'), 'should mention waitForPageLoad');
+    assert.ok(promptBlock.includes('screenshotWithAccessibilityLabels'), 'should mention screenshotWithAccessibilityLabels helper');
+    assert.ok(promptBlock.includes('cleanHTML'), 'should mention cleanHTML helper');
+    assert.ok(promptBlock.includes('pageMarkdown'), 'should mention pageMarkdown helper');
     assert.ok(promptBlock.includes('newPage'), 'should mention creating new tabs');
     // Anti-patterns section
     assert.ok(promptBlock.includes('ANTI-PATTERN') || promptBlock.includes('Don\'t') || promptBlock.includes('✗'), 'should include anti-patterns');
@@ -138,31 +141,14 @@ describe('Tool Definitions', () => {
     assert.ok(paramsMatch, 'reset should have empty params {}');
   });
 
-  it('screenshot_with_labels tool has optional selector and interactiveOnly params', () => {
+  it('does not register screenshot_with_labels tool', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),
       'utf8'
     );
 
-    const toolBlock = source.split("'screenshot_with_labels'")[1]?.split('server.tool(')[0] || '';
-    assert.ok(toolBlock.includes('z.string().optional()'), 'should have optional string param (selector)');
-    assert.ok(toolBlock.includes('z.boolean().optional()'), 'should have optional boolean param (interactiveOnly)');
-    assert.ok(toolBlock.includes('selector:'), 'should have selector param');
-    assert.ok(toolBlock.includes('interactiveOnly:'), 'should have interactiveOnly param');
-  });
-
-  it('screenshot_with_labels tool has descriptive prompt', () => {
-    const source = readFileSync(
-      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
-      'utf8'
-    );
-
-    assert.ok(source.includes('SCREENSHOT_LABELS_PROMPT'), 'should reference SCREENSHOT_LABELS_PROMPT');
-    assert.ok(source.includes('const SCREENSHOT_LABELS_PROMPT'), 'SCREENSHOT_LABELS_PROMPT should be defined');
-    const promptIdx = source.indexOf('const SCREENSHOT_LABELS_PROMPT');
-    const promptBlock = source.slice(promptIdx, source.indexOf("server.tool(\n  'screenshot_with_labels'"));
-    assert.ok(promptBlock.includes('color-coded'), 'prompt should mention color coding');
-    assert.ok(promptBlock.includes('snapshot'), 'prompt should mention snapshot');
+    assert.ok(!source.includes("'screenshot_with_labels'"), 'screenshot_with_labels tool should be removed');
+    assert.ok(!source.includes('SCREENSHOT_LABELS_PROMPT'), 'dedicated screenshot prompt should be removed');
   });
 });
 
@@ -201,7 +187,7 @@ describe('MCP Response Format', () => {
     assert.equal(parsed[1].url, 'https://github.com');
   });
 
-  it('screenshot_with_labels multi-content format is valid', () => {
+  it('labeled screenshot multi-content format is valid', () => {
     const fakeBase64 = Buffer.from('fake-jpeg-data').toString('base64');
     const response = {
       content: [

From aa4e39127aa4a8e600977ff475bd1595940ecd83 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 16:44:09 +0530
Subject: [PATCH 019/192] docs(readme): consolidate MCP client setup and add
 antigravity

---
 README.md | 67 +++++++++++++++++++++++++++++++++++++++++++++++--------
 1 file changed, 58 insertions(+), 9 deletions(-)

diff --git a/README.md b/README.md
index 74a8a59..9c9e1c6 100644
--- a/README.md
+++ b/README.md
@@ -107,9 +107,11 @@ browserforce serve
 If your agent browses to the page and responds with the title, you're all set.
 
 <details>
-<summary><b>Alternative: MCP server</b> (advanced)</summary>
+<summary><b>MCP setup for OpenClaw, Claude, Codex, Cursor, and Antigravity</b></summary>
 
-If you prefer MCP over the skill, add to `~/.openclaw/openclaw.json`:
+#### OpenClaw (MCP adapter)
+
+Add to `~/.openclaw/openclaw.json`:
 
 ```json
 {
@@ -123,7 +125,7 @@ If you prefer MCP over the skill, add to `~/.openclaw/openclaw.json`:
               "name": "browserforce",
               "transport": "stdio",
               "command": "npx",
-              "args": ["-y", "browserforce", "mcp"]
+              "args": ["-y", "browserforce@latest", "mcp"]
             }
           ]
         }
@@ -133,9 +135,7 @@ If you prefer MCP over the skill, add to `~/.openclaw/openclaw.json`:
 }
 ```
 
-</details>
-
-### Claude Desktop
+#### Claude Desktop
 
 Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 
@@ -144,13 +144,13 @@ Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
   "mcpServers": {
     "browserforce": {
       "command": "npx",
-      "args": ["-y", "browserforce", "mcp"]
+      "args": ["-y", "browserforce@latest", "mcp"]
     }
   }
 }
 ```
 
-### Claude Code
+#### Claude Code
 
 Add to `~/.claude/mcp.json`:
 
@@ -159,12 +159,61 @@ Add to `~/.claude/mcp.json`:
   "mcpServers": {
     "browserforce": {
       "command": "npx",
-      "args": ["-y", "browserforce", "mcp"]
+      "args": ["-y", "browserforce@latest", "mcp"]
     }
   }
 }
 ```
 
+#### Codex
+
+Add to `~/.codex/config.toml`:
+
+```toml
+[mcp_servers.browserforce]
+command = "npx"
+args = ["-y", "browserforce@latest", "mcp"]
+```
+
+#### Cursor
+
+Add to `~/.cursor/mcp.json`:
+
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "npx",
+      "args": ["-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
+
+#### Antigravity
+
+In Antigravity: Agent panel -> `...` -> `Manage MCP Servers` -> `View raw config`.
+Add the same `mcpServers` entry:
+
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "npx",
+      "args": ["-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
+
+If MCP startup fails with `connection closed: initialize response`:
+
+1. Ensure args include `"mcp"` (without it, BrowserForce prints help and exits).
+2. If running from a local clone, install deps first: `pnpm install`.
+3. Validate the launch command manually: `npx -y browserforce@latest mcp`
+
+</details>
+
 ### CLI
 
 ```bash

From 73e0769c6a9c4d4ff3563cf3f205fc64b56611f5 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 16:44:19 +0530
Subject: [PATCH 020/192] docs(guide): add codex and cursor MCP config +
 handshake troubleshooting

---
 GUIDE.md | 31 +++++++++++++++++++++++++++++++
 1 file changed, 31 insertions(+)

diff --git a/GUIDE.md b/GUIDE.md
index 966a2d2..f798650 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -166,6 +166,37 @@ Add to your Claude config:
 }
 ```
 
+**Option B.1: Codex (via MCP)**
+
+Add to `~/.codex/config.toml`:
+
+```toml
+[mcp_servers.browserforce]
+command = "npx"
+args = ["-y", "browserforce@latest", "mcp"]
+```
+
+**Option B.2: Cursor (via MCP)**
+
+Add to `~/.cursor/mcp.json`:
+
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "npx",
+      "args": ["-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
+
+If startup fails with `connection closed: initialize response`:
+
+1. Ensure args include `"mcp"` (without it, BrowserForce exits after printing help).
+2. If launching from a local clone, run `pnpm install` first.
+3. Verify manually: `npx -y browserforce@latest mcp`
+
 Then just talk to Claude: *"Open twitter.com and take a screenshot"*
 
 **Option C: Custom Playwright script**

From 55e3c000e831923c1c9886a012b8980b17cb75a0 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 16:46:34 +0530
Subject: [PATCH 021/192] chore: update .gitignore and README for installation
 instructions

- Added '.superset' to .gitignore to exclude it from version control.
- Updated README to reflect the addition of new tools in the installation instructions and clarified the steps for loading the unpacked extension.
---
 .gitignore | 3 ++-
 README.md  | 8 +++++---
 2 files changed, 7 insertions(+), 4 deletions(-)

diff --git a/.gitignore b/.gitignore
index a485ef2..74c7dfb 100644
--- a/.gitignore
+++ b/.gitignore
@@ -7,4 +7,5 @@ node_modules/
 .npm
 pnpm-debug.log*
 .worktrees/
-docs/plans/*
\ No newline at end of file
+docs/plans/*
+.superset
\ No newline at end of file
diff --git a/README.md b/README.md
index 9c9e1c6..7750656 100644
--- a/README.md
+++ b/README.md
@@ -19,7 +19,7 @@ Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-
 | Tab access | N/A (new browser) | Managed by agent | Click each tab | Click each tab | **All tabs, automatic** |
 | Autonomous | Yes | Yes | No (manual click) | No (manual click) | **Yes (fully autonomous)** |
 | Context method | Screenshots (100KB+) | Screenshots + snapshots | A11y snapshots (5-20KB) | Screenshots (100KB+) | **A11y snapshots (5-20KB)** |
-| Tools | Many dedicated | 1 `browser` tool | 1 `execute` tool | Built-in | **1 `execute` tool** |
+| Tools | Many dedicated | 1 `browser` tool | 1 `execute` tool | Built-in | **3 tools: `execute`, `screenshot_with_labels`, `reset`** |
 | Agent support | Any MCP client | OpenClaw only | Any MCP client | Claude only | **Any MCP client** |
 | Playwright API | Partial | No | Full | No | **Full** |
 
@@ -57,10 +57,12 @@ pnpm install
 
 **If you installed via npm:**
 
-1. Run: `browserforce install-extension`
+1. Run: `browserforce install-extension` — note the path it prints (e.g. `/Users/you/.browserforce/extension`)
 2. Open `chrome://extensions/` in Chrome
 3. Enable **Developer mode** (top-right toggle)
-4. Click **Load unpacked** → select the path printed in step 1
+4. Click **Load unpacked** → a file picker opens
+   - **macOS**: press `Cmd+Shift+G`, paste the path from step 1, press Enter
+   - **Windows/Linux**: paste the path directly into the address bar of the dialog
 
 ❗ After every BrowserForce update, re-run `browserforce install-extension`, then reload the extension in `chrome://extensions/` (click the ↺ icon next to BrowserForce).
 

From f6b5bad2068f026d36636942f18f2aafb26aa9fd Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:13:31 +0530
Subject: [PATCH 022/192] docs: add design for diffing parity and cdp logging

---
 ...aywriter-parity-diff-cdp-logging-design.md | 109 ++++++++++++++++++
 1 file changed, 109 insertions(+)
 create mode 100644 docs/plans/2026-02-24-playwriter-parity-diff-cdp-logging-design.md

diff --git a/docs/plans/2026-02-24-playwriter-parity-diff-cdp-logging-design.md b/docs/plans/2026-02-24-playwriter-parity-diff-cdp-logging-design.md
new file mode 100644
index 0000000..06d05fc
--- /dev/null
+++ b/docs/plans/2026-02-24-playwriter-parity-diff-cdp-logging-design.md
@@ -0,0 +1,109 @@
+# Playwriter Parity Diffing + CDP Logging Design
+
+## Goal
+Implement two P0 features in BrowserForce with playwriter behavior parity:
+- Diff-aware extraction helpers (`snapshot`, `cleanHTML`, `pageMarkdown`) with `showDiffSinceLastCall`
+- Relay-side JSONL CDP traffic logging queryable with `jq`
+
+## Scope Decisions (Approved)
+- `showDiffSinceLastCall` default: `true` (playwriter parity)
+- Relay CDP log lifecycle: recreate/truncate on each relay start
+- Execution model: subagent-driven implementation after plan creation
+
+## Non-Goals
+- Reworking extension protocol
+- Changing CDP routing semantics
+- Adding non-essential relay dependencies
+
+## Current State
+- `mcp/src/snapshot.js` already has `createSmartDiff(oldText, newText)` but helper wiring is missing.
+- `mcp/src/exec-engine.js` exposes `snapshot({ selector, search })`, `cleanHTML(selector, opts)`, and `pageMarkdown()` without diff mode state.
+- `relay/src/index.js` has operational console logging but no structured CDP JSONL log.
+
+## Proposed Design
+
+### 1) MCP Diffing Parity
+
+#### `snapshot`
+- Extend helper signature to `snapshot({ selector?, search?, showDiffSinceLastCall? } = {})`.
+- Keep existing snapshot build pipeline and ref table unchanged.
+- Cache last snapshot text per page (only for full-page snapshot, same practical behavior as playwriter page-scoped caching).
+- If `showDiffSinceLastCall` is `true` and a previous snapshot exists:
+  - `createSmartDiff` result `no-change` => return a clear no-change message with guidance to set `false` for full output.
+  - `diff` => return diff text.
+  - `full` => return full snapshot text.
+- If no previous snapshot or `showDiffSinceLastCall: false`, return full snapshot text.
+
+#### `cleanHTML`
+- Add option `showDiffSinceLastCall` to `getCleanHTML(page, selector, opts)`.
+- Maintain per-page/per-selector snapshot cache via `WeakMap<Page, Map<string, string>>`.
+- Preserve existing HTML cleaning output and current options (`maxAttrLen`, `maxContentLen`).
+- Diff behavior mirrors `snapshot` no-change/full/diff handling.
+
+#### `pageMarkdown`
+- Update to `getPageMarkdown(page, opts = {})` with `showDiffSinceLastCall` and optional `search`.
+- Maintain per-page snapshot cache via `WeakMap<Page, string>`.
+- Preserve current readability extraction and markdown structure.
+- Diff behavior mirrors `cleanHTML`.
+
+### 2) Relay JSONL CDP Logging
+
+#### Logging module
+- Add `relay/src/cdp-log.js` to encapsulate JSONL writing:
+  - file path default: `~/.browserforce/cdp.jsonl`
+  - env overrides:
+    - `BROWSERFORCE_CDP_LOG_FILE_PATH`
+    - `BROWSERFORCE_CDP_LOG_MAX_STRING_LENGTH`
+  - truncating replacer for large strings + circular safety
+  - async append queue to preserve ordering
+  - truncate file on relay startup (approved behavior)
+
+#### Relay integration points (`relay/src/index.js`)
+- Instantiate logger once in `RelayServer` lifecycle.
+- Log entries with shape `{ timestamp, direction, message, clientId?, source? }`.
+- Directions:
+  - `from-playwright`: inbound CDP client commands
+  - `to-extension`: forwarded `cdpCommand` payloads
+  - `from-extension`: inbound extension `cdpEvent`
+  - `to-playwright`: outbound events/responses sent to CDP clients
+- Hook points:
+  - `_handleCdpClientMessage`
+  - `_forwardToTab` / `_sendToExt` path for `cdpCommand`
+  - `_handleCdpEventFromExt`
+  - `_broadcastCdp` and direct response send paths
+
+### 3) Test Strategy
+
+#### MCP tests
+- Extend `mcp/test/exec-engine-plugins.test.js` (integration surface for `buildExecContext` helpers):
+  - snapshot returns no-change message on repeated identical calls
+  - snapshot returns diff on small change
+  - `cleanHTML`/`pageMarkdown` support `showDiffSinceLastCall: false` full output fallback
+- Keep existing pure diff unit tests in `mcp/test/mcp-tools.test.js` intact.
+
+#### Relay tests
+- Extend `relay/test/relay-server.test.js` with `CDP Logging` suite:
+  - log file created/truncated on startup
+  - command forward and event forward paths produce JSONL entries with expected directions/methods
+  - entries are valid JSON per line and queryable with `jq`-style field access
+
+### 4) Documentation
+- Update user-facing docs (likely `README.md`/`GUIDE.md`) to include:
+  - new helper parameters and defaults
+  - no-change messaging semantics
+  - CDP JSONL path and example `jq` command
+
+## Risks and Mitigations
+- Behavior shift from full outputs to diff-by-default may surprise existing flows.
+  - Mitigation: explicit docs + clear no-change/full fallback message.
+- High-volume CDP logs can grow quickly.
+  - Mitigation: per-start truncation plus string length truncation controls.
+- Logging must not affect CDP routing correctness.
+  - Mitigation: append queue is fire-and-forget and never blocks forwarding decisions.
+
+## Acceptance Criteria
+- Repeated helper calls default to diff behavior with playwriter-like semantics.
+- `showDiffSinceLastCall: false` reliably returns full output.
+- Relay writes `~/.browserforce/cdp.jsonl` with structured entries for command/event/response traffic.
+- New/updated tests pass in `mcp` and `relay` packages.
+- Docs explain feature usage and debugging workflow.

From 090dd93613ff25aa55736af303167f221a871a13 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:17:44 +0530
Subject: [PATCH 023/192] test(mcp): add failing tests for diff-aware helper
 wiring

---
 mcp/test/exec-engine-plugins.test.js | 114 +++++++++++++++++++++++++++
 1 file changed, 114 insertions(+)

diff --git a/mcp/test/exec-engine-plugins.test.js b/mcp/test/exec-engine-plugins.test.js
index 05e3907..9df25bc 100644
--- a/mcp/test/exec-engine-plugins.test.js
+++ b/mcp/test/exec-engine-plugins.test.js
@@ -5,6 +5,73 @@ import { buildExecContext, runCode, formatResult } from '../src/exec-engine.js';
 const mockPage = { isClosed: () => false, url: () => 'about:blank', title: async () => 'Test' };
 const mockCtx = { pages: () => [mockPage] };
 
+function createSnapshotPage() {
+  return {
+    isClosed: () => false,
+    url: () => 'https://example.test',
+    title: async () => 'Snapshot Test',
+    evaluate: async (_fn, arg) => {
+      if (arg && typeof arg === 'object' && Array.isArray(arg.testIdAttrs)) {
+        return {};
+      }
+      return {
+        role: 'WebArea',
+        name: '',
+        children: [
+          {
+            role: 'main',
+            name: '',
+            children: [{ role: 'button', name: 'Submit', children: [] }],
+          },
+        ],
+      };
+    },
+  };
+}
+
+function createCleanHtmlPage() {
+  return {
+    isClosed: () => false,
+    evaluate: async (_fn, arg) => {
+      if (arg && typeof arg === 'object' && Object.hasOwn(arg, 'maxAttrLen')) {
+        return '<html><body><main>clean body</main></body></html>';
+      }
+      throw new Error('Unexpected evaluate call in cleanHTML test');
+    },
+  };
+}
+
+function createPageMarkdownPage() {
+  return {
+    isClosed: () => false,
+    evaluate: async (arg) => {
+      if (typeof arg === 'function') {
+        const fnSource = arg.toString();
+        if (fnSource.includes('!!globalThis.__readability')) {
+          return true;
+        }
+        if (fnSource.includes('isProbablyReaderable')) {
+          return {
+            content: 'Markdown content line',
+            title: 'Markdown Title',
+            author: null,
+            excerpt: null,
+            siteName: null,
+            lang: 'en',
+            publishedTime: null,
+            wordCount: 3,
+            readable: true,
+          };
+        }
+      }
+      if (typeof arg === 'string') {
+        return undefined;
+      }
+      throw new Error('Unexpected evaluate call in pageMarkdown test');
+    },
+  };
+}
+
 test('plugin helpers are available in execute scope', async () => {
   const pluginHelpers = {
     myHelper: async (page, ctx, state, arg) => `result:${arg}`,
@@ -64,3 +131,50 @@ test('formatResult returns multi-content for labeled screenshot sentinel', () =>
   assert.equal(formatted[1].type, 'text');
   assert.ok(formatted[1].text.includes('Labels: 1 interactive elements'));
 });
+
+test('snapshot diff wiring returns full, then no-change guidance, then full when disabled', async () => {
+  const page = createSnapshotPage();
+  const ctx = buildExecContext(page, { pages: () => [page] }, {}, {}, {});
+
+  const first = await ctx.snapshot({ showDiffSinceLastCall: true });
+  assert.ok(first.includes('Page: Snapshot Test (https://example.test)'));
+  assert.ok(first.includes('- button "Submit" [ref=e1]'));
+
+  const second = await ctx.snapshot({ showDiffSinceLastCall: true });
+  assert.ok(second.includes('No changes since last snapshot'));
+  assert.ok(second.includes('showDiffSinceLastCall: false'));
+
+  const full = await ctx.snapshot({ showDiffSinceLastCall: false });
+  assert.ok(full.includes('Page: Snapshot Test (https://example.test)'));
+});
+
+test('cleanHTML diff wiring returns no-change guidance on identical repeated calls', async () => {
+  const page = createCleanHtmlPage();
+  const ctx = buildExecContext(page, { pages: () => [page] }, {}, {}, {});
+
+  const first = await ctx.cleanHTML('body', { showDiffSinceLastCall: true });
+  assert.ok(first.includes('<main>clean body</main>'));
+
+  const second = await ctx.cleanHTML('body', { showDiffSinceLastCall: true });
+  assert.ok(second.includes('No changes since last call'));
+  assert.ok(second.includes('showDiffSinceLastCall: false'));
+
+  const full = await ctx.cleanHTML('body', { showDiffSinceLastCall: false });
+  assert.ok(full.includes('<main>clean body</main>'));
+});
+
+test('pageMarkdown option forwarding and diff wiring returns no-change guidance on repeated calls', async () => {
+  const page = createPageMarkdownPage();
+  const ctx = buildExecContext(page, { pages: () => [page] }, {}, {}, {});
+
+  const first = await ctx.pageMarkdown({ showDiffSinceLastCall: true });
+  assert.ok(first.includes('# Markdown Title'));
+  assert.ok(first.includes('Markdown content line'));
+
+  const second = await ctx.pageMarkdown({ showDiffSinceLastCall: true });
+  assert.ok(second.includes('No changes since last call'));
+  assert.ok(second.includes('showDiffSinceLastCall: false'));
+
+  const full = await ctx.pageMarkdown({ showDiffSinceLastCall: false });
+  assert.ok(full.includes('# Markdown Title'));
+});

From 16585e9dd4437683d7b77c3f5d9c963fe60a332b Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:24:50 +0530
Subject: [PATCH 024/192] feat(mcp): add playwriter-style diff mode to snapshot
 and content helpers

---
 mcp/src/clean-html.js    | 25 +++++++++++++-
 mcp/src/exec-engine.js   | 25 +++++++++++---
 mcp/src/page-markdown.js | 70 +++++++++++++++++++++++++++++++++++++++-
 3 files changed, 114 insertions(+), 6 deletions(-)

diff --git a/mcp/src/clean-html.js b/mcp/src/clean-html.js
index c47261b..e34480d 100644
--- a/mcp/src/clean-html.js
+++ b/mcp/src/clean-html.js
@@ -1,18 +1,23 @@
 // Clean HTML extraction — runs entirely in the browser via page.evaluate().
 // Strips scripts, styles, decorative elements; keeps semantic attributes.
 
+import { createSmartDiff } from './snapshot.js';
+
+const lastHtmlSnapshots = new WeakMap();
+
 /**
  * Extracts cleaned HTML from a Playwright page or locator.
  * All processing happens in-page via DOM manipulation — no server-side parsing deps.
  *
  * @param {import('playwright-core').Page} page
  * @param {string} [selector] - CSS selector to scope extraction (default: document)
- * @param {{ maxAttrLen?: number, maxContentLen?: number }} [opts]
+ * @param {{ maxAttrLen?: number, maxContentLen?: number, showDiffSinceLastCall?: boolean }} [opts]
  * @returns {Promise<string>}
  */
 export async function getCleanHTML(page, selector, opts = {}) {
   const maxAttrLen = opts.maxAttrLen ?? 200;
   const maxContentLen = opts.maxContentLen ?? 500;
+  const showDiffSinceLastCall = opts.showDiffSinceLastCall ?? true;
 
   const html = await page.evaluate(({ selector, maxAttrLen, maxContentLen }) => {
     const TAGS_TO_REMOVE = new Set([
@@ -162,5 +167,23 @@ export async function getCleanHTML(page, selector, opts = {}) {
     return root.outerHTML || root.innerHTML || '';
   }, { selector: selector || null, maxAttrLen, maxContentLen });
 
+  let pageSnapshots = lastHtmlSnapshots.get(page);
+  if (!pageSnapshots) {
+    pageSnapshots = new Map();
+    lastHtmlSnapshots.set(page, pageSnapshots);
+  }
+
+  const snapshotKey = selector || '__full_page__';
+  const previousSnapshot = pageSnapshots.get(snapshotKey);
+  pageSnapshots.set(snapshotKey, html);
+
+  if (showDiffSinceLastCall && previousSnapshot) {
+    const diffResult = createSmartDiff(previousSnapshot, html);
+    if (diffResult.type === 'no-change') {
+      return 'No changes since last call. Use showDiffSinceLastCall: false to see full content.';
+    }
+    return diffResult.content;
+  }
+
   return html;
 }
diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index 96ca77c..064d297 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -7,7 +7,7 @@ import { homedir } from 'node:os';
 import { fileURLToPath } from 'node:url';
 import { spawn } from 'node:child_process';
 import {
-  TEST_ID_ATTRS,
+  TEST_ID_ATTRS, createSmartDiff,
   buildSnapshotText, parseSearchPattern, annotateStableAttrs,
 } from './snapshot.js';
 import { screenshotWithLabels } from './a11y-labels.js';
@@ -409,6 +409,7 @@ export class CodeExecutionTimeoutError extends Error {
 // instead of referencing module-level singletons.
 export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {}, pluginHelpers = {}) {
   const { consoleLogs, setupConsoleCapture } = consoleHelpers;
+  const lastSnapshots = userState.__lastSnapshots || (userState.__lastSnapshots = new WeakMap());
 
   const activePage = () => {
     if (userState.page && !userState.page.isClosed()) return userState.page;
@@ -416,7 +417,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     throw new Error('No active page. Create one first: state.page = await context.newPage()');
   };
 
-  const snapshot = async ({ selector, search } = {}) => {
+  const snapshot = async ({ selector, search, showDiffSinceLastCall = true } = {}) => {
     const page = activePage();
     const axRoot = await getAccessibilityTree(page, selector);
     if (!axRoot) return 'No accessibility tree available for this page.';
@@ -429,7 +430,23 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
       : '';
     const title = await page.title().catch(() => '');
     const pageUrl = page.url();
-    return `Page: ${title} (${pageUrl})\nRefs: ${refs.length} interactive elements\n\n${snapshotText}${refTable}`;
+    const fullSnapshot = `Page: ${title} (${pageUrl})\nRefs: ${refs.length} interactive elements\n\n${snapshotText}${refTable}`;
+
+    const shouldCacheSnapshot = !selector;
+    const previousSnapshot = shouldCacheSnapshot ? lastSnapshots.get(page) : undefined;
+    if (shouldCacheSnapshot) {
+      lastSnapshots.set(page, fullSnapshot);
+    }
+
+    if (showDiffSinceLastCall && previousSnapshot && shouldCacheSnapshot) {
+      const diffResult = createSmartDiff(previousSnapshot, fullSnapshot);
+      if (diffResult.type === 'no-change') {
+        return 'No changes since last snapshot. Use showDiffSinceLastCall: false to see full content.';
+      }
+      return diffResult.content;
+    }
+
+    return fullSnapshot;
   };
 
   const waitForPageLoad = (opts = {}) =>
@@ -458,7 +475,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
 
   const cleanHTML = (selector, opts) => getCleanHTML(activePage(), selector, opts);
 
-  const pageMarkdown = () => getPageMarkdown(activePage());
+  const pageMarkdown = (opts) => getPageMarkdown(activePage(), opts);
 
   // Wrap plugin helpers to auto-inject (page, ctx, state) as first three args
   const wrappedPluginHelpers = {};
diff --git a/mcp/src/page-markdown.js b/mcp/src/page-markdown.js
index cd2ece9..fe8424b 100644
--- a/mcp/src/page-markdown.js
+++ b/mcp/src/page-markdown.js
@@ -4,10 +4,21 @@
 import { readFileSync } from 'node:fs';
 import { join, dirname } from 'node:path';
 import { fileURLToPath } from 'node:url';
+import { createSmartDiff } from './snapshot.js';
 
 const __dirname = dirname(fileURLToPath(import.meta.url));
 
 let readabilityCode = null;
+const lastMarkdownSnapshots = new WeakMap();
+
+function isRegExp(value) {
+  return (
+    typeof value === 'object' &&
+    value !== null &&
+    typeof value.test === 'function' &&
+    typeof value.exec === 'function'
+  );
+}
 
 function getReadabilityCode() {
   if (readabilityCode) return readabilityCode;
@@ -21,9 +32,13 @@ function getReadabilityCode() {
  * Strips nav, ads, sidebars — returns article body with metadata.
  *
  * @param {import('playwright-core').Page} page
+ * @param {{ search?: string | RegExp, showDiffSinceLastCall?: boolean }} [opts]
  * @returns {Promise<string>}
  */
-export async function getPageMarkdown(page) {
+export async function getPageMarkdown(page, opts = {}) {
+  const search = opts.search;
+  const showDiffSinceLastCall = opts.showDiffSinceLastCall ?? true;
+
   // Inject Readability if not already present
   const hasReadability = await page.evaluate(() => !!globalThis.__readability);
   if (!hasReadability) {
@@ -110,5 +125,58 @@ export async function getPageMarkdown(page) {
     markdown = markdown.toWellFormed();
   }
 
+  const previousSnapshot = lastMarkdownSnapshots.get(page);
+  lastMarkdownSnapshots.set(page, markdown);
+
+  if (showDiffSinceLastCall && previousSnapshot) {
+    const diffResult = createSmartDiff(previousSnapshot, markdown);
+    if (diffResult.type === 'no-change') {
+      return 'No changes since last call. Use showDiffSinceLastCall: false to see full content.';
+    }
+    return diffResult.content;
+  }
+
+  if (search) {
+    const lines = markdown.split('\n');
+    const matchIndices = [];
+
+    for (let i = 0; i < lines.length; i++) {
+      const line = lines[i];
+      const isMatch = isRegExp(search)
+        ? search.test(line)
+        : line.toLowerCase().includes(String(search).toLowerCase());
+      if (isMatch) {
+        matchIndices.push(i);
+        if (matchIndices.length >= 10) break;
+      }
+    }
+
+    if (matchIndices.length === 0) {
+      return 'No matches found';
+    }
+
+    const CONTEXT_LINES = 5;
+    const includedLines = new Set();
+    for (const idx of matchIndices) {
+      const start = Math.max(0, idx - CONTEXT_LINES);
+      const end = Math.min(lines.length - 1, idx + CONTEXT_LINES);
+      for (let i = start; i <= end; i++) {
+        includedLines.add(i);
+      }
+    }
+
+    const sortedIndices = [...includedLines].sort((a, b) => a - b);
+    const resultLines = [];
+    for (let i = 0; i < sortedIndices.length; i++) {
+      const lineIdx = sortedIndices[i];
+      if (i > 0 && sortedIndices[i - 1] !== lineIdx - 1) {
+        resultLines.push('---');
+      }
+      resultLines.push(lines[lineIdx]);
+    }
+
+    return resultLines.join('\n');
+  }
+
   return markdown;
 }

From 8031301b0044afcc2a7354bac47f5afb8230ee0e Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:32:44 +0530
Subject: [PATCH 025/192] fix(mcp): preserve pageMarkdown search semantics with
 diff mode

---
 mcp/src/page-markdown.js             | 31 +++++++++++++++++-----------
 mcp/test/exec-engine-plugins.test.js | 29 +++++++++++++++++++++++---
 2 files changed, 45 insertions(+), 15 deletions(-)

diff --git a/mcp/src/page-markdown.js b/mcp/src/page-markdown.js
index fe8424b..f17be92 100644
--- a/mcp/src/page-markdown.js
+++ b/mcp/src/page-markdown.js
@@ -20,6 +20,16 @@ function isRegExp(value) {
   );
 }
 
+function lineMatchesSearch(search, line) {
+  if (!isRegExp(search)) {
+    return line.toLowerCase().includes(String(search).toLowerCase());
+  }
+  if (search.global || search.sticky) {
+    search.lastIndex = 0;
+  }
+  return search.test(line);
+}
+
 function getReadabilityCode() {
   if (readabilityCode) return readabilityCode;
   const bundlePath = join(__dirname, 'vendor', 'readability.bundle.js');
@@ -128,24 +138,13 @@ export async function getPageMarkdown(page, opts = {}) {
   const previousSnapshot = lastMarkdownSnapshots.get(page);
   lastMarkdownSnapshots.set(page, markdown);
 
-  if (showDiffSinceLastCall && previousSnapshot) {
-    const diffResult = createSmartDiff(previousSnapshot, markdown);
-    if (diffResult.type === 'no-change') {
-      return 'No changes since last call. Use showDiffSinceLastCall: false to see full content.';
-    }
-    return diffResult.content;
-  }
-
   if (search) {
     const lines = markdown.split('\n');
     const matchIndices = [];
 
     for (let i = 0; i < lines.length; i++) {
       const line = lines[i];
-      const isMatch = isRegExp(search)
-        ? search.test(line)
-        : line.toLowerCase().includes(String(search).toLowerCase());
-      if (isMatch) {
+      if (lineMatchesSearch(search, line)) {
         matchIndices.push(i);
         if (matchIndices.length >= 10) break;
       }
@@ -178,5 +177,13 @@ export async function getPageMarkdown(page, opts = {}) {
     return resultLines.join('\n');
   }
 
+  if (showDiffSinceLastCall && previousSnapshot) {
+    const diffResult = createSmartDiff(previousSnapshot, markdown);
+    if (diffResult.type === 'no-change') {
+      return 'No changes since last call. Use showDiffSinceLastCall: false to see full content.';
+    }
+    return diffResult.content;
+  }
+
   return markdown;
 }
diff --git a/mcp/test/exec-engine-plugins.test.js b/mcp/test/exec-engine-plugins.test.js
index 9df25bc..3f87d44 100644
--- a/mcp/test/exec-engine-plugins.test.js
+++ b/mcp/test/exec-engine-plugins.test.js
@@ -41,7 +41,8 @@ function createCleanHtmlPage() {
   };
 }
 
-function createPageMarkdownPage() {
+function createPageMarkdownPage(content = 'Markdown content line', options = {}) {
+  const title = options.title === undefined ? 'Markdown Title' : options.title;
   return {
     isClosed: () => false,
     evaluate: async (arg) => {
@@ -52,8 +53,8 @@ function createPageMarkdownPage() {
         }
         if (fnSource.includes('isProbablyReaderable')) {
           return {
-            content: 'Markdown content line',
-            title: 'Markdown Title',
+            content,
+            title,
             author: null,
             excerpt: null,
             siteName: null,
@@ -178,3 +179,25 @@ test('pageMarkdown option forwarding and diff wiring returns no-change guidance
   const full = await ctx.pageMarkdown({ showDiffSinceLastCall: false });
   assert.ok(full.includes('# Markdown Title'));
 });
+
+test('pageMarkdown search takes precedence over diff mode on repeated calls', async () => {
+  const page = createPageMarkdownPage('alpha line\nfind me here\nomega line');
+  const ctx = buildExecContext(page, { pages: () => [page] }, {}, {}, {});
+
+  await ctx.pageMarkdown({ showDiffSinceLastCall: true });
+  const searched = await ctx.pageMarkdown({ search: 'find me' });
+
+  assert.ok(searched.includes('find me here'));
+  assert.ok(!searched.includes('No changes since last call'));
+});
+
+test('pageMarkdown search resets regex state for g/y regex flags', async () => {
+  const page = createPageMarkdownPage('target on only line', { title: null });
+  const ctx = buildExecContext(page, { pages: () => [page] }, {}, {}, {});
+  const search = /target/g;
+  search.lastIndex = 1;
+
+  const result = await ctx.pageMarkdown({ search, showDiffSinceLastCall: false });
+  assert.ok(result.includes('target on only line'));
+  assert.ok(!result.includes('No matches found'));
+});

From 9edf832d5abde62886fae5d6a3ed696ff508de82 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:35:19 +0530
Subject: [PATCH 026/192] test(mcp): add prompt regression guards for tactical
 guidance

---
 mcp/src/index.js           | 18 ++++++++++++++++++
 mcp/test/mcp-tools.test.js | 17 +++++++++++++++++
 2 files changed, 35 insertions(+)

diff --git a/mcp/src/index.js b/mcp/src/index.js
index 51edc83..1fd080a 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -241,6 +241,21 @@ If snapshot shows [ref=some-id] for an element with a data-testid or id:
 For text content:
   const text = await state.page.locator('role=heading').textContent();
 
+Selector priority:
+  1. Use [ref=...] locators from snapshot output immediately after observing
+  2. Use role/name locators from snapshot
+  3. Use stable test IDs (data-testid) if present
+  4. Avoid brittle nth()/deep CSS selectors unless no stable option exists
+
+Before interacting, handle page blockers (cookie/consent banners, age gates, login popups):
+  const blockers = await snapshot({ search: /cookie|consent|accept|reject|allow|age|verify|login|sign.in/i });
+  // Dismiss blockers first, then continue with the main task
+
+Avoid stale locator usage:
+  // BAD: using a stale locator from an old snapshot after DOM changes
+  // GOOD: refresh observation first, then act with new refs/locators
+  await snapshot();
+
 ═══ COMMON PATTERNS ═══
 
 Navigate and read:
@@ -272,6 +287,9 @@ Wait for specific element:
 Debug with console logs:
   return getLogs({ count: 20 });
 
+When you need the full tree instead of diff output:
+  return await snapshot({ showDiffSinceLastCall: false });
+
 ═══ ANTI-PATTERNS ═══
 
 ✗ Don't navigate the user's existing tabs — create your own via context.newPage()
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 4bb507d..64e9e37 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -113,6 +113,23 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('ANTI-PATTERN') || promptBlock.includes('Don\'t') || promptBlock.includes('✗'), 'should include anti-patterns');
   });
 
+  it('execute prompt includes tactical anti-pattern and decision guidance', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
+      'utf8'
+    );
+
+    const promptStart = source.indexOf('const EXECUTE_PROMPT');
+    const promptEnd = source.indexOf("server.tool(\n  'execute'");
+    const promptBlock = source.slice(promptStart, promptEnd);
+
+    assert.ok(promptBlock.includes('Selector priority'), 'should include selector ranking guidance');
+    assert.ok(promptBlock.includes('login popups'), 'should include login popup handling');
+    assert.ok(promptBlock.includes('cookie') || promptBlock.includes('consent'), 'should include consent modal handling');
+    assert.ok(promptBlock.includes('stale locator'), 'should include stale locator warning');
+    assert.ok(promptBlock.includes('snapshot({ showDiffSinceLastCall'), 'should include diff usage guidance');
+  });
+
   it('execute tool has code and optional timeout params', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),

From 8d31393d53fddbba7f6cf7cb4dad413ba535ebef Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:38:10 +0530
Subject: [PATCH 027/192] test(relay): add failing coverage for cdp jsonl
 traffic logging

---
 relay/test/relay-server.test.js | 143 ++++++++++++++++++++++++++++++++
 1 file changed, 143 insertions(+)

diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index 593bc70..89d3312 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -72,6 +72,15 @@ function sleep(ms) {
   return new Promise((r) => setTimeout(r, ms));
 }
 
+function readJsonlEntries(logFilePath) {
+  const raw = fs.readFileSync(logFilePath, 'utf8').trim();
+  if (!raw) return [];
+  return raw
+    .split('\n')
+    .filter(Boolean)
+    .map((line) => JSON.parse(line));
+}
+
 // ─── Token Persistence ───────────────────────────────────────────────────────
 
 describe('Token Persistence', () => {
@@ -928,6 +937,140 @@ describe('CDP Event Forwarding', () => {
   });
 });
 
+// ─── CDP JSONL Logging ──────────────────────────────────────────────────────
+
+describe('CDP JSONL Logging', () => {
+  let logDir;
+  let logFilePath;
+  let originalLogFileEnv;
+
+  beforeEach(() => {
+    logDir = fs.mkdtempSync(path.join(os.tmpdir(), 'bf-cdp-log-'));
+    logFilePath = path.join(logDir, 'cdp-traffic.jsonl');
+    originalLogFileEnv = process.env.BROWSERFORCE_CDP_LOG_FILE_PATH;
+    process.env.BROWSERFORCE_CDP_LOG_FILE_PATH = logFilePath;
+  });
+
+  afterEach(() => {
+    if (originalLogFileEnv === undefined) delete process.env.BROWSERFORCE_CDP_LOG_FILE_PATH;
+    else process.env.BROWSERFORCE_CDP_LOG_FILE_PATH = originalLogFileEnv;
+    fs.rmSync(logDir, { recursive: true, force: true });
+  });
+
+  it('creates and truncates the CDP JSONL log file on relay start', async () => {
+    let firstRelay;
+    let secondRelay;
+
+    try {
+      firstRelay = new RelayServer(getRandomPort());
+      await firstRelay.start({ writeCdpUrl: false });
+      assert.equal(fs.existsSync(logFilePath), true, 'CDP log file should be created on start');
+
+      firstRelay.stop();
+      firstRelay = null;
+
+      fs.writeFileSync(logFilePath, '{"stale":true}\n');
+      assert.ok(fs.statSync(logFilePath).size > 0, 'CDP log file should contain stale data before restart');
+
+      secondRelay = new RelayServer(getRandomPort());
+      await secondRelay.start({ writeCdpUrl: false });
+      assert.equal(fs.existsSync(logFilePath), true, 'CDP log file should still exist after restart');
+      assert.equal(fs.readFileSync(logFilePath, 'utf8'), '', 'CDP log file should be truncated on each start');
+    } finally {
+      secondRelay?.stop();
+      firstRelay?.stop();
+    }
+  });
+
+  it('logs command/event traffic with direction and method in JSONL entries', async () => {
+    let relay;
+    let ext;
+    let cdp;
+
+    try {
+      relay = new RelayServer(getRandomPort());
+      await relay.start({ writeCdpUrl: false });
+
+      ext = await connectWs(`ws://127.0.0.1:${relay.port}/extension`, {
+        headers: { Origin: 'chrome-extension://test' },
+      });
+
+      ext.on('message', (data) => {
+        const msg = JSON.parse(data.toString());
+        if (msg.method === 'ping') { ext.send(JSON.stringify({ method: 'pong' })); return; }
+        if (msg.id === undefined) return;
+
+        if (msg.method === 'createTab') {
+          ext.send(JSON.stringify({
+            id: msg.id,
+            result: {
+              tabId: 501,
+              targetId: 'real-target-501',
+              sessionId: msg.params.sessionId,
+              targetInfo: {
+                targetId: 'real-target-501',
+                type: 'page',
+                title: 'Logging Test',
+                url: msg.params.url || 'about:blank',
+              },
+            },
+          }));
+        } else if (msg.method === 'cdpCommand' && msg.params.method === 'Runtime.evaluate') {
+          ext.send(JSON.stringify({
+            id: msg.id,
+            result: { result: { type: 'string', value: 'ok' } },
+          }));
+        }
+      });
+
+      cdp = await connectWs(`ws://127.0.0.1:${relay.port}/cdp?token=${relay.authToken}`);
+      const cdpMessages = [];
+      cdp.on('message', (data) => cdpMessages.push(JSON.parse(data.toString())));
+
+      cdp.send(JSON.stringify({ id: 1, method: 'Target.createTarget', params: { url: 'https://example.com' } }));
+      await sleep(300);
+
+      const attached = cdpMessages.find((m) => m.method === 'Target.attachedToTarget');
+      assert.ok(attached, 'Expected Target.attachedToTarget after createTarget');
+      const sessionId = attached.params.sessionId;
+
+      cdp.send(JSON.stringify({
+        id: 2,
+        method: 'Runtime.evaluate',
+        params: { expression: '"ok"' },
+        sessionId,
+      }));
+      await sleep(200);
+
+      ext.send(JSON.stringify({
+        method: 'cdpEvent',
+        params: {
+          tabId: 501,
+          method: 'Page.loadEventFired',
+          params: { timestamp: 42 },
+        },
+      }));
+      await sleep(300);
+
+      assert.equal(fs.existsSync(logFilePath), true, 'CDP log file should exist');
+      const entries = readJsonlEntries(logFilePath);
+      const directions = new Set(entries.map((entry) => entry.direction));
+      const methods = entries.map((entry) => entry?.message?.method).filter(Boolean);
+
+      assert.ok(directions.has('from-playwright'), 'Should log from-playwright direction');
+      assert.ok(directions.has('to-extension'), 'Should log to-extension direction');
+      assert.ok(directions.has('from-extension'), 'Should log from-extension direction');
+      assert.ok(directions.has('to-playwright'), 'Should log to-playwright direction');
+      assert.ok(methods.includes('Runtime.evaluate'), 'Should log Runtime.evaluate method');
+      assert.ok(methods.includes('Page.loadEventFired'), 'Should log Page.loadEventFired method');
+    } finally {
+      cdp?.close();
+      ext?.close();
+      relay?.stop();
+    }
+  });
+});
+
 // ─── Tab Lifecycle ───────────────────────────────────────────────────────────
 
 describe('Tab Lifecycle', () => {

From 409ab6bd77dac726615a50b9e24de62b10ae8629 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:39:21 +0530
Subject: [PATCH 028/192] feat(mcp): expand execute prompt with tactical web
 automation playbooks

---
 mcp/src/index.js           | 70 ++++++++++++++++++++++++++++++++++++++
 mcp/test/mcp-tools.test.js | 15 ++++++++
 2 files changed, 85 insertions(+)

diff --git a/mcp/src/index.js b/mcp/src/index.js
index 1fd080a..d46599c 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -256,6 +256,76 @@ Avoid stale locator usage:
   // GOOD: refresh observation first, then act with new refs/locators
   await snapshot();
 
+Typing text with newlines:
+  // Use fill() for multiline blocks to avoid accidental Enter key submissions
+  await state.page.locator('role=textbox[name="Message"]').fill('Line 1\\nLine 2');
+
+═══ TACTICAL ANTI-PATTERNS ═══
+
+Popup control:
+  ✗ Don’t click through a popup without confirming what changed
+  ✓ Dismiss popup, then run snapshot() immediately to confirm main UI is usable
+
+Consent blockers:
+  ✗ Don’t continue form/page actions while consent banners block focus
+  ✓ Handle cookie/consent overlays first, then retry the intended action
+
+Stale locators:
+  ✗ Don’t reuse [ref=...] values after DOM/nav updates
+  ✓ Refresh snapshot() and use the newest refs/role locators
+
+Newline typing:
+  ✗ Don’t use keyboard Enter loops for multiline textareas unless explicitly needed
+  ✓ Prefer locator.fill('line1\\nline2') for deterministic multiline input
+
+═══ EXTRACTION DECISION TREE ═══
+
+snapshot vs cleanHTML vs pageMarkdown:
+  1) Use snapshot() when you need current interactive structure, labels, and refs.
+  2) Use cleanHTML(selector?) when you need structured DOM content for parsing/extraction.
+  3) Use pageMarkdown() for article/blog/news pages where nav/ads should be removed.
+  4) Use screenshotWithAccessibilityLabels() only when layout/visual evidence is required.
+
+═══ DEBUGGING WORKFLOW ═══
+
+Combine snapshot + logs:
+  1) snapshot({ search: /target text|button|error/i }) to verify element presence and naming
+  2) getLogs({ count: 30 }) for runtime/network/console errors
+  3) page.evaluate(() => { ...visibility checks... }) to validate hidden/disabled/overlay states
+
+Example visibility check:
+  return await state.page.evaluate(() => {
+    const el = document.querySelector('[data-testid="submit"]');
+    if (!el) return { found: false };
+    const s = getComputedStyle(el);
+    const r = el.getBoundingClientRect();
+    return { found: true, visible: s.display !== 'none' && s.visibility !== 'hidden' && r.width > 0 && r.height > 0 };
+  });
+
+═══ ADVANCED PATTERNS ═══
+
+Authenticated fetch:
+  // Reuse browser session cookies/headers from the current page context
+  return await state.page.evaluate(async () => {
+    const res = await fetch('/api/me', { credentials: 'include' });
+    return { status: res.status, body: await res.text() };
+  });
+
+Network interception:
+  await state.page.route('**/api/**', async (route) => {
+    const request = route.request();
+    // Inspect/modify request here if needed before continuing
+    await route.continue();
+  });
+
+Downloads:
+  // Use expect_download pattern and save path after click/navigation trigger
+  const [download] = await Promise.all([
+    state.page.waitForEvent('download'),
+    state.page.locator('role=button[name="Export CSV"]').click(),
+  ]);
+  return { suggestedFilename: download.suggestedFilename() };
+
 ═══ COMMON PATTERNS ═══
 
 Navigate and read:
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 64e9e37..dcff140 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -130,6 +130,21 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('snapshot({ showDiffSinceLastCall'), 'should include diff usage guidance');
   });
 
+  it('execute prompt includes tool-selection and debugging decision trees', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
+      'utf8'
+    );
+    const promptStart = source.indexOf('const EXECUTE_PROMPT');
+    const promptEnd = source.indexOf("server.tool(\n  'execute'");
+    const promptBlock = source.slice(promptStart, promptEnd);
+
+    assert.ok(promptBlock.includes('snapshot vs cleanHTML vs pageMarkdown'), 'should include extraction decision tree');
+    assert.ok(promptBlock.includes('Combine snapshot + logs'), 'should include debugging workflow');
+    assert.ok(promptBlock.includes('Authenticated fetch'), 'should include authenticated fetch pattern');
+    assert.ok(promptBlock.includes('Downloads'), 'should include download pattern');
+  });
+
   it('execute tool has code and optional timeout params', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),

From 0cb0585031214e5809b477505887e77a35895491 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:42:50 +0530
Subject: [PATCH 029/192] test(relay): stabilize failing cdp logging tests

---
 relay/test/relay-server.test.js | 32 ++++++++++++++++++++++++++------
 1 file changed, 26 insertions(+), 6 deletions(-)

diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index 89d3312..3b36b2a 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -81,6 +81,20 @@ function readJsonlEntries(logFilePath) {
     .map((line) => JSON.parse(line));
 }
 
+async function waitForCondition(check, {
+  timeoutMs = 3000,
+  intervalMs = 25,
+  description = 'condition',
+} = {}) {
+  const startedAt = Date.now();
+  while (Date.now() - startedAt < timeoutMs) {
+    const result = check();
+    if (result) return result;
+    await sleep(intervalMs);
+  }
+  throw new Error(`Timed out waiting for ${description} after ${timeoutMs}ms`);
+}
+
 // ─── Token Persistence ───────────────────────────────────────────────────────
 
 describe('Token Persistence', () => {
@@ -1028,10 +1042,10 @@ describe('CDP JSONL Logging', () => {
       cdp.on('message', (data) => cdpMessages.push(JSON.parse(data.toString())));
 
       cdp.send(JSON.stringify({ id: 1, method: 'Target.createTarget', params: { url: 'https://example.com' } }));
-      await sleep(300);
-
-      const attached = cdpMessages.find((m) => m.method === 'Target.attachedToTarget');
-      assert.ok(attached, 'Expected Target.attachedToTarget after createTarget');
+      const attached = await waitForCondition(
+        () => cdpMessages.find((m) => m.method === 'Target.attachedToTarget'),
+        { description: 'Target.attachedToTarget event after createTarget' },
+      );
       const sessionId = attached.params.sessionId;
 
       cdp.send(JSON.stringify({
@@ -1040,7 +1054,10 @@ describe('CDP JSONL Logging', () => {
         params: { expression: '"ok"' },
         sessionId,
       }));
-      await sleep(200);
+      await waitForCondition(
+        () => cdpMessages.find((m) => m.id === 2),
+        { description: 'Runtime.evaluate response' },
+      );
 
       ext.send(JSON.stringify({
         method: 'cdpEvent',
@@ -1050,7 +1067,10 @@ describe('CDP JSONL Logging', () => {
           params: { timestamp: 42 },
         },
       }));
-      await sleep(300);
+      await waitForCondition(
+        () => cdpMessages.find((m) => m.method === 'Page.loadEventFired'),
+        { description: 'Page.loadEventFired event routed to CDP client' },
+      );
 
       assert.equal(fs.existsSync(logFilePath), true, 'CDP log file should exist');
       const entries = readJsonlEntries(logFilePath);

From dfafc62f074513e6d16bc3ad8ba5560a7afd7d25 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:45:03 +0530
Subject: [PATCH 030/192] feat(mcp): expose refToLocator helper in execute
 context

---
 mcp/src/exec-engine.js     | 12 +++++++++++-
 mcp/src/index.js           |  8 +++++---
 mcp/test/mcp-tools.test.js | 10 ++++++++++
 3 files changed, 26 insertions(+), 4 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index 064d297..3edbb50 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -410,6 +410,7 @@ export class CodeExecutionTimeoutError extends Error {
 export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {}, pluginHelpers = {}) {
   const { consoleLogs, setupConsoleCapture } = consoleHelpers;
   const lastSnapshots = userState.__lastSnapshots || (userState.__lastSnapshots = new WeakMap());
+  const lastRefToLocator = userState.__lastRefToLocator || (userState.__lastRefToLocator = new WeakMap());
 
   const activePage = () => {
     if (userState.page && !userState.page.isClosed()) return userState.page;
@@ -425,6 +426,8 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     annotateStableAttrs(axRoot, stableIds);
     const searchPattern = parseSearchPattern(search);
     const { text: snapshotText, refs } = buildSnapshotText(axRoot, null, searchPattern);
+    const refMap = new Map(refs.map(({ ref, locator }) => [ref, locator]));
+    lastRefToLocator.set(page, refMap);
     const refTable = refs.length > 0
       ? '\n\n--- Ref → Locator ---\n' + refs.map(r => `${r.ref}: ${r.locator}`).join('\n')
       : '';
@@ -449,6 +452,13 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     return fullSnapshot;
   };
 
+  const refToLocator = ({ ref, page: targetPage } = {}) => {
+    const p = targetPage || activePage();
+    const map = lastRefToLocator.get(p);
+    if (!map) return null;
+    return map.get(ref) ?? null;
+  };
+
   const waitForPageLoad = (opts = {}) =>
     smartWaitForPageLoad(activePage(), opts.timeout ?? 30000);
 
@@ -490,7 +500,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
   return {
     ...wrappedPluginHelpers,           // plugin helpers spread first — built-ins always win
     page: defaultPage, context: ctx, state: userState,
-    snapshot, waitForPageLoad, getLogs, clearLogs,
+    snapshot, refToLocator, waitForPageLoad, getLogs, clearLogs,
     screenshotWithAccessibilityLabels, cleanHTML, pageMarkdown,
     fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout,
     TextEncoder, TextDecoder,
diff --git a/mcp/src/index.js b/mcp/src/index.js
index d46599c..b0c6ca8 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -134,6 +134,7 @@ Variables:
 
 Helpers:
   snapshot({ selector?, search? })   Accessibility tree as text. 10-100x cheaper than screenshots.
+  refToLocator({ ref })              Resolve a snapshot ref (e.g., e3) to a Playwright locator string.
   waitForPageLoad({ timeout? })      Smart load detection (filters analytics/ads, polls readyState).
   getLogs({ count? })                Browser console logs captured for current page.
   clearLogs()                        Clear captured console logs.
@@ -235,8 +236,9 @@ Use Playwright locators with accessibility roles (from snapshot output):
   await state.page.locator('role=textbox[name="Search"]').fill('query');
   await state.page.locator('role=link[name="Settings"]').click();
 
-If snapshot shows [ref=some-id] for an element with a data-testid or id:
-  await state.page.locator('[data-testid="some-id"]').click();
+If snapshot shows [ref=e3], resolve it with refToLocator({ ref }) before acting:
+  const locator = refToLocator({ ref: 'e3' });
+  if (locator) await state.page.locator(locator).click();
 
 For text content:
   const text = await state.page.locator('role=heading').textContent();
@@ -406,7 +408,7 @@ function registerExecuteTool(skillAppendix = '') {
     'execute',
     EXECUTE_PROMPT + skillAppendix,
     {
-      code: z.string().describe('JavaScript to run — page/context/state/snapshot/waitForPageLoad/getLogs/cleanHTML/pageMarkdown in scope'),
+      code: z.string().describe('JavaScript to run — page/context/state/snapshot/refToLocator/waitForPageLoad/getLogs/cleanHTML/pageMarkdown in scope'),
       timeout: z.number().optional().describe('Max execution time in ms (default: 30000)'),
     },
     async ({ code, timeout = 30000 }) => {
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index dcff140..261c87e 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -106,6 +106,7 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('snapshot'), 'should mention snapshot-first approach');
     assert.ok(promptBlock.includes('waitForPageLoad'), 'should mention waitForPageLoad');
     assert.ok(promptBlock.includes('screenshotWithAccessibilityLabels'), 'should mention screenshotWithAccessibilityLabels helper');
+    assert.ok(promptBlock.includes('refToLocator({ ref })'), 'should mention refToLocator helper usage');
     assert.ok(promptBlock.includes('cleanHTML'), 'should mention cleanHTML helper');
     assert.ok(promptBlock.includes('pageMarkdown'), 'should mention pageMarkdown helper');
     assert.ok(promptBlock.includes('newPage'), 'should mention creating new tabs');
@@ -182,6 +183,15 @@ describe('Tool Definitions', () => {
     assert.ok(!source.includes("'screenshot_with_labels'"), 'screenshot_with_labels tool should be removed');
     assert.ok(!source.includes('SCREENSHOT_LABELS_PROMPT'), 'dedicated screenshot prompt should be removed');
   });
+
+  it('exec context source exposes refToLocator helper', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/exec-engine.js'),
+      'utf8'
+    );
+
+    assert.ok(source.includes('refToLocator'), 'exec engine should expose refToLocator helper');
+  });
 });
 
 // ─── MCP Response Format ─────────────────────────────────────────────────────

From 96aff087a57937c621b4b3c38c335c12ae1666a5 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:50:37 +0530
Subject: [PATCH 031/192] feat(mcp): add getCDPSession helper for relay-safe
 raw CDP access

---
 mcp/src/exec-engine.js     | 10 +++++++++-
 mcp/src/index.js           |  6 ++++++
 mcp/test/mcp-tools.test.js |  2 ++
 3 files changed, 17 insertions(+), 1 deletion(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index 3edbb50..dfbc088 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -474,6 +474,14 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     if (consoleLogs) consoleLogs.set(activePage(), []);
   };
 
+  const getCDPSession = async ({ page: targetPage } = {}) => {
+    const p = targetPage || activePage();
+    if (!p || p.isClosed()) {
+      throw new Error('Cannot create CDP session for closed page');
+    }
+    return p.context().newCDPSession(p);
+  };
+
   const screenshotWithAccessibilityLabels = async ({ selector, interactiveOnly = true } = {}) => {
     const page = activePage();
     const { screenshot, snapshot: snapText, labelCount } = await screenshotWithLabels(page, {
@@ -500,7 +508,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
   return {
     ...wrappedPluginHelpers,           // plugin helpers spread first — built-ins always win
     page: defaultPage, context: ctx, state: userState,
-    snapshot, refToLocator, waitForPageLoad, getLogs, clearLogs,
+    snapshot, refToLocator, waitForPageLoad, getLogs, clearLogs, getCDPSession,
     screenshotWithAccessibilityLabels, cleanHTML, pageMarkdown,
     fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout,
     TextEncoder, TextDecoder,
diff --git a/mcp/src/index.js b/mcp/src/index.js
index b0c6ca8..d966fc2 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -148,6 +148,8 @@ Helpers:
   pageMarkdown()                     Article content via Mozilla Readability (Firefox Reader View).
                                      Strips nav/ads/sidebars. Returns title + metadata + body text.
                                      Falls back to raw body text for non-article pages.
+  getCDPSession({ page })            Create a relay-safe raw CDP session for a page.
+                                     Use this instead of page.context().newCDPSession(page).
 
 Globals: fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout, TextEncoder, TextDecoder
 
@@ -280,6 +282,10 @@ Newline typing:
   ✗ Don’t use keyboard Enter loops for multiline textareas unless explicitly needed
   ✓ Prefer locator.fill('line1\\nline2') for deterministic multiline input
 
+Raw CDP sessions:
+  ✗ Don’t call page.context().newCDPSession(page) directly
+  ✓ Use getCDPSession({ page }) for relay-safe CDP session creation
+
 ═══ EXTRACTION DECISION TREE ═══
 
 snapshot vs cleanHTML vs pageMarkdown:
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 261c87e..7008e4f 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -107,6 +107,7 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('waitForPageLoad'), 'should mention waitForPageLoad');
     assert.ok(promptBlock.includes('screenshotWithAccessibilityLabels'), 'should mention screenshotWithAccessibilityLabels helper');
     assert.ok(promptBlock.includes('refToLocator({ ref })'), 'should mention refToLocator helper usage');
+    assert.ok(promptBlock.includes('getCDPSession({ page })'), 'should mention relay-safe getCDPSession helper usage');
     assert.ok(promptBlock.includes('cleanHTML'), 'should mention cleanHTML helper');
     assert.ok(promptBlock.includes('pageMarkdown'), 'should mention pageMarkdown helper');
     assert.ok(promptBlock.includes('newPage'), 'should mention creating new tabs');
@@ -191,6 +192,7 @@ describe('Tool Definitions', () => {
     );
 
     assert.ok(source.includes('refToLocator'), 'exec engine should expose refToLocator helper');
+    assert.ok(source.includes('const getCDPSession = async'), 'exec engine should define getCDPSession helper');
   });
 });
 

From e81d78d3f4eddd42d0df482a8e40093297856aba Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:56:49 +0530
Subject: [PATCH 032/192] feat(mcp): add snapshot diff mode with
 showDiffSinceLastCall toggle

---
 mcp/src/exec-engine.js     | 37 ++++++++++++++++++++++++++-----------
 mcp/src/index.js           |  3 ++-
 mcp/test/mcp-tools.test.js |  9 +++++++++
 3 files changed, 37 insertions(+), 12 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index dfbc088..9c5ca6d 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -425,23 +425,33 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     const stableIds = await getStableIds(page, selector);
     annotateStableAttrs(axRoot, stableIds);
     const searchPattern = parseSearchPattern(search);
-    const { text: snapshotText, refs } = buildSnapshotText(axRoot, null, searchPattern);
-    const refMap = new Map(refs.map(({ ref, locator }) => [ref, locator]));
+    const { text: fullSnapshotText, refs: fullRefs } = buildSnapshotText(axRoot, null, null);
+    const refMap = new Map(fullRefs.map(({ ref, locator }) => [ref, locator]));
     lastRefToLocator.set(page, refMap);
-    const refTable = refs.length > 0
-      ? '\n\n--- Ref → Locator ---\n' + refs.map(r => `${r.ref}: ${r.locator}`).join('\n')
-      : '';
     const title = await page.title().catch(() => '');
     const pageUrl = page.url();
-    const fullSnapshot = `Page: ${title} (${pageUrl})\nRefs: ${refs.length} interactive elements\n\n${snapshotText}${refTable}`;
+    const formatSnapshot = (snapshotText, refs) => {
+      const refTable = refs.length > 0
+        ? '\n\n--- Ref → Locator ---\n' + refs.map(r => `${r.ref}: ${r.locator}`).join('\n')
+        : '';
+      return `Page: ${title} (${pageUrl})\nRefs: ${refs.length} interactive elements\n\n${snapshotText}${refTable}`;
+    };
+    const fullSnapshot = formatSnapshot(fullSnapshotText, fullRefs);
 
-    const shouldCacheSnapshot = !selector;
-    const previousSnapshot = shouldCacheSnapshot ? lastSnapshots.get(page) : undefined;
-    if (shouldCacheSnapshot) {
-      lastSnapshots.set(page, fullSnapshot);
+    let pageSnapshots = lastSnapshots.get(page);
+    if (!(pageSnapshots instanceof Map)) {
+      const migratedSnapshots = new Map();
+      if (typeof pageSnapshots === 'string') {
+        migratedSnapshots.set('__full_page__', pageSnapshots);
+      }
+      pageSnapshots = migratedSnapshots;
+      lastSnapshots.set(page, pageSnapshots);
     }
+    const snapshotKey = selector || '__full_page__';
+    const previousSnapshot = pageSnapshots.get(snapshotKey);
+    pageSnapshots.set(snapshotKey, fullSnapshot);
 
-    if (showDiffSinceLastCall && previousSnapshot && shouldCacheSnapshot) {
+    if (!search && showDiffSinceLastCall && previousSnapshot) {
       const diffResult = createSmartDiff(previousSnapshot, fullSnapshot);
       if (diffResult.type === 'no-change') {
         return 'No changes since last snapshot. Use showDiffSinceLastCall: false to see full content.';
@@ -449,6 +459,11 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
       return diffResult.content;
     }
 
+    if (searchPattern) {
+      const { text: filteredSnapshotText, refs: filteredRefs } = buildSnapshotText(axRoot, null, searchPattern);
+      return formatSnapshot(filteredSnapshotText, filteredRefs);
+    }
+
     return fullSnapshot;
   };
 
diff --git a/mcp/src/index.js b/mcp/src/index.js
index d966fc2..3f35e01 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -133,7 +133,7 @@ Variables:
   state       Persistent object across calls (cleared on reset). Store your working page here.
 
 Helpers:
-  snapshot({ selector?, search? })   Accessibility tree as text. 10-100x cheaper than screenshots.
+  snapshot({ selector?, search?, showDiffSinceLastCall? })   Accessibility tree as text. 10-100x cheaper than screenshots.
   refToLocator({ ref })              Resolve a snapshot ref (e.g., e3) to a Playwright locator string.
   waitForPageLoad({ timeout? })      Smart load detection (filters analytics/ads, polls readyState).
   getLogs({ count? })                Browser console logs captured for current page.
@@ -391,6 +391,7 @@ If timeout:          Increase timeout param, or break into smaller steps
 snapshot(options?)
   options.selector  CSS selector to scope the snapshot (e.g., '#main', '.sidebar')
   options.search    Regex string to filter tree nodes (e.g., 'button|link')
+  options.showDiffSinceLastCall  When true (default), returns a smart diff from previous snapshot when unchanged scope+search is not used
   Returns: Text accessibility tree with interactive element refs
 
 waitForPageLoad(options?)
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 7008e4f..08377df 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -130,6 +130,7 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('cookie') || promptBlock.includes('consent'), 'should include consent modal handling');
     assert.ok(promptBlock.includes('stale locator'), 'should include stale locator warning');
     assert.ok(promptBlock.includes('snapshot({ showDiffSinceLastCall'), 'should include diff usage guidance');
+    assert.ok(promptBlock.includes('options.showDiffSinceLastCall'), 'should document snapshot diff toggle in API reference');
   });
 
   it('execute prompt includes tool-selection and debugging decision trees', () => {
@@ -193,6 +194,14 @@ describe('Tool Definitions', () => {
 
     assert.ok(source.includes('refToLocator'), 'exec engine should expose refToLocator helper');
     assert.ok(source.includes('const getCDPSession = async'), 'exec engine should define getCDPSession helper');
+    assert.ok(
+      source.includes('No changes since last snapshot. Use showDiffSinceLastCall: false to see full content.'),
+      'exec engine should return snapshot no-change guidance'
+    );
+    assert.ok(
+      source.includes('!search && showDiffSinceLastCall') || source.includes('showDiffSinceLastCall && !search'),
+      'snapshot diff mode should only run when search is not provided'
+    );
   });
 });
 

From 1f50d6172eb22458418691c32ef379a6150d3992 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 18:02:43 +0530
Subject: [PATCH 033/192] fix(mcp): diff snapshot only for full-page views

---
 mcp/src/exec-engine.js     | 2 +-
 mcp/test/mcp-tools.test.js | 5 +++--
 2 files changed, 4 insertions(+), 3 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index 9c5ca6d..c4f2e86 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -451,7 +451,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     const previousSnapshot = pageSnapshots.get(snapshotKey);
     pageSnapshots.set(snapshotKey, fullSnapshot);
 
-    if (!search && showDiffSinceLastCall && previousSnapshot) {
+    if (!selector && !search && showDiffSinceLastCall && previousSnapshot) {
       const diffResult = createSmartDiff(previousSnapshot, fullSnapshot);
       if (diffResult.type === 'no-change') {
         return 'No changes since last snapshot. Use showDiffSinceLastCall: false to see full content.';
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 08377df..fcefc4b 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -199,8 +199,9 @@ describe('Tool Definitions', () => {
       'exec engine should return snapshot no-change guidance'
     );
     assert.ok(
-      source.includes('!search && showDiffSinceLastCall') || source.includes('showDiffSinceLastCall && !search'),
-      'snapshot diff mode should only run when search is not provided'
+      source.includes('!selector && !search && showDiffSinceLastCall') ||
+      source.includes('showDiffSinceLastCall && !selector && !search'),
+      'snapshot diff mode should only run for full-page snapshots with no search'
     );
   });
 });

From eb37c44fca0e0d8c7ad8eced3322abedf59ec088 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 18:27:05 +0530
Subject: [PATCH 034/192] feat(relay): add jsonl cdp traffic logging with
 playwriter-style directions

---
 relay/src/cdp-log.js | 66 +++++++++++++++++++++++++++++++
 relay/src/index.js   | 92 ++++++++++++++++++++++++++++++++++++++++----
 2 files changed, 151 insertions(+), 7 deletions(-)
 create mode 100644 relay/src/cdp-log.js

diff --git a/relay/src/cdp-log.js b/relay/src/cdp-log.js
new file mode 100644
index 0000000..84ce83a
--- /dev/null
+++ b/relay/src/cdp-log.js
@@ -0,0 +1,66 @@
+const fs = require('node:fs');
+const os = require('node:os');
+const path = require('node:path');
+
+const BF_DIR = path.join(os.homedir(), '.browserforce');
+const LOG_CDP_FILE_PATH = process.env.BROWSERFORCE_CDP_LOG_FILE_PATH || path.join(BF_DIR, 'cdp.jsonl');
+const DEFAULT_MAX_STRING_LENGTH = 2000;
+
+function resolveMaxStringLength(maxStringLength) {
+  if (Number.isFinite(maxStringLength) && maxStringLength > 0) {
+    return Math.floor(maxStringLength);
+  }
+  const fromEnv = Number(process.env.BROWSERFORCE_CDP_LOG_MAX_STRING_LENGTH);
+  if (Number.isFinite(fromEnv) && fromEnv > 0) {
+    return Math.floor(fromEnv);
+  }
+  return DEFAULT_MAX_STRING_LENGTH;
+}
+
+function truncateString(value, maxLength) {
+  if (value.length <= maxLength) {
+    return value;
+  }
+  const truncatedCount = value.length - maxLength;
+  return `${value.slice(0, maxLength)}...[truncated ${truncatedCount} chars]`;
+}
+
+function createTruncatingCircularReplacer(maxStringLength) {
+  const seen = new WeakSet();
+  return (_key, value) => {
+    if (typeof value === 'string') {
+      return truncateString(value, maxStringLength);
+    }
+    if (value && typeof value === 'object') {
+      if (seen.has(value)) {
+        return '[Circular]';
+      }
+      seen.add(value);
+    }
+    return value;
+  };
+}
+
+function createCdpLogger({ logFilePath, maxStringLength } = {}) {
+  const resolvedLogFilePath = logFilePath || process.env.BROWSERFORCE_CDP_LOG_FILE_PATH || LOG_CDP_FILE_PATH;
+  fs.mkdirSync(path.dirname(resolvedLogFilePath), { recursive: true });
+  fs.writeFileSync(resolvedLogFilePath, '');
+
+  const resolvedMaxStringLength = resolveMaxStringLength(maxStringLength);
+  let queue = Promise.resolve();
+
+  return {
+    logFilePath: resolvedLogFilePath,
+    log(entry) {
+      const line = JSON.stringify(entry, createTruncatingCircularReplacer(resolvedMaxStringLength));
+      queue = queue
+        .then(() => fs.promises.appendFile(resolvedLogFilePath, `${line}\n`))
+        .catch(() => {});
+    },
+  };
+}
+
+module.exports = {
+  LOG_CDP_FILE_PATH,
+  createCdpLogger,
+};
diff --git a/relay/src/index.js b/relay/src/index.js
index 5459f5e..f114e3a 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -4,6 +4,7 @@ const fs = require('node:fs');
 const path = require('node:path');
 const os = require('node:os');
 const { WebSocketServer, WebSocket } = require('ws');
+const { createCdpLogger } = require('./cdp-log.js');
 
 // ─── Constants ───────────────────────────────────────────────────────────────
 
@@ -151,9 +152,13 @@ class RelayServer {
 
     // Pending extension reload ack resolver (at most one at a time)
     this._extReloadResolve = null;
+
+    // CDP traffic logger, initialized on start.
+    this.cdpLogger = null;
   }
 
   start({ writeCdpUrl = true } = {}) {
+    this.cdpLogger = createCdpLogger();
     const server = http.createServer((req, res) => this._handleHttp(req, res));
 
     this.extWss = new WebSocketServer({ noServer: true });
@@ -185,6 +190,16 @@ class RelayServer {
     });
   }
 
+  _logCdp(entry) {
+    if (!this.cdpLogger || typeof this.cdpLogger.log !== 'function') {
+      return;
+    }
+    this.cdpLogger.log({
+      timestamp: new Date().toISOString(),
+      ...entry,
+    });
+  }
+
   // ─── HTTP ────────────────────────────────────────────────────────────────
 
   async _handleHttp(req, res) {
@@ -535,7 +550,13 @@ class RelayServer {
 
   _handleCdpEventFromExt({ tabId, method, params, childSessionId }) {
     const sessionId = this.tabToSession.get(tabId);
-    if (!sessionId) return;
+    if (!sessionId) {
+      this._logCdp({
+        direction: 'from-extension',
+        message: { method, params, tabId, childSessionId },
+      });
+      return;
+    }
 
     // Track child sessions (iframes / OOPIFs)
     if (method === 'Target.attachedToTarget' && params?.sessionId) {
@@ -550,6 +571,11 @@ class RelayServer {
       ? (this.childSessions.get(childSessionId)?.parentSessionId || sessionId)
       : sessionId;
 
+    this._logCdp({
+      direction: 'from-extension',
+      message: { method, params, tabId, sessionId: outerSessionId, childSessionId },
+    });
+
     this._broadcastCdp({ method, params, sessionId: outerSessionId });
   }
 
@@ -627,17 +653,25 @@ class RelayServer {
 
   async _handleCdpClientMessage(ws, msg) {
     const { id, method, params, sessionId } = msg;
+    this._logCdp({
+      direction: 'from-playwright',
+      message: { id, method, params, sessionId },
+    });
 
     try {
       let result;
       if (sessionId) {
-        result = await this._forwardToTab(sessionId, method, params);
+        result = await this._forwardToTab(sessionId, method, params, id);
       } else {
         result = await this._handleBrowserCommand(ws, id, method, params);
       }
       if (result !== undefined) {
         const response = { id, result };
         if (sessionId) response.sessionId = sessionId;
+        this._logCdp({
+          direction: 'to-playwright',
+          message: response,
+        });
         ws.send(JSON.stringify(response));
       }
     } catch (err) {
@@ -646,6 +680,10 @@ class RelayServer {
         error: { code: -32000, message: err.message },
       };
       if (sessionId) response.sessionId = sessionId;
+      this._logCdp({
+        direction: 'to-playwright',
+        message: response,
+      });
       ws.send(JSON.stringify(response));
     }
   }
@@ -663,7 +701,7 @@ class RelayServer {
       case 'Target.setDiscoverTargets':
         // Emit targetCreated for all known targets
         for (const [, target] of this.targets) {
-          ws.send(JSON.stringify({
+          const event = {
             method: 'Target.targetCreated',
             params: {
               targetInfo: {
@@ -675,7 +713,12 @@ class RelayServer {
                 browserContextId: DEFAULT_BROWSER_CONTEXT_ID,
               },
             },
-          }));
+          };
+          this._logCdp({
+            direction: 'to-playwright',
+            message: event,
+          });
+          ws.send(JSON.stringify(event));
         }
         return {};
 
@@ -683,6 +726,10 @@ class RelayServer {
         this.autoAttachEnabled = true;
         this.autoAttachParams = params;
         // Respond immediately, then attach tabs asynchronously
+        this._logCdp({
+          direction: 'to-playwright',
+          message: { id: msgId, result: {} },
+        });
         ws.send(JSON.stringify({ id: msgId, result: {} }));
         this._autoAttachAllTabs(ws).catch((e) => {
           logErr('[relay] Auto-attach error:', e.message);
@@ -804,7 +851,7 @@ class RelayServer {
   }
 
   _sendAttachedEvent(ws, sessionId, target) {
-    ws.send(JSON.stringify({
+    const event = {
       method: 'Target.attachedToTarget',
       params: {
         sessionId,
@@ -818,7 +865,12 @@ class RelayServer {
         },
         waitingForDebugger: false,
       },
-    }));
+    };
+    this._logCdp({
+      direction: 'to-playwright',
+      message: event,
+    });
+    ws.send(JSON.stringify(event));
   }
 
   async _createTarget(ws, params) {
@@ -891,7 +943,7 @@ class RelayServer {
 
   // ─── CDP Command Forwarding ─────────────────────────────────────────────
 
-  async _forwardToTab(sessionId, method, params) {
+  async _forwardToTab(sessionId, method, params, id) {
     // Main session
     const target = this.targets.get(sessionId);
     if (target) {
@@ -906,6 +958,16 @@ class RelayServer {
         target._triggerMethod = method;
         await this._ensureDebuggerAttached(target, sessionId);
       }
+      this._logCdp({
+        direction: 'to-extension',
+        message: {
+          id,
+          method,
+          params: params || {},
+          sessionId,
+          tabId: target.tabId,
+        },
+      });
       return this._sendToExt('cdpCommand', {
         tabId: target.tabId,
         method,
@@ -922,6 +984,18 @@ class RelayServer {
       if (parentTarget && !parentTarget.debuggerAttached) {
         await this._ensureDebuggerAttached(parentTarget, parentSessionId);
       }
+      this._logCdp({
+        direction: 'to-extension',
+        message: {
+          id,
+          method,
+          params: params || {},
+          sessionId,
+          tabId: child.tabId,
+          childSessionId: sessionId,
+          parentSessionId,
+        },
+      });
       return this._sendToExt('cdpCommand', {
         tabId: child.tabId,
         method,
@@ -936,6 +1010,10 @@ class RelayServer {
   // ─── Broadcast ──────────────────────────────────────────────────────────
 
   _broadcastCdp(msg) {
+    this._logCdp({
+      direction: 'to-playwright',
+      message: msg,
+    });
     const data = JSON.stringify(msg);
     for (const client of this.clients) {
       if (client.readyState === WebSocket.OPEN) {

From 8b34054ed143fcebe0116df330e6b45f3aae13c1 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 18:30:44 +0530
Subject: [PATCH 035/192] fix(mcp): align execute schema helpers and add helper
 exposure test

---
 mcp/src/index.js                     |  2 +-
 mcp/test/exec-engine-plugins.test.js | 25 +++++++++++++++++++++++++
 2 files changed, 26 insertions(+), 1 deletion(-)

diff --git a/mcp/src/index.js b/mcp/src/index.js
index 3f35e01..b3bc12d 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -415,7 +415,7 @@ function registerExecuteTool(skillAppendix = '') {
     'execute',
     EXECUTE_PROMPT + skillAppendix,
     {
-      code: z.string().describe('JavaScript to run — page/context/state/snapshot/refToLocator/waitForPageLoad/getLogs/cleanHTML/pageMarkdown in scope'),
+      code: z.string().describe('JavaScript to run — page/context/state/snapshot/refToLocator/getCDPSession/waitForPageLoad/getLogs/cleanHTML/pageMarkdown in scope'),
       timeout: z.number().optional().describe('Max execution time in ms (default: 30000)'),
     },
     async ({ code, timeout = 30000 }) => {
diff --git a/mcp/test/exec-engine-plugins.test.js b/mcp/test/exec-engine-plugins.test.js
index 3f87d44..0a22cab 100644
--- a/mcp/test/exec-engine-plugins.test.js
+++ b/mcp/test/exec-engine-plugins.test.js
@@ -113,6 +113,31 @@ test('buildExecContext exposes screenshot and content helpers in execute scope',
   assert.equal(typeof ctx.pageMarkdown, 'function');
 });
 
+test('buildExecContext exposes callable ref and CDP helpers', async () => {
+  const fakeSession = { send: async () => ({}) };
+  const page = {
+    isClosed: () => false,
+    context: () => ({
+      newCDPSession: async (targetPage) => {
+        assert.equal(targetPage, page);
+        return fakeSession;
+      },
+    }),
+  };
+
+  const ctx = buildExecContext(page, { pages: () => [page] }, {}, {}, {});
+  assert.equal(typeof ctx.refToLocator, 'function');
+  assert.equal(typeof ctx.getCDPSession, 'function');
+
+  const session = await ctx.getCDPSession({ page });
+  assert.equal(session, fakeSession);
+
+  await assert.rejects(
+    () => ctx.getCDPSession({ page: { isClosed: () => true } }),
+    /Cannot create CDP session for closed page/
+  );
+});
+
 test('formatResult returns multi-content for labeled screenshot sentinel', () => {
   const fakeBuffer = Buffer.from('fake-jpeg-data');
   const formatted = formatResult({

From 4d15beb6609be115436c9417847106761d474006 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 18:35:12 +0530
Subject: [PATCH 036/192] fix(relay): harden cdp logger startup and file
 permissions

---
 relay/src/cdp-log.js | 15 +++++++++++++--
 relay/src/index.js   |  8 +++++++-
 2 files changed, 20 insertions(+), 3 deletions(-)

diff --git a/relay/src/cdp-log.js b/relay/src/cdp-log.js
index 84ce83a..a04bbdf 100644
--- a/relay/src/cdp-log.js
+++ b/relay/src/cdp-log.js
@@ -6,6 +6,14 @@ const BF_DIR = path.join(os.homedir(), '.browserforce');
 const LOG_CDP_FILE_PATH = process.env.BROWSERFORCE_CDP_LOG_FILE_PATH || path.join(BF_DIR, 'cdp.jsonl');
 const DEFAULT_MAX_STRING_LENGTH = 2000;
 
+function chmodBestEffort(filePath, mode) {
+  try {
+    fs.chmodSync(filePath, mode);
+  } catch {
+    // Best effort only: some platforms/filesystems do not support POSIX modes.
+  }
+}
+
 function resolveMaxStringLength(maxStringLength) {
   if (Number.isFinite(maxStringLength) && maxStringLength > 0) {
     return Math.floor(maxStringLength);
@@ -43,8 +51,11 @@ function createTruncatingCircularReplacer(maxStringLength) {
 
 function createCdpLogger({ logFilePath, maxStringLength } = {}) {
   const resolvedLogFilePath = logFilePath || process.env.BROWSERFORCE_CDP_LOG_FILE_PATH || LOG_CDP_FILE_PATH;
-  fs.mkdirSync(path.dirname(resolvedLogFilePath), { recursive: true });
-  fs.writeFileSync(resolvedLogFilePath, '');
+  const logDir = path.dirname(resolvedLogFilePath);
+  fs.mkdirSync(logDir, { recursive: true });
+  chmodBestEffort(logDir, 0o700);
+  fs.writeFileSync(resolvedLogFilePath, '', { mode: 0o600 });
+  chmodBestEffort(resolvedLogFilePath, 0o600);
 
   const resolvedMaxStringLength = resolveMaxStringLength(maxStringLength);
   let queue = Promise.resolve();
diff --git a/relay/src/index.js b/relay/src/index.js
index f114e3a..5326072 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -158,7 +158,13 @@ class RelayServer {
   }
 
   start({ writeCdpUrl = true } = {}) {
-    this.cdpLogger = createCdpLogger();
+    try {
+      this.cdpLogger = createCdpLogger();
+    } catch (err) {
+      const message = err && err.message ? err.message : String(err);
+      log('[relay] Warning: CDP logger disabled:', message);
+      this.cdpLogger = null;
+    }
     const server = http.createServer((req, res) => this._handleHttp(req, res));
 
     this.extWss = new WebSocketServer({ noServer: true });

From 84ee6c8ab730a8b6fa52df321d8a301dd91f6d8b Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:35:19 +0530
Subject: [PATCH 037/192] test(mcp): add prompt regression guards for tactical
 guidance

---
 mcp/src/index.js           | 18 ++++++++++++++++++
 mcp/test/mcp-tools.test.js | 17 +++++++++++++++++
 2 files changed, 35 insertions(+)

diff --git a/mcp/src/index.js b/mcp/src/index.js
index 51edc83..1fd080a 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -241,6 +241,21 @@ If snapshot shows [ref=some-id] for an element with a data-testid or id:
 For text content:
   const text = await state.page.locator('role=heading').textContent();
 
+Selector priority:
+  1. Use [ref=...] locators from snapshot output immediately after observing
+  2. Use role/name locators from snapshot
+  3. Use stable test IDs (data-testid) if present
+  4. Avoid brittle nth()/deep CSS selectors unless no stable option exists
+
+Before interacting, handle page blockers (cookie/consent banners, age gates, login popups):
+  const blockers = await snapshot({ search: /cookie|consent|accept|reject|allow|age|verify|login|sign.in/i });
+  // Dismiss blockers first, then continue with the main task
+
+Avoid stale locator usage:
+  // BAD: using a stale locator from an old snapshot after DOM changes
+  // GOOD: refresh observation first, then act with new refs/locators
+  await snapshot();
+
 ═══ COMMON PATTERNS ═══
 
 Navigate and read:
@@ -272,6 +287,9 @@ Wait for specific element:
 Debug with console logs:
   return getLogs({ count: 20 });
 
+When you need the full tree instead of diff output:
+  return await snapshot({ showDiffSinceLastCall: false });
+
 ═══ ANTI-PATTERNS ═══
 
 ✗ Don't navigate the user's existing tabs — create your own via context.newPage()
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 4bb507d..64e9e37 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -113,6 +113,23 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('ANTI-PATTERN') || promptBlock.includes('Don\'t') || promptBlock.includes('✗'), 'should include anti-patterns');
   });
 
+  it('execute prompt includes tactical anti-pattern and decision guidance', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
+      'utf8'
+    );
+
+    const promptStart = source.indexOf('const EXECUTE_PROMPT');
+    const promptEnd = source.indexOf("server.tool(\n  'execute'");
+    const promptBlock = source.slice(promptStart, promptEnd);
+
+    assert.ok(promptBlock.includes('Selector priority'), 'should include selector ranking guidance');
+    assert.ok(promptBlock.includes('login popups'), 'should include login popup handling');
+    assert.ok(promptBlock.includes('cookie') || promptBlock.includes('consent'), 'should include consent modal handling');
+    assert.ok(promptBlock.includes('stale locator'), 'should include stale locator warning');
+    assert.ok(promptBlock.includes('snapshot({ showDiffSinceLastCall'), 'should include diff usage guidance');
+  });
+
   it('execute tool has code and optional timeout params', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),

From 5db8d39e44200c3645ce35f5d1411d49471b160b Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:39:21 +0530
Subject: [PATCH 038/192] feat(mcp): expand execute prompt with tactical web
 automation playbooks

---
 mcp/src/index.js           | 70 ++++++++++++++++++++++++++++++++++++++
 mcp/test/mcp-tools.test.js | 15 ++++++++
 2 files changed, 85 insertions(+)

diff --git a/mcp/src/index.js b/mcp/src/index.js
index 1fd080a..d46599c 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -256,6 +256,76 @@ Avoid stale locator usage:
   // GOOD: refresh observation first, then act with new refs/locators
   await snapshot();
 
+Typing text with newlines:
+  // Use fill() for multiline blocks to avoid accidental Enter key submissions
+  await state.page.locator('role=textbox[name="Message"]').fill('Line 1\\nLine 2');
+
+═══ TACTICAL ANTI-PATTERNS ═══
+
+Popup control:
+  ✗ Don’t click through a popup without confirming what changed
+  ✓ Dismiss popup, then run snapshot() immediately to confirm main UI is usable
+
+Consent blockers:
+  ✗ Don’t continue form/page actions while consent banners block focus
+  ✓ Handle cookie/consent overlays first, then retry the intended action
+
+Stale locators:
+  ✗ Don’t reuse [ref=...] values after DOM/nav updates
+  ✓ Refresh snapshot() and use the newest refs/role locators
+
+Newline typing:
+  ✗ Don’t use keyboard Enter loops for multiline textareas unless explicitly needed
+  ✓ Prefer locator.fill('line1\\nline2') for deterministic multiline input
+
+═══ EXTRACTION DECISION TREE ═══
+
+snapshot vs cleanHTML vs pageMarkdown:
+  1) Use snapshot() when you need current interactive structure, labels, and refs.
+  2) Use cleanHTML(selector?) when you need structured DOM content for parsing/extraction.
+  3) Use pageMarkdown() for article/blog/news pages where nav/ads should be removed.
+  4) Use screenshotWithAccessibilityLabels() only when layout/visual evidence is required.
+
+═══ DEBUGGING WORKFLOW ═══
+
+Combine snapshot + logs:
+  1) snapshot({ search: /target text|button|error/i }) to verify element presence and naming
+  2) getLogs({ count: 30 }) for runtime/network/console errors
+  3) page.evaluate(() => { ...visibility checks... }) to validate hidden/disabled/overlay states
+
+Example visibility check:
+  return await state.page.evaluate(() => {
+    const el = document.querySelector('[data-testid="submit"]');
+    if (!el) return { found: false };
+    const s = getComputedStyle(el);
+    const r = el.getBoundingClientRect();
+    return { found: true, visible: s.display !== 'none' && s.visibility !== 'hidden' && r.width > 0 && r.height > 0 };
+  });
+
+═══ ADVANCED PATTERNS ═══
+
+Authenticated fetch:
+  // Reuse browser session cookies/headers from the current page context
+  return await state.page.evaluate(async () => {
+    const res = await fetch('/api/me', { credentials: 'include' });
+    return { status: res.status, body: await res.text() };
+  });
+
+Network interception:
+  await state.page.route('**/api/**', async (route) => {
+    const request = route.request();
+    // Inspect/modify request here if needed before continuing
+    await route.continue();
+  });
+
+Downloads:
+  // Use expect_download pattern and save path after click/navigation trigger
+  const [download] = await Promise.all([
+    state.page.waitForEvent('download'),
+    state.page.locator('role=button[name="Export CSV"]').click(),
+  ]);
+  return { suggestedFilename: download.suggestedFilename() };
+
 ═══ COMMON PATTERNS ═══
 
 Navigate and read:
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 64e9e37..dcff140 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -130,6 +130,21 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('snapshot({ showDiffSinceLastCall'), 'should include diff usage guidance');
   });
 
+  it('execute prompt includes tool-selection and debugging decision trees', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
+      'utf8'
+    );
+    const promptStart = source.indexOf('const EXECUTE_PROMPT');
+    const promptEnd = source.indexOf("server.tool(\n  'execute'");
+    const promptBlock = source.slice(promptStart, promptEnd);
+
+    assert.ok(promptBlock.includes('snapshot vs cleanHTML vs pageMarkdown'), 'should include extraction decision tree');
+    assert.ok(promptBlock.includes('Combine snapshot + logs'), 'should include debugging workflow');
+    assert.ok(promptBlock.includes('Authenticated fetch'), 'should include authenticated fetch pattern');
+    assert.ok(promptBlock.includes('Downloads'), 'should include download pattern');
+  });
+
   it('execute tool has code and optional timeout params', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),

From cba8c89247319f0fd8ec39221851ea75aa105d96 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:45:03 +0530
Subject: [PATCH 039/192] feat(mcp): expose refToLocator helper in execute
 context

---
 mcp/src/exec-engine.js     | 13 ++++++++++++-
 mcp/src/index.js           |  8 +++++---
 mcp/test/mcp-tools.test.js | 10 ++++++++++
 3 files changed, 27 insertions(+), 4 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index 96ca77c..64fb6fd 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -409,6 +409,8 @@ export class CodeExecutionTimeoutError extends Error {
 // instead of referencing module-level singletons.
 export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {}, pluginHelpers = {}) {
   const { consoleLogs, setupConsoleCapture } = consoleHelpers;
+  const lastSnapshots = userState.__lastSnapshots || (userState.__lastSnapshots = new WeakMap());
+  const lastRefToLocator = userState.__lastRefToLocator || (userState.__lastRefToLocator = new WeakMap());
 
   const activePage = () => {
     if (userState.page && !userState.page.isClosed()) return userState.page;
@@ -424,6 +426,8 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     annotateStableAttrs(axRoot, stableIds);
     const searchPattern = parseSearchPattern(search);
     const { text: snapshotText, refs } = buildSnapshotText(axRoot, null, searchPattern);
+    const refMap = new Map(refs.map(({ ref, locator }) => [ref, locator]));
+    lastRefToLocator.set(page, refMap);
     const refTable = refs.length > 0
       ? '\n\n--- Ref → Locator ---\n' + refs.map(r => `${r.ref}: ${r.locator}`).join('\n')
       : '';
@@ -432,6 +436,13 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     return `Page: ${title} (${pageUrl})\nRefs: ${refs.length} interactive elements\n\n${snapshotText}${refTable}`;
   };
 
+  const refToLocator = ({ ref, page: targetPage } = {}) => {
+    const p = targetPage || activePage();
+    const map = lastRefToLocator.get(p);
+    if (!map) return null;
+    return map.get(ref) ?? null;
+  };
+
   const waitForPageLoad = (opts = {}) =>
     smartWaitForPageLoad(activePage(), opts.timeout ?? 30000);
 
@@ -473,7 +484,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
   return {
     ...wrappedPluginHelpers,           // plugin helpers spread first — built-ins always win
     page: defaultPage, context: ctx, state: userState,
-    snapshot, waitForPageLoad, getLogs, clearLogs,
+    snapshot, refToLocator, waitForPageLoad, getLogs, clearLogs,
     screenshotWithAccessibilityLabels, cleanHTML, pageMarkdown,
     fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout,
     TextEncoder, TextDecoder,
diff --git a/mcp/src/index.js b/mcp/src/index.js
index d46599c..b0c6ca8 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -134,6 +134,7 @@ Variables:
 
 Helpers:
   snapshot({ selector?, search? })   Accessibility tree as text. 10-100x cheaper than screenshots.
+  refToLocator({ ref })              Resolve a snapshot ref (e.g., e3) to a Playwright locator string.
   waitForPageLoad({ timeout? })      Smart load detection (filters analytics/ads, polls readyState).
   getLogs({ count? })                Browser console logs captured for current page.
   clearLogs()                        Clear captured console logs.
@@ -235,8 +236,9 @@ Use Playwright locators with accessibility roles (from snapshot output):
   await state.page.locator('role=textbox[name="Search"]').fill('query');
   await state.page.locator('role=link[name="Settings"]').click();
 
-If snapshot shows [ref=some-id] for an element with a data-testid or id:
-  await state.page.locator('[data-testid="some-id"]').click();
+If snapshot shows [ref=e3], resolve it with refToLocator({ ref }) before acting:
+  const locator = refToLocator({ ref: 'e3' });
+  if (locator) await state.page.locator(locator).click();
 
 For text content:
   const text = await state.page.locator('role=heading').textContent();
@@ -406,7 +408,7 @@ function registerExecuteTool(skillAppendix = '') {
     'execute',
     EXECUTE_PROMPT + skillAppendix,
     {
-      code: z.string().describe('JavaScript to run — page/context/state/snapshot/waitForPageLoad/getLogs/cleanHTML/pageMarkdown in scope'),
+      code: z.string().describe('JavaScript to run — page/context/state/snapshot/refToLocator/waitForPageLoad/getLogs/cleanHTML/pageMarkdown in scope'),
       timeout: z.number().optional().describe('Max execution time in ms (default: 30000)'),
     },
     async ({ code, timeout = 30000 }) => {
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index dcff140..261c87e 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -106,6 +106,7 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('snapshot'), 'should mention snapshot-first approach');
     assert.ok(promptBlock.includes('waitForPageLoad'), 'should mention waitForPageLoad');
     assert.ok(promptBlock.includes('screenshotWithAccessibilityLabels'), 'should mention screenshotWithAccessibilityLabels helper');
+    assert.ok(promptBlock.includes('refToLocator({ ref })'), 'should mention refToLocator helper usage');
     assert.ok(promptBlock.includes('cleanHTML'), 'should mention cleanHTML helper');
     assert.ok(promptBlock.includes('pageMarkdown'), 'should mention pageMarkdown helper');
     assert.ok(promptBlock.includes('newPage'), 'should mention creating new tabs');
@@ -182,6 +183,15 @@ describe('Tool Definitions', () => {
     assert.ok(!source.includes("'screenshot_with_labels'"), 'screenshot_with_labels tool should be removed');
     assert.ok(!source.includes('SCREENSHOT_LABELS_PROMPT'), 'dedicated screenshot prompt should be removed');
   });
+
+  it('exec context source exposes refToLocator helper', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/exec-engine.js'),
+      'utf8'
+    );
+
+    assert.ok(source.includes('refToLocator'), 'exec engine should expose refToLocator helper');
+  });
 });
 
 // ─── MCP Response Format ─────────────────────────────────────────────────────

From 8eb92a8ced69789fd731811b2bc2643899e8de60 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:50:37 +0530
Subject: [PATCH 040/192] feat(mcp): add getCDPSession helper for relay-safe
 raw CDP access

---
 mcp/src/exec-engine.js     | 10 +++++++++-
 mcp/src/index.js           |  6 ++++++
 mcp/test/mcp-tools.test.js |  2 ++
 3 files changed, 17 insertions(+), 1 deletion(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index 64fb6fd..de50de4 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -458,6 +458,14 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     if (consoleLogs) consoleLogs.set(activePage(), []);
   };
 
+  const getCDPSession = async ({ page: targetPage } = {}) => {
+    const p = targetPage || activePage();
+    if (!p || p.isClosed()) {
+      throw new Error('Cannot create CDP session for closed page');
+    }
+    return p.context().newCDPSession(p);
+  };
+
   const screenshotWithAccessibilityLabels = async ({ selector, interactiveOnly = true } = {}) => {
     const page = activePage();
     const { screenshot, snapshot: snapText, labelCount } = await screenshotWithLabels(page, {
@@ -484,7 +492,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
   return {
     ...wrappedPluginHelpers,           // plugin helpers spread first — built-ins always win
     page: defaultPage, context: ctx, state: userState,
-    snapshot, refToLocator, waitForPageLoad, getLogs, clearLogs,
+    snapshot, refToLocator, waitForPageLoad, getLogs, clearLogs, getCDPSession,
     screenshotWithAccessibilityLabels, cleanHTML, pageMarkdown,
     fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout,
     TextEncoder, TextDecoder,
diff --git a/mcp/src/index.js b/mcp/src/index.js
index b0c6ca8..d966fc2 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -148,6 +148,8 @@ Helpers:
   pageMarkdown()                     Article content via Mozilla Readability (Firefox Reader View).
                                      Strips nav/ads/sidebars. Returns title + metadata + body text.
                                      Falls back to raw body text for non-article pages.
+  getCDPSession({ page })            Create a relay-safe raw CDP session for a page.
+                                     Use this instead of page.context().newCDPSession(page).
 
 Globals: fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout, TextEncoder, TextDecoder
 
@@ -280,6 +282,10 @@ Newline typing:
   ✗ Don’t use keyboard Enter loops for multiline textareas unless explicitly needed
   ✓ Prefer locator.fill('line1\\nline2') for deterministic multiline input
 
+Raw CDP sessions:
+  ✗ Don’t call page.context().newCDPSession(page) directly
+  ✓ Use getCDPSession({ page }) for relay-safe CDP session creation
+
 ═══ EXTRACTION DECISION TREE ═══
 
 snapshot vs cleanHTML vs pageMarkdown:
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 261c87e..7008e4f 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -107,6 +107,7 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('waitForPageLoad'), 'should mention waitForPageLoad');
     assert.ok(promptBlock.includes('screenshotWithAccessibilityLabels'), 'should mention screenshotWithAccessibilityLabels helper');
     assert.ok(promptBlock.includes('refToLocator({ ref })'), 'should mention refToLocator helper usage');
+    assert.ok(promptBlock.includes('getCDPSession({ page })'), 'should mention relay-safe getCDPSession helper usage');
     assert.ok(promptBlock.includes('cleanHTML'), 'should mention cleanHTML helper');
     assert.ok(promptBlock.includes('pageMarkdown'), 'should mention pageMarkdown helper');
     assert.ok(promptBlock.includes('newPage'), 'should mention creating new tabs');
@@ -191,6 +192,7 @@ describe('Tool Definitions', () => {
     );
 
     assert.ok(source.includes('refToLocator'), 'exec engine should expose refToLocator helper');
+    assert.ok(source.includes('const getCDPSession = async'), 'exec engine should define getCDPSession helper');
   });
 });
 

From 16c7cd9ab41a743c4cd3fdd081940dc5f1d6e2f7 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 17:56:49 +0530
Subject: [PATCH 041/192] feat(mcp): add snapshot diff mode with
 showDiffSinceLastCall toggle

---
 mcp/src/exec-engine.js     | 45 ++++++++++++++++++++++++++++++++------
 mcp/src/index.js           |  3 ++-
 mcp/test/mcp-tools.test.js |  9 ++++++++
 3 files changed, 49 insertions(+), 8 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index de50de4..dd4962b 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -418,22 +418,53 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     throw new Error('No active page. Create one first: state.page = await context.newPage()');
   };
 
-  const snapshot = async ({ selector, search } = {}) => {
+  const snapshot = async ({ selector, search, showDiffSinceLastCall = true } = {}) => {
     const page = activePage();
     const axRoot = await getAccessibilityTree(page, selector);
     if (!axRoot) return 'No accessibility tree available for this page.';
     const stableIds = await getStableIds(page, selector);
     annotateStableAttrs(axRoot, stableIds);
     const searchPattern = parseSearchPattern(search);
-    const { text: snapshotText, refs } = buildSnapshotText(axRoot, null, searchPattern);
-    const refMap = new Map(refs.map(({ ref, locator }) => [ref, locator]));
+    const { text: fullSnapshotText, refs: fullRefs } = buildSnapshotText(axRoot, null, null);
+    const refMap = new Map(fullRefs.map(({ ref, locator }) => [ref, locator]));
     lastRefToLocator.set(page, refMap);
-    const refTable = refs.length > 0
-      ? '\n\n--- Ref → Locator ---\n' + refs.map(r => `${r.ref}: ${r.locator}`).join('\n')
-      : '';
     const title = await page.title().catch(() => '');
     const pageUrl = page.url();
-    return `Page: ${title} (${pageUrl})\nRefs: ${refs.length} interactive elements\n\n${snapshotText}${refTable}`;
+    const formatSnapshot = (snapshotText, refs) => {
+      const refTable = refs.length > 0
+        ? '\n\n--- Ref → Locator ---\n' + refs.map(r => `${r.ref}: ${r.locator}`).join('\n')
+        : '';
+      return `Page: ${title} (${pageUrl})\nRefs: ${refs.length} interactive elements\n\n${snapshotText}${refTable}`;
+    };
+    const fullSnapshot = formatSnapshot(fullSnapshotText, fullRefs);
+
+    let pageSnapshots = lastSnapshots.get(page);
+    if (!(pageSnapshots instanceof Map)) {
+      const migratedSnapshots = new Map();
+      if (typeof pageSnapshots === 'string') {
+        migratedSnapshots.set('__full_page__', pageSnapshots);
+      }
+      pageSnapshots = migratedSnapshots;
+      lastSnapshots.set(page, pageSnapshots);
+    }
+    const snapshotKey = selector || '__full_page__';
+    const previousSnapshot = pageSnapshots.get(snapshotKey);
+    pageSnapshots.set(snapshotKey, fullSnapshot);
+
+    if (!search && showDiffSinceLastCall && previousSnapshot) {
+      const diffResult = createSmartDiff(previousSnapshot, fullSnapshot);
+      if (diffResult.type === 'no-change') {
+        return 'No changes since last snapshot. Use showDiffSinceLastCall: false to see full content.';
+      }
+      return diffResult.content;
+    }
+
+    if (searchPattern) {
+      const { text: filteredSnapshotText, refs: filteredRefs } = buildSnapshotText(axRoot, null, searchPattern);
+      return formatSnapshot(filteredSnapshotText, filteredRefs);
+    }
+
+    return fullSnapshot;
   };
 
   const refToLocator = ({ ref, page: targetPage } = {}) => {
diff --git a/mcp/src/index.js b/mcp/src/index.js
index d966fc2..3f35e01 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -133,7 +133,7 @@ Variables:
   state       Persistent object across calls (cleared on reset). Store your working page here.
 
 Helpers:
-  snapshot({ selector?, search? })   Accessibility tree as text. 10-100x cheaper than screenshots.
+  snapshot({ selector?, search?, showDiffSinceLastCall? })   Accessibility tree as text. 10-100x cheaper than screenshots.
   refToLocator({ ref })              Resolve a snapshot ref (e.g., e3) to a Playwright locator string.
   waitForPageLoad({ timeout? })      Smart load detection (filters analytics/ads, polls readyState).
   getLogs({ count? })                Browser console logs captured for current page.
@@ -391,6 +391,7 @@ If timeout:          Increase timeout param, or break into smaller steps
 snapshot(options?)
   options.selector  CSS selector to scope the snapshot (e.g., '#main', '.sidebar')
   options.search    Regex string to filter tree nodes (e.g., 'button|link')
+  options.showDiffSinceLastCall  When true (default), returns a smart diff from previous snapshot when unchanged scope+search is not used
   Returns: Text accessibility tree with interactive element refs
 
 waitForPageLoad(options?)
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 7008e4f..08377df 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -130,6 +130,7 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('cookie') || promptBlock.includes('consent'), 'should include consent modal handling');
     assert.ok(promptBlock.includes('stale locator'), 'should include stale locator warning');
     assert.ok(promptBlock.includes('snapshot({ showDiffSinceLastCall'), 'should include diff usage guidance');
+    assert.ok(promptBlock.includes('options.showDiffSinceLastCall'), 'should document snapshot diff toggle in API reference');
   });
 
   it('execute prompt includes tool-selection and debugging decision trees', () => {
@@ -193,6 +194,14 @@ describe('Tool Definitions', () => {
 
     assert.ok(source.includes('refToLocator'), 'exec engine should expose refToLocator helper');
     assert.ok(source.includes('const getCDPSession = async'), 'exec engine should define getCDPSession helper');
+    assert.ok(
+      source.includes('No changes since last snapshot. Use showDiffSinceLastCall: false to see full content.'),
+      'exec engine should return snapshot no-change guidance'
+    );
+    assert.ok(
+      source.includes('!search && showDiffSinceLastCall') || source.includes('showDiffSinceLastCall && !search'),
+      'snapshot diff mode should only run when search is not provided'
+    );
   });
 });
 

From 032a2da21b7c39747e94781f6affd4e27b458719 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 18:02:43 +0530
Subject: [PATCH 042/192] fix(mcp): diff snapshot only for full-page views

---
 mcp/src/exec-engine.js     | 2 +-
 mcp/test/mcp-tools.test.js | 5 +++--
 2 files changed, 4 insertions(+), 3 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index dd4962b..b0d59e1 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -451,7 +451,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
     const previousSnapshot = pageSnapshots.get(snapshotKey);
     pageSnapshots.set(snapshotKey, fullSnapshot);
 
-    if (!search && showDiffSinceLastCall && previousSnapshot) {
+    if (!selector && !search && showDiffSinceLastCall && previousSnapshot) {
       const diffResult = createSmartDiff(previousSnapshot, fullSnapshot);
       if (diffResult.type === 'no-change') {
         return 'No changes since last snapshot. Use showDiffSinceLastCall: false to see full content.';
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 08377df..fcefc4b 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -199,8 +199,9 @@ describe('Tool Definitions', () => {
       'exec engine should return snapshot no-change guidance'
     );
     assert.ok(
-      source.includes('!search && showDiffSinceLastCall') || source.includes('showDiffSinceLastCall && !search'),
-      'snapshot diff mode should only run when search is not provided'
+      source.includes('!selector && !search && showDiffSinceLastCall') ||
+      source.includes('showDiffSinceLastCall && !selector && !search'),
+      'snapshot diff mode should only run for full-page snapshots with no search'
     );
   });
 });

From cd12e5798bd2b506d8cd64ab7c82e3573dea7007 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 18:30:44 +0530
Subject: [PATCH 043/192] fix(mcp): align execute schema helpers and add helper
 exposure test

---
 mcp/src/index.js                     |  2 +-
 mcp/test/exec-engine-plugins.test.js | 25 +++++++++++++++++++++++++
 2 files changed, 26 insertions(+), 1 deletion(-)

diff --git a/mcp/src/index.js b/mcp/src/index.js
index 3f35e01..b3bc12d 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -415,7 +415,7 @@ function registerExecuteTool(skillAppendix = '') {
     'execute',
     EXECUTE_PROMPT + skillAppendix,
     {
-      code: z.string().describe('JavaScript to run — page/context/state/snapshot/refToLocator/waitForPageLoad/getLogs/cleanHTML/pageMarkdown in scope'),
+      code: z.string().describe('JavaScript to run — page/context/state/snapshot/refToLocator/getCDPSession/waitForPageLoad/getLogs/cleanHTML/pageMarkdown in scope'),
       timeout: z.number().optional().describe('Max execution time in ms (default: 30000)'),
     },
     async ({ code, timeout = 30000 }) => {
diff --git a/mcp/test/exec-engine-plugins.test.js b/mcp/test/exec-engine-plugins.test.js
index 05e3907..150a204 100644
--- a/mcp/test/exec-engine-plugins.test.js
+++ b/mcp/test/exec-engine-plugins.test.js
@@ -45,6 +45,31 @@ test('buildExecContext exposes screenshot and content helpers in execute scope',
   assert.equal(typeof ctx.pageMarkdown, 'function');
 });
 
+test('buildExecContext exposes callable ref and CDP helpers', async () => {
+  const fakeSession = { send: async () => ({}) };
+  const page = {
+    isClosed: () => false,
+    context: () => ({
+      newCDPSession: async (targetPage) => {
+        assert.equal(targetPage, page);
+        return fakeSession;
+      },
+    }),
+  };
+
+  const ctx = buildExecContext(page, { pages: () => [page] }, {}, {}, {});
+  assert.equal(typeof ctx.refToLocator, 'function');
+  assert.equal(typeof ctx.getCDPSession, 'function');
+
+  const session = await ctx.getCDPSession({ page });
+  assert.equal(session, fakeSession);
+
+  await assert.rejects(
+    () => ctx.getCDPSession({ page: { isClosed: () => true } }),
+    /Cannot create CDP session for closed page/
+  );
+});
+
 test('formatResult returns multi-content for labeled screenshot sentinel', () => {
   const fakeBuffer = Buffer.from('fake-jpeg-data');
   const formatted = formatResult({

From 7a76e8694c31191b93162b1a8479e0a5d14a2804 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 18:39:40 +0530
Subject: [PATCH 044/192] docs: document diff-aware helpers and cdp jsonl
 logging

---
 GUIDE.md  | 17 +++++++++++++++++
 README.md | 17 +++++++++++++++++
 2 files changed, 34 insertions(+)

diff --git a/GUIDE.md b/GUIDE.md
index f798650..d831550 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -250,6 +250,17 @@ When connected via MCP (OpenClaw, Claude Desktop, Claude Code), the AI has two t
 | `execute` | Run Playwright JavaScript in your real Chrome. Access `page`, `context`, `state`, `snapshot()`, `waitForPageLoad()`, `getLogs()`, `screenshotWithAccessibilityLabels()`, `cleanHTML()`, `pageMarkdown()`, and Node.js globals. |
 | `reset` | Reconnect to the relay and clear state. Use when the connection drops. |
 
+### Diff-Aware Helpers
+
+Use `showDiffSinceLastCall` to control diff output vs full output in execute helper calls:
+
+```javascript
+await snapshot({ showDiffSinceLastCall: true });
+await snapshot({ showDiffSinceLastCall: false });
+await cleanHTML('body', { showDiffSinceLastCall: false });
+await pageMarkdown({ showDiffSinceLastCall: true });
+```
+
 The `execute` tool gives the agent full Playwright access — it can navigate, click, type, screenshot, read accessibility trees, and run JavaScript in the page context. All within your real browser session.
 
 ## Examples
@@ -416,3 +427,9 @@ A: Yes. All tabs across all Chrome windows are visible.
 | AI sees 0 pages | Open at least one regular webpage (not `chrome://`) |
 | Extension keeps disconnecting | Normal MV3 behavior — it auto-reconnects |
 | Port already in use | Run `lsof -ti:19222 \| xargs kill -9` to kill stale process |
+
+CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay start). Summarize traffic by direction + method:
+
+```bash
+jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
+```
diff --git a/README.md b/README.md
index 7750656..294fdb0 100644
--- a/README.md
+++ b/README.md
@@ -357,6 +357,17 @@ state.results = await page.evaluate(() => document.title);
 | `execute` | Run Playwright JavaScript in your real Chrome. Access `page`, `context`, `state`, `snapshot()`, `waitForPageLoad()`, `getLogs()`, `screenshotWithAccessibilityLabels()`, `cleanHTML()`, `pageMarkdown()`, and Node.js globals. |
 | `reset` | Reconnect to the relay and clear state. Use when the connection drops. |
 
+### Diff-Aware Helpers
+
+Use `showDiffSinceLastCall` to control diff output vs full output in execute helper calls:
+
+```javascript
+await snapshot({ showDiffSinceLastCall: true });
+await snapshot({ showDiffSinceLastCall: false });
+await cleanHTML('body', { showDiffSinceLastCall: false });
+await pageMarkdown({ showDiffSinceLastCall: true });
+```
+
 ## Examples
 
 Get started with simple prompts. The AI generates code and does the work.
@@ -576,4 +587,10 @@ RELAY_PORT=19333 browserforce serve
 | Extension keeps reconnecting | Normal — MV3 kills idle workers; it auto-recovers |
 | Port in use | `lsof -ti:19222 \| xargs kill -9` |
 
+CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay start). Summarize traffic by direction + method:
+
+```bash
+jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
+```
+
 > **Want the full walkthrough?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for a plain-English explanation of what this does and how to get started.

From 74fbcaa06348514556982a1fcb50b53b64ab4cb3 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 19:09:24 +0530
Subject: [PATCH 045/192] fix(extension): resync attached tab group naming on
 group changes

---
 extension/background.js | 27 ++++++++++++++++++---------
 1 file changed, 18 insertions(+), 9 deletions(-)

diff --git a/extension/background.js b/extension/background.js
index efc8447..a3f0cbc 100644
--- a/extension/background.js
+++ b/extension/background.js
@@ -224,6 +224,8 @@ async function attachTab(tabId, sessionId) {
   if (attachedTabs.has(tabId)) {
     const existing = attachedTabs.get(tabId);
     existing.sessionId = sessionId;
+    // Ensure attached tabs are always reconciled into the browserforce group.
+    queueSyncTabGroup();
     return existing;
   }
 
@@ -491,20 +493,27 @@ function onTabRemoved(tabId) {
 
 function onTabUpdated(tabId, changeInfo) {
   if (!attachedTabs.has(tabId)) return;
-  if (!changeInfo.url && !changeInfo.title) return;
+  if (!changeInfo.url && !changeInfo.title && changeInfo.groupId === undefined) return;
+
+  // Reconcile group membership/title if user or Chrome moved this attached tab.
+  if (changeInfo.groupId !== undefined) {
+    queueSyncTabGroup();
+  }
 
   const entry = attachedTabs.get(tabId);
   if (changeInfo.url) entry.targetInfo.url = changeInfo.url;
   if (changeInfo.title) entry.targetInfo.title = changeInfo.title;
 
-  send({
-    method: 'tabUpdated',
-    params: {
-      tabId,
-      url: changeInfo.url,
-      title: changeInfo.title,
-    },
-  });
+  if (changeInfo.url || changeInfo.title) {
+    send({
+      method: 'tabUpdated',
+      params: {
+        tabId,
+        url: changeInfo.url,
+        title: changeInfo.title,
+      },
+    });
+  }
 }
 
 // ─── Helpers ─────────────────────────────────────────────────────────────────

From 29fa1578a262894d585ec6b53d58057d02f524f4 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 19:09:24 +0530
Subject: [PATCH 046/192] fix(extension): resync attached tab group naming on
 group changes

---
 extension/background.js | 27 ++++++++++++++++++---------
 1 file changed, 18 insertions(+), 9 deletions(-)

diff --git a/extension/background.js b/extension/background.js
index efc8447..a3f0cbc 100644
--- a/extension/background.js
+++ b/extension/background.js
@@ -224,6 +224,8 @@ async function attachTab(tabId, sessionId) {
   if (attachedTabs.has(tabId)) {
     const existing = attachedTabs.get(tabId);
     existing.sessionId = sessionId;
+    // Ensure attached tabs are always reconciled into the browserforce group.
+    queueSyncTabGroup();
     return existing;
   }
 
@@ -491,20 +493,27 @@ function onTabRemoved(tabId) {
 
 function onTabUpdated(tabId, changeInfo) {
   if (!attachedTabs.has(tabId)) return;
-  if (!changeInfo.url && !changeInfo.title) return;
+  if (!changeInfo.url && !changeInfo.title && changeInfo.groupId === undefined) return;
+
+  // Reconcile group membership/title if user or Chrome moved this attached tab.
+  if (changeInfo.groupId !== undefined) {
+    queueSyncTabGroup();
+  }
 
   const entry = attachedTabs.get(tabId);
   if (changeInfo.url) entry.targetInfo.url = changeInfo.url;
   if (changeInfo.title) entry.targetInfo.title = changeInfo.title;
 
-  send({
-    method: 'tabUpdated',
-    params: {
-      tabId,
-      url: changeInfo.url,
-      title: changeInfo.title,
-    },
-  });
+  if (changeInfo.url || changeInfo.title) {
+    send({
+      method: 'tabUpdated',
+      params: {
+        tabId,
+        url: changeInfo.url,
+        title: changeInfo.title,
+      },
+    });
+  }
 }
 
 // ─── Helpers ─────────────────────────────────────────────────────────────────

From 91d901d8e1492ec1963fb63d6b5b763599e56084 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 20:29:18 +0530
Subject: [PATCH 047/192] docs: expand controlled-tab guidance and persona use
 cases

---
 GUIDE.md          |  62 ++++++++-
 README.md         |  59 ++++++---
 docs/USE_CASES.md | 326 ++++++++++++++++++++++++++++++++++++++++++++++
 3 files changed, 431 insertions(+), 16 deletions(-)
 create mode 100644 docs/USE_CASES.md

diff --git a/GUIDE.md b/GUIDE.md
index d831550..bcae7bc 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -215,6 +215,62 @@ for (const page of pages) {
 }
 ```
 
+## Controlled Tabs Playbook
+
+Use this section when you want strict control over what the agent can touch.
+
+### 1) Manually Attach A Tab
+
+1. Open the exact tab you want the agent to use.
+2. Click the BrowserForce extension icon.
+3. In the popup, click **+ Attach Current Tab**.
+4. Confirm it appears under **Controlled Tabs**.
+
+This is the fastest way to grant access to an already logged-in page without exposing other tabs.
+
+### 2) Single-Tab Locked Workflow
+
+For high-safety tasks (admin pages, billing pages, production dashboards):
+
+1. Set **Mode** to `Manual`.
+2. Attach only one tab using **+ Attach Current Tab**.
+3. Enable **No new tabs**.
+4. Optionally enable **Lock URL** and **Read-only** depending on the task.
+
+Result: the agent is constrained to one attached tab and cannot open additional tabs.
+
+### 3) Multi-Tab Controlled Workflow
+
+If the task needs a few trusted tabs:
+
+1. Keep **Mode** on `Manual`.
+2. Switch to each required tab and click **+ Attach Current Tab**.
+3. Keep **No new tabs** on if you want to block any extra tab creation.
+
+Result: the agent can work only across the tabs you explicitly attached.
+
+### 4) Restriction Modes (How To Combine Them)
+
+- **Lock URL**: blocks navigation away from the current page (reload is still possible).
+- **No new tabs**: blocks agent-driven tab creation.
+- **Read-only**: blocks interaction methods (click/type/edit); useful for inspection-only runs.
+
+Common presets:
+
+- **Audit preset**: `Manual + No new tabs + Read-only`
+- **Form testing preset**: `Manual + No new tabs` (leave Read-only off)
+- **Pinned page preset**: `Manual + Lock URL + No new tabs`
+
+### 5) Auto-Cleanup After Use
+
+- **Auto-detach inactive tabs**: detaches tabs after 5-60 minutes of inactivity.
+- **Auto-close agent tabs**: closes tabs created by the agent after 5-60 minutes.
+
+Recommended:
+
+- Use `10-15 min` auto-detach for normal sessions.
+- Use auto-close when running broad exploration tasks that open many tabs.
+
 ## CLI
 
 Once installed globally (`npm install -g browserforce`), the CLI is available:
@@ -261,6 +317,8 @@ await cleanHTML('body', { showDiffSinceLastCall: false });
 await pageMarkdown({ showDiffSinceLastCall: true });
 ```
 
+Need concrete persona-based workflows? See [Actionable Use Cases](docs/USE_CASES.md).
+
 The `execute` tool gives the agent full Playwright access — it can navigate, click, type, screenshot, read accessibility trees, and run JavaScript in the page context. All within your real browser session.
 
 ## Examples
@@ -413,7 +471,7 @@ A: Any AI that supports MCP (OpenClaw, Claude Desktop, Claude Code) or any tool
 A: Chrome aggressively kills MV3 extensions after 30 seconds of inactivity. The relay sends keepalive pings every 5 seconds to prevent this. If the extension does restart, it auto-reconnects.
 
 **Q: Can I control which tabs the AI accesses?**
-A: Yes. Click the extension icon to switch between Auto mode (agent sees all tabs) and Manual mode (you select which tabs). You can also lock URLs, block new tabs, or enable read-only mode.
+A: Yes. In Auto mode the agent can create and control its own tabs. In Manual mode, you explicitly attach tabs with **+ Attach Current Tab**. You can also lock URLs, block new tabs, or enable read-only mode.
 
 **Q: Does it work with multiple windows?**
 A: Yes. All tabs across all Chrome windows are visible.
@@ -433,3 +491,5 @@ CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay st
 ```bash
 jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
 ```
+
+For incident/debug playbooks, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
diff --git a/README.md b/README.md
index 294fdb0..488e651 100644
--- a/README.md
+++ b/README.md
@@ -6,7 +6,7 @@
 
 **You're giving an AI your real Chrome — your logins, cookies, and sessions. That takes conviction.** BrowserForce is built for people who use the best models and don't look back. Security is built in: lock URLs, block navigation, read-only mode, auto-cleanup — you stay in control.
 
-**Fully autonomous browser control.** No manual tab clicking. Your agent browses as you, even from WhatsApp. Other tools make you click each tab, spawn a fresh Chrome, or only work with one AI client. BrowserForce connects to **your running browser** and auto-attaches to all tabs. One Chrome extension, full Playwright API, completely hands-off.
+**Autonomous when you want it, controlled when you need it.** Your agent can run hands-off in Auto mode, or you can switch to Manual mode and explicitly attach only the tabs you trust. BrowserForce connects to **your running browser** with one Chrome extension and full Playwright API support.
 
 Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-compatible agent.
 
@@ -16,10 +16,10 @@ Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-
 |---|---|---|---|---|---|
 | Browser | Spawns new Chrome | Separate profile | Your Chrome | Your Chrome | **Your Chrome** |
 | Login state | Fresh | Fresh (isolated) | Yours | Yours | **Yours** |
-| Tab access | N/A (new browser) | Managed by agent | Click each tab | Click each tab | **All tabs, automatic** |
+| Tab access | N/A (new browser) | Managed by agent | Click each tab | Click each tab | **Auto mode + manual attached tabs** |
 | Autonomous | Yes | Yes | No (manual click) | No (manual click) | **Yes (fully autonomous)** |
 | Context method | Screenshots (100KB+) | Screenshots + snapshots | A11y snapshots (5-20KB) | Screenshots (100KB+) | **A11y snapshots (5-20KB)** |
-| Tools | Many dedicated | 1 `browser` tool | 1 `execute` tool | Built-in | **3 tools: `execute`, `screenshot_with_labels`, `reset`** |
+| Tools | Many dedicated | 1 `browser` tool | 1 `execute` tool | Built-in | **2 tools: `execute`, `reset`** |
 | Agent support | Any MCP client | OpenClaw only | Any MCP client | Claude only | **Any MCP client** |
 | Playwright API | Partial | No | Full | No | **Full** |
 
@@ -108,10 +108,10 @@ browserforce serve
 
 If your agent browses to the page and responds with the title, you're all set.
 
-<details>
-<summary><b>MCP setup for OpenClaw, Claude, Codex, Cursor, and Antigravity</b></summary>
+**MCP setup (advanced):**
 
-#### OpenClaw (MCP adapter)
+<details>
+<summary><b>OpenClaw (MCP adapter)</b></summary>
 
 Add to `~/.openclaw/openclaw.json`:
 
@@ -137,7 +137,10 @@ Add to `~/.openclaw/openclaw.json`:
 }
 ```
 
-#### Claude Desktop
+</details>
+
+<details>
+<summary><b>Claude Desktop</b></summary>
 
 Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 
@@ -152,7 +155,10 @@ Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 }
 ```
 
-#### Claude Code
+</details>
+
+<details>
+<summary><b>Claude Code</b></summary>
 
 Add to `~/.claude/mcp.json`:
 
@@ -167,7 +173,10 @@ Add to `~/.claude/mcp.json`:
 }
 ```
 
-#### Codex
+</details>
+
+<details>
+<summary><b>Codex</b></summary>
 
 Add to `~/.codex/config.toml`:
 
@@ -177,7 +186,10 @@ command = "npx"
 args = ["-y", "browserforce@latest", "mcp"]
 ```
 
-#### Cursor
+</details>
+
+<details>
+<summary><b>Cursor</b></summary>
 
 Add to `~/.cursor/mcp.json`:
 
@@ -192,7 +204,10 @@ Add to `~/.cursor/mcp.json`:
 }
 ```
 
-#### Antigravity
+</details>
+
+<details>
+<summary><b>Antigravity</b></summary>
 
 In Antigravity: Agent panel -> `...` -> `Manage MCP Servers` -> `View raw config`.
 Add the same `mcpServers` entry:
@@ -208,14 +223,14 @@ Add the same `mcpServers` entry:
 }
 ```
 
+</details>
+
 If MCP startup fails with `connection closed: initialize response`:
 
 1. Ensure args include `"mcp"` (without it, BrowserForce prints help and exits).
 2. If running from a local clone, install deps first: `pnpm install`.
 3. Validate the launch command manually: `npx -y browserforce@latest mcp`
 
-</details>
-
 ### CLI
 
 ```bash
@@ -368,6 +383,8 @@ await cleanHTML('body', { showDiffSinceLastCall: false });
 await pageMarkdown({ showDiffSinceLastCall: true });
 ```
 
+Need role-based, real workflows? See [Actionable Use Cases](docs/USE_CASES.md).
+
 ## Examples
 
 Get started with simple prompts. The AI generates code and does the work.
@@ -519,9 +536,9 @@ Get started with simple prompts. The AI generates code and does the work.
 
 The **relay server** runs on your machine (localhost only). It translates between the agent's CDP commands and the extension's debugger bridge.
 
-The **Chrome extension** lives in your browser. It attaches Chrome's built-in debugger to your tabs and forwards commands — exactly like DevTools does.
+The **Chrome extension** lives in your browser. It attaches Chrome's built-in debugger to permitted tabs and forwards commands — exactly like DevTools does.
 
-When the agent connects, it immediately sees all your open tabs as controllable Playwright pages. No clicking, no manual attachment.
+In **Auto mode**, the agent can create and control tabs it opens. In **Manual mode**, you decide access by clicking **+ Attach Current Tab**.
 
 ## You Stay in Control
 
@@ -537,6 +554,16 @@ Click the extension icon to configure restrictions. Your browser, your rules:
 | **Auto-close** | Automatically close agent-created tabs after 5-60 minutes |
 | **Custom instructions** | Pass text instructions to the agent (e.g. "don't click any buy buttons") |
 
+### Controlled Tab Workflows
+
+- **Manually attach a tab:** Open the tab you want, click the extension popup, then click **+ Attach Current Tab**.
+- **Restrict to one controlled tab:** Use **Manual mode**, attach one tab, and enable **No new tabs**.
+- **Allow multiple controlled tabs:** Stay in **Manual mode** and attach each tab you want the agent to access.
+- **Restriction modes:** Use **Lock URL** (no navigation), **No new tabs**, and **Read-only** (observe only) together or separately.
+- **Auto-cleanup:** Use **Auto-detach** for inactive attached tabs and **Auto-close** for agent-created tabs.
+
+For step-by-step setups, see the [Controlled Tabs Playbook](GUIDE.md#controlled-tabs-playbook).
+
 ## Security
 
 | Layer | Control |
@@ -593,4 +620,6 @@ CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay st
 jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
 ```
 
+For practical debugging and operations flows, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
+
 > **Want the full walkthrough?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for a plain-English explanation of what this does and how to get started.
diff --git a/docs/USE_CASES.md b/docs/USE_CASES.md
new file mode 100644
index 0000000..8b8bbf7
--- /dev/null
+++ b/docs/USE_CASES.md
@@ -0,0 +1,326 @@
+# BrowserForce Use Cases (Actionable)
+
+This page is for real-world execution, not theory. Each section includes:
+
+- What role this is for
+- What you are trying to achieve
+- Which BrowserForce switches/helpers to use
+- A copy-paste example
+- What success looks like
+
+## Quick Switch Guide
+
+| Switch / Helper | Use it when | Typical outcome |
+|---|---|---|
+| `snapshot({ showDiffSinceLastCall: true })` | You are in a multi-step flow and want only changes | Faster loops, lower token usage, less noise |
+| `snapshot({ showDiffSinceLastCall: false })` | You need full context right now | Full tree and refs for reliable decisions |
+| `cleanHTML(selector, { showDiffSinceLastCall: true })` | You monitor DOM changes over time | Detect only meaningful structural changes |
+| `cleanHTML(selector, { showDiffSinceLastCall: false })` | You need full HTML snapshot for parsing | Complete cleaned HTML for extraction |
+| `pageMarkdown({ showDiffSinceLastCall: true })` | You monitor long-form content/pages | Alert only on content changes |
+| `pageMarkdown({ search: /.../ })` | You need targeted text checks | Focused findings with context lines |
+| `refToLocator({ ref: 'eN' })` | You got a ref from `snapshot()` and need a stable locator | Reliable interaction without brittle selectors |
+| `getCDPSession({ page })` | You need low-level CDP commands in relay environment | Raw CDP access with relay-safe session creation |
+
+## Feature-by-Feature Use Cases (High Impact First)
+
+This section maps each newly added capability to practical scenarios by user type.
+
+### 1) `snapshot({ showDiffSinceLastCall })` (Most Impactful)
+
+**Why this is high impact:** It cuts repeated context noise in long flows and makes automation loops faster.
+
+- **OpenClaw user scenario:** Checkout flow monitoring from chat
+  - Run a full baseline once, then diff mode on each step.
+  - You see only changed controls/messages after each action.
+- **Developer scenario:** Flaky UI reproduction loop
+  - Keep one stable script: `observe -> act -> observe diff`.
+  - Faster diagnosis when UI mutates between attempts.
+- **Other scenario (Ops / Monitoring):** Status page drift detection
+  - Poll snapshot diff on dashboards.
+  - Alert only when visible state changes, not every poll.
+
+**Example execute pattern:**
+
+```javascript
+await snapshot({ showDiffSinceLastCall: false }); // baseline once
+// ... perform one action
+return await snapshot({ showDiffSinceLastCall: true }); // concise change output
+```
+
+### 2) `refToLocator({ ref })`
+
+**Why this is high impact:** It converts snapshot refs into actionable selectors without brittle locator guessing.
+
+- **OpenClaw user scenario:** “Click the third approve button” from messaging app
+  - Agent inspects snapshot refs and resolves exact target with `refToLocator`.
+- **Developer scenario:** Remove flaky `nth()` selectors in tests
+  - Replace deep CSS chains with snapshot-ref resolution per step.
+- **Other scenario (Support):** Guided incident triage
+  - Agent can target the exact control visible in the current UI state.
+
+**Example execute pattern:**
+
+```javascript
+await snapshot({ showDiffSinceLastCall: false });
+const locator = refToLocator({ ref: 'e3' });
+if (!locator) throw new Error('ref e3 not available');
+await state.page.locator(locator).click();
+```
+
+### 3) `getCDPSession({ page })`
+
+**Why this is high impact:** It gives relay-safe low-level browser access for cases Playwright APIs do not cover cleanly.
+
+- **OpenClaw user scenario:** Advanced site diagnostics on authenticated pages
+  - Run protocol-level checks while still using real logged-in Chrome sessions.
+- **Developer scenario:** Deep debugging in relay environments
+  - Enable CDP domains (`Network`, `Runtime`, `Performance`) safely.
+- **Other scenario (QA):** Protocol verification in test workflows
+  - Validate low-level page/runtime conditions before/after critical actions.
+
+**Example execute pattern:**
+
+```javascript
+const cdp = await getCDPSession({ page: state.page });
+await cdp.send('Network.enable');
+return await cdp.send('Runtime.evaluate', { expression: 'document.readyState' });
+```
+
+### 4) Tactical Execute Playbook (Prompt Guidance)
+
+**Why this is high impact:** Better default agent behavior reduces dead-end runs on real websites.
+
+- **OpenClaw user scenario:** Cookie/consent/login blockers handled automatically
+  - Agent is guided to clear blockers before continuing.
+- **Developer scenario:** Correct extraction tool choice per task
+  - Guidance for `snapshot vs cleanHTML vs pageMarkdown` reduces wrong-tool usage.
+- **Other scenario (QA / Incident):** Faster root-cause loops
+  - “Combine snapshot + logs” guidance standardizes debugging flow.
+
+**Example prompt-to-agent outcomes:**
+
+- More reliable form/task completion on consent-heavy sites.
+- Fewer retries caused by stale locators after page updates.
+- Better extraction quality on article/news pages using `pageMarkdown`.
+
+### 5) Prompt/Test Regression Guards (Team Safety)
+
+**Why this is high impact:** Prevents silent drift between documented helper surface and runtime behavior.
+
+- **OpenClaw user scenario:** Stable agent behavior across updates
+  - Key guidance phrases remain enforced by tests.
+- **Developer scenario:** Safer refactors of MCP prompt/runtime
+  - Failing tests catch missing helper mentions or diff contract changes.
+- **Other scenario (Maintainers):** Predictable release quality
+  - Prompt contracts and helper exposure stay synchronized.
+
+**Operational check:**
+
+```bash
+node --test mcp/test/mcp-tools.test.js
+node --test mcp/test/exec-engine-plugins.test.js
+```
+
+## OpenClaw User (High Impact)
+
+### 1) Fast Checkout / Form Completion With Less Noise
+
+**Goal:** Complete long forms without re-reading the whole page every step.
+
+**Use:**
+- `snapshot({ showDiffSinceLastCall: true })`
+- `refToLocator({ ref })`
+
+**Example execute flow:**
+
+```javascript
+await snapshot({ showDiffSinceLastCall: false }); // baseline full view
+// ... fill step 1
+const delta = await snapshot({ showDiffSinceLastCall: true });
+return delta;
+```
+
+**Success looks like:** You only see what changed after each action, and fewer wrong clicks happen.
+
+### 2) Watch Your Competitor Pricing Page
+
+**Goal:** Detect only meaningful pricing-card changes.
+
+**Use:**
+- `cleanHTML('.pricing', { showDiffSinceLastCall: true })`
+
+**Example execute flow:**
+
+```javascript
+const first = await cleanHTML('.pricing', { showDiffSinceLastCall: true });
+const second = await cleanHTML('.pricing', { showDiffSinceLastCall: true });
+return { firstPreview: first.slice(0, 300), secondPreview: second.slice(0, 300) };
+```
+
+**Success looks like:** Second run returns either a compact diff or no-change guidance instead of full repeated markup.
+
+### 3) Track Policy/Terms Changes On Services You Use
+
+**Goal:** Be notified when legal/terms wording changes.
+
+**Use:**
+- `pageMarkdown({ showDiffSinceLastCall: true })`
+
+**Example execute flow:**
+
+```javascript
+await state.page.goto('https://example.com/terms');
+await waitForPageLoad();
+const baseline = await pageMarkdown({ showDiffSinceLastCall: true });
+const next = await pageMarkdown({ showDiffSinceLastCall: true });
+return { baselineLen: baseline.length, next };
+```
+
+**Success looks like:** You get concise change output only when terms changed.
+
+## Developer (High Impact)
+
+### 1) Debug “Action Sent But Nothing Happened”
+
+**Goal:** Find where command flow failed.
+
+**Use:**
+- CDP JSONL log (`~/.browserforce/cdp.jsonl`)
+
+**Run:**
+
+```bash
+jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
+```
+
+**Success looks like:** You can confirm whether the command reached extension and whether response/event returned to Playwright.
+
+### 2) Reproduce Flaky Interaction Deterministically
+
+**Goal:** Replace brittle selectors and stale refs.
+
+**Use:**
+- `snapshot({ showDiffSinceLastCall: false })`
+- `refToLocator({ ref })`
+
+**Example execute flow:**
+
+```javascript
+const snap = await snapshot({ showDiffSinceLastCall: false });
+const locator = refToLocator({ ref: 'e3' });
+if (!locator) throw new Error('ref e3 not available');
+await state.page.locator(locator).click();
+return await snapshot({ showDiffSinceLastCall: true });
+```
+
+**Success looks like:** Fewer flaky failures from stale `nth()`/deep CSS paths.
+
+### 3) Raw CDP Verification In Relay Context
+
+**Goal:** Inspect browser/network behavior beyond normal locator APIs.
+
+**Use:**
+- `getCDPSession({ page })`
+
+**Example execute flow:**
+
+```javascript
+const cdp = await getCDPSession({ page: state.page });
+await cdp.send('Network.enable');
+const result = await cdp.send('Runtime.evaluate', { expression: 'document.readyState' });
+return result;
+```
+
+**Success looks like:** You can run low-level checks without breaking relay compatibility.
+
+## QA / Automation Engineer
+
+### 1) Regression Diff Between Test Steps
+
+**Goal:** Catch unexpected UI changes early.
+
+**Use:**
+- `snapshot({ showDiffSinceLastCall: true })`
+
+**Example:** Run snapshot diff after each core step (`login -> cart -> checkout -> confirmation`) and fail test if unexpected controls appear/disappear.
+
+**Success looks like:** Smaller, reviewable diffs in CI logs.
+
+### 2) Validate Article/Release Notes Updates
+
+**Goal:** Verify content releases actually changed required sections.
+
+**Use:**
+- `pageMarkdown({ search: /feature-x|deprecation|breaking/i })`
+
+**Example execute flow:**
+
+```javascript
+await state.page.goto('https://example.com/changelog');
+await waitForPageLoad();
+return await pageMarkdown({ search: /feature-x|deprecation|breaking/i });
+```
+
+**Success looks like:** You immediately see whether required terms exist in published content.
+
+## Support / Incident Response
+
+### 1) Triaging User Reports Quickly
+
+**Goal:** Determine whether issue is UI, extension, or relay routing.
+
+**Use:**
+- `snapshot({ showDiffSinceLastCall: false })`
+- `getLogs({ count: 30 })`
+- `~/.browserforce/cdp.jsonl`
+
+**Flow:**
+1. Capture full snapshot.
+2. Capture console logs.
+3. Check CDP direction flow in JSONL.
+
+**Success looks like:** Clear fault domain in minutes, not guesswork.
+
+### 2) Verify Page-Load Deadlocks
+
+**Goal:** Confirm whether page is stuck vs automation issue.
+
+**Use:**
+- `waitForPageLoad({ timeout: ... })`
+- `snapshot({ showDiffSinceLastCall: true })`
+
+**Success looks like:** You can prove if the page state is unchanged over time and isolate blocker overlays quickly.
+
+## Compliance / Risk
+
+### 1) Continuous Monitoring Of Disclosures
+
+**Goal:** Alert on modifications in legal disclosures/policy text.
+
+**Use:**
+- `cleanHTML('main', { showDiffSinceLastCall: true })`
+- `pageMarkdown({ showDiffSinceLastCall: true })`
+
+**Success looks like:** Only meaningful textual/structural changes trigger review tickets.
+
+### 2) Local Audit Trail For Automation
+
+**Goal:** Keep evidence of what automation asked and what browser returned.
+
+**Use:**
+- `~/.browserforce/cdp.jsonl`
+
+**Run:**
+
+```bash
+tail -n 200 ~/.browserforce/cdp.jsonl
+```
+
+**Success looks like:** Actionable timeline for audits and postmortems.
+
+## Rollout Pattern For Teams
+
+1. Start with one workflow in `diff` mode (`showDiffSinceLastCall: true`).
+2. Keep one “escape hatch” call in full mode (`showDiffSinceLastCall: false`) for debugging.
+3. Add JSONL checks to incident runbooks.
+4. Standardize around `snapshot -> refToLocator -> action -> snapshot diff`.

From 5dd7022776963cbbdd93f5ba01e3fd8b3599bf10 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 20:29:18 +0530
Subject: [PATCH 048/192] docs: expand controlled-tab guidance and persona use
 cases

---
 GUIDE.md          |  62 ++++++++-
 README.md         |  59 ++++++---
 docs/USE_CASES.md | 326 ++++++++++++++++++++++++++++++++++++++++++++++
 3 files changed, 431 insertions(+), 16 deletions(-)
 create mode 100644 docs/USE_CASES.md

diff --git a/GUIDE.md b/GUIDE.md
index d831550..bcae7bc 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -215,6 +215,62 @@ for (const page of pages) {
 }
 ```
 
+## Controlled Tabs Playbook
+
+Use this section when you want strict control over what the agent can touch.
+
+### 1) Manually Attach A Tab
+
+1. Open the exact tab you want the agent to use.
+2. Click the BrowserForce extension icon.
+3. In the popup, click **+ Attach Current Tab**.
+4. Confirm it appears under **Controlled Tabs**.
+
+This is the fastest way to grant access to an already logged-in page without exposing other tabs.
+
+### 2) Single-Tab Locked Workflow
+
+For high-safety tasks (admin pages, billing pages, production dashboards):
+
+1. Set **Mode** to `Manual`.
+2. Attach only one tab using **+ Attach Current Tab**.
+3. Enable **No new tabs**.
+4. Optionally enable **Lock URL** and **Read-only** depending on the task.
+
+Result: the agent is constrained to one attached tab and cannot open additional tabs.
+
+### 3) Multi-Tab Controlled Workflow
+
+If the task needs a few trusted tabs:
+
+1. Keep **Mode** on `Manual`.
+2. Switch to each required tab and click **+ Attach Current Tab**.
+3. Keep **No new tabs** on if you want to block any extra tab creation.
+
+Result: the agent can work only across the tabs you explicitly attached.
+
+### 4) Restriction Modes (How To Combine Them)
+
+- **Lock URL**: blocks navigation away from the current page (reload is still possible).
+- **No new tabs**: blocks agent-driven tab creation.
+- **Read-only**: blocks interaction methods (click/type/edit); useful for inspection-only runs.
+
+Common presets:
+
+- **Audit preset**: `Manual + No new tabs + Read-only`
+- **Form testing preset**: `Manual + No new tabs` (leave Read-only off)
+- **Pinned page preset**: `Manual + Lock URL + No new tabs`
+
+### 5) Auto-Cleanup After Use
+
+- **Auto-detach inactive tabs**: detaches tabs after 5-60 minutes of inactivity.
+- **Auto-close agent tabs**: closes tabs created by the agent after 5-60 minutes.
+
+Recommended:
+
+- Use `10-15 min` auto-detach for normal sessions.
+- Use auto-close when running broad exploration tasks that open many tabs.
+
 ## CLI
 
 Once installed globally (`npm install -g browserforce`), the CLI is available:
@@ -261,6 +317,8 @@ await cleanHTML('body', { showDiffSinceLastCall: false });
 await pageMarkdown({ showDiffSinceLastCall: true });
 ```
 
+Need concrete persona-based workflows? See [Actionable Use Cases](docs/USE_CASES.md).
+
 The `execute` tool gives the agent full Playwright access — it can navigate, click, type, screenshot, read accessibility trees, and run JavaScript in the page context. All within your real browser session.
 
 ## Examples
@@ -413,7 +471,7 @@ A: Any AI that supports MCP (OpenClaw, Claude Desktop, Claude Code) or any tool
 A: Chrome aggressively kills MV3 extensions after 30 seconds of inactivity. The relay sends keepalive pings every 5 seconds to prevent this. If the extension does restart, it auto-reconnects.
 
 **Q: Can I control which tabs the AI accesses?**
-A: Yes. Click the extension icon to switch between Auto mode (agent sees all tabs) and Manual mode (you select which tabs). You can also lock URLs, block new tabs, or enable read-only mode.
+A: Yes. In Auto mode the agent can create and control its own tabs. In Manual mode, you explicitly attach tabs with **+ Attach Current Tab**. You can also lock URLs, block new tabs, or enable read-only mode.
 
 **Q: Does it work with multiple windows?**
 A: Yes. All tabs across all Chrome windows are visible.
@@ -433,3 +491,5 @@ CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay st
 ```bash
 jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
 ```
+
+For incident/debug playbooks, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
diff --git a/README.md b/README.md
index 294fdb0..488e651 100644
--- a/README.md
+++ b/README.md
@@ -6,7 +6,7 @@
 
 **You're giving an AI your real Chrome — your logins, cookies, and sessions. That takes conviction.** BrowserForce is built for people who use the best models and don't look back. Security is built in: lock URLs, block navigation, read-only mode, auto-cleanup — you stay in control.
 
-**Fully autonomous browser control.** No manual tab clicking. Your agent browses as you, even from WhatsApp. Other tools make you click each tab, spawn a fresh Chrome, or only work with one AI client. BrowserForce connects to **your running browser** and auto-attaches to all tabs. One Chrome extension, full Playwright API, completely hands-off.
+**Autonomous when you want it, controlled when you need it.** Your agent can run hands-off in Auto mode, or you can switch to Manual mode and explicitly attach only the tabs you trust. BrowserForce connects to **your running browser** with one Chrome extension and full Playwright API support.
 
 Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-compatible agent.
 
@@ -16,10 +16,10 @@ Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-
 |---|---|---|---|---|---|
 | Browser | Spawns new Chrome | Separate profile | Your Chrome | Your Chrome | **Your Chrome** |
 | Login state | Fresh | Fresh (isolated) | Yours | Yours | **Yours** |
-| Tab access | N/A (new browser) | Managed by agent | Click each tab | Click each tab | **All tabs, automatic** |
+| Tab access | N/A (new browser) | Managed by agent | Click each tab | Click each tab | **Auto mode + manual attached tabs** |
 | Autonomous | Yes | Yes | No (manual click) | No (manual click) | **Yes (fully autonomous)** |
 | Context method | Screenshots (100KB+) | Screenshots + snapshots | A11y snapshots (5-20KB) | Screenshots (100KB+) | **A11y snapshots (5-20KB)** |
-| Tools | Many dedicated | 1 `browser` tool | 1 `execute` tool | Built-in | **3 tools: `execute`, `screenshot_with_labels`, `reset`** |
+| Tools | Many dedicated | 1 `browser` tool | 1 `execute` tool | Built-in | **2 tools: `execute`, `reset`** |
 | Agent support | Any MCP client | OpenClaw only | Any MCP client | Claude only | **Any MCP client** |
 | Playwright API | Partial | No | Full | No | **Full** |
 
@@ -108,10 +108,10 @@ browserforce serve
 
 If your agent browses to the page and responds with the title, you're all set.
 
-<details>
-<summary><b>MCP setup for OpenClaw, Claude, Codex, Cursor, and Antigravity</b></summary>
+**MCP setup (advanced):**
 
-#### OpenClaw (MCP adapter)
+<details>
+<summary><b>OpenClaw (MCP adapter)</b></summary>
 
 Add to `~/.openclaw/openclaw.json`:
 
@@ -137,7 +137,10 @@ Add to `~/.openclaw/openclaw.json`:
 }
 ```
 
-#### Claude Desktop
+</details>
+
+<details>
+<summary><b>Claude Desktop</b></summary>
 
 Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 
@@ -152,7 +155,10 @@ Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 }
 ```
 
-#### Claude Code
+</details>
+
+<details>
+<summary><b>Claude Code</b></summary>
 
 Add to `~/.claude/mcp.json`:
 
@@ -167,7 +173,10 @@ Add to `~/.claude/mcp.json`:
 }
 ```
 
-#### Codex
+</details>
+
+<details>
+<summary><b>Codex</b></summary>
 
 Add to `~/.codex/config.toml`:
 
@@ -177,7 +186,10 @@ command = "npx"
 args = ["-y", "browserforce@latest", "mcp"]
 ```
 
-#### Cursor
+</details>
+
+<details>
+<summary><b>Cursor</b></summary>
 
 Add to `~/.cursor/mcp.json`:
 
@@ -192,7 +204,10 @@ Add to `~/.cursor/mcp.json`:
 }
 ```
 
-#### Antigravity
+</details>
+
+<details>
+<summary><b>Antigravity</b></summary>
 
 In Antigravity: Agent panel -> `...` -> `Manage MCP Servers` -> `View raw config`.
 Add the same `mcpServers` entry:
@@ -208,14 +223,14 @@ Add the same `mcpServers` entry:
 }
 ```
 
+</details>
+
 If MCP startup fails with `connection closed: initialize response`:
 
 1. Ensure args include `"mcp"` (without it, BrowserForce prints help and exits).
 2. If running from a local clone, install deps first: `pnpm install`.
 3. Validate the launch command manually: `npx -y browserforce@latest mcp`
 
-</details>
-
 ### CLI
 
 ```bash
@@ -368,6 +383,8 @@ await cleanHTML('body', { showDiffSinceLastCall: false });
 await pageMarkdown({ showDiffSinceLastCall: true });
 ```
 
+Need role-based, real workflows? See [Actionable Use Cases](docs/USE_CASES.md).
+
 ## Examples
 
 Get started with simple prompts. The AI generates code and does the work.
@@ -519,9 +536,9 @@ Get started with simple prompts. The AI generates code and does the work.
 
 The **relay server** runs on your machine (localhost only). It translates between the agent's CDP commands and the extension's debugger bridge.
 
-The **Chrome extension** lives in your browser. It attaches Chrome's built-in debugger to your tabs and forwards commands — exactly like DevTools does.
+The **Chrome extension** lives in your browser. It attaches Chrome's built-in debugger to permitted tabs and forwards commands — exactly like DevTools does.
 
-When the agent connects, it immediately sees all your open tabs as controllable Playwright pages. No clicking, no manual attachment.
+In **Auto mode**, the agent can create and control tabs it opens. In **Manual mode**, you decide access by clicking **+ Attach Current Tab**.
 
 ## You Stay in Control
 
@@ -537,6 +554,16 @@ Click the extension icon to configure restrictions. Your browser, your rules:
 | **Auto-close** | Automatically close agent-created tabs after 5-60 minutes |
 | **Custom instructions** | Pass text instructions to the agent (e.g. "don't click any buy buttons") |
 
+### Controlled Tab Workflows
+
+- **Manually attach a tab:** Open the tab you want, click the extension popup, then click **+ Attach Current Tab**.
+- **Restrict to one controlled tab:** Use **Manual mode**, attach one tab, and enable **No new tabs**.
+- **Allow multiple controlled tabs:** Stay in **Manual mode** and attach each tab you want the agent to access.
+- **Restriction modes:** Use **Lock URL** (no navigation), **No new tabs**, and **Read-only** (observe only) together or separately.
+- **Auto-cleanup:** Use **Auto-detach** for inactive attached tabs and **Auto-close** for agent-created tabs.
+
+For step-by-step setups, see the [Controlled Tabs Playbook](GUIDE.md#controlled-tabs-playbook).
+
 ## Security
 
 | Layer | Control |
@@ -593,4 +620,6 @@ CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay st
 jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
 ```
 
+For practical debugging and operations flows, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
+
 > **Want the full walkthrough?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for a plain-English explanation of what this does and how to get started.
diff --git a/docs/USE_CASES.md b/docs/USE_CASES.md
new file mode 100644
index 0000000..8b8bbf7
--- /dev/null
+++ b/docs/USE_CASES.md
@@ -0,0 +1,326 @@
+# BrowserForce Use Cases (Actionable)
+
+This page is for real-world execution, not theory. Each section includes:
+
+- What role this is for
+- What you are trying to achieve
+- Which BrowserForce switches/helpers to use
+- A copy-paste example
+- What success looks like
+
+## Quick Switch Guide
+
+| Switch / Helper | Use it when | Typical outcome |
+|---|---|---|
+| `snapshot({ showDiffSinceLastCall: true })` | You are in a multi-step flow and want only changes | Faster loops, lower token usage, less noise |
+| `snapshot({ showDiffSinceLastCall: false })` | You need full context right now | Full tree and refs for reliable decisions |
+| `cleanHTML(selector, { showDiffSinceLastCall: true })` | You monitor DOM changes over time | Detect only meaningful structural changes |
+| `cleanHTML(selector, { showDiffSinceLastCall: false })` | You need full HTML snapshot for parsing | Complete cleaned HTML for extraction |
+| `pageMarkdown({ showDiffSinceLastCall: true })` | You monitor long-form content/pages | Alert only on content changes |
+| `pageMarkdown({ search: /.../ })` | You need targeted text checks | Focused findings with context lines |
+| `refToLocator({ ref: 'eN' })` | You got a ref from `snapshot()` and need a stable locator | Reliable interaction without brittle selectors |
+| `getCDPSession({ page })` | You need low-level CDP commands in relay environment | Raw CDP access with relay-safe session creation |
+
+## Feature-by-Feature Use Cases (High Impact First)
+
+This section maps each newly added capability to practical scenarios by user type.
+
+### 1) `snapshot({ showDiffSinceLastCall })` (Most Impactful)
+
+**Why this is high impact:** It cuts repeated context noise in long flows and makes automation loops faster.
+
+- **OpenClaw user scenario:** Checkout flow monitoring from chat
+  - Run a full baseline once, then diff mode on each step.
+  - You see only changed controls/messages after each action.
+- **Developer scenario:** Flaky UI reproduction loop
+  - Keep one stable script: `observe -> act -> observe diff`.
+  - Faster diagnosis when UI mutates between attempts.
+- **Other scenario (Ops / Monitoring):** Status page drift detection
+  - Poll snapshot diff on dashboards.
+  - Alert only when visible state changes, not every poll.
+
+**Example execute pattern:**
+
+```javascript
+await snapshot({ showDiffSinceLastCall: false }); // baseline once
+// ... perform one action
+return await snapshot({ showDiffSinceLastCall: true }); // concise change output
+```
+
+### 2) `refToLocator({ ref })`
+
+**Why this is high impact:** It converts snapshot refs into actionable selectors without brittle locator guessing.
+
+- **OpenClaw user scenario:** “Click the third approve button” from messaging app
+  - Agent inspects snapshot refs and resolves exact target with `refToLocator`.
+- **Developer scenario:** Remove flaky `nth()` selectors in tests
+  - Replace deep CSS chains with snapshot-ref resolution per step.
+- **Other scenario (Support):** Guided incident triage
+  - Agent can target the exact control visible in the current UI state.
+
+**Example execute pattern:**
+
+```javascript
+await snapshot({ showDiffSinceLastCall: false });
+const locator = refToLocator({ ref: 'e3' });
+if (!locator) throw new Error('ref e3 not available');
+await state.page.locator(locator).click();
+```
+
+### 3) `getCDPSession({ page })`
+
+**Why this is high impact:** It gives relay-safe low-level browser access for cases Playwright APIs do not cover cleanly.
+
+- **OpenClaw user scenario:** Advanced site diagnostics on authenticated pages
+  - Run protocol-level checks while still using real logged-in Chrome sessions.
+- **Developer scenario:** Deep debugging in relay environments
+  - Enable CDP domains (`Network`, `Runtime`, `Performance`) safely.
+- **Other scenario (QA):** Protocol verification in test workflows
+  - Validate low-level page/runtime conditions before/after critical actions.
+
+**Example execute pattern:**
+
+```javascript
+const cdp = await getCDPSession({ page: state.page });
+await cdp.send('Network.enable');
+return await cdp.send('Runtime.evaluate', { expression: 'document.readyState' });
+```
+
+### 4) Tactical Execute Playbook (Prompt Guidance)
+
+**Why this is high impact:** Better default agent behavior reduces dead-end runs on real websites.
+
+- **OpenClaw user scenario:** Cookie/consent/login blockers handled automatically
+  - Agent is guided to clear blockers before continuing.
+- **Developer scenario:** Correct extraction tool choice per task
+  - Guidance for `snapshot vs cleanHTML vs pageMarkdown` reduces wrong-tool usage.
+- **Other scenario (QA / Incident):** Faster root-cause loops
+  - “Combine snapshot + logs” guidance standardizes debugging flow.
+
+**Example prompt-to-agent outcomes:**
+
+- More reliable form/task completion on consent-heavy sites.
+- Fewer retries caused by stale locators after page updates.
+- Better extraction quality on article/news pages using `pageMarkdown`.
+
+### 5) Prompt/Test Regression Guards (Team Safety)
+
+**Why this is high impact:** Prevents silent drift between documented helper surface and runtime behavior.
+
+- **OpenClaw user scenario:** Stable agent behavior across updates
+  - Key guidance phrases remain enforced by tests.
+- **Developer scenario:** Safer refactors of MCP prompt/runtime
+  - Failing tests catch missing helper mentions or diff contract changes.
+- **Other scenario (Maintainers):** Predictable release quality
+  - Prompt contracts and helper exposure stay synchronized.
+
+**Operational check:**
+
+```bash
+node --test mcp/test/mcp-tools.test.js
+node --test mcp/test/exec-engine-plugins.test.js
+```
+
+## OpenClaw User (High Impact)
+
+### 1) Fast Checkout / Form Completion With Less Noise
+
+**Goal:** Complete long forms without re-reading the whole page every step.
+
+**Use:**
+- `snapshot({ showDiffSinceLastCall: true })`
+- `refToLocator({ ref })`
+
+**Example execute flow:**
+
+```javascript
+await snapshot({ showDiffSinceLastCall: false }); // baseline full view
+// ... fill step 1
+const delta = await snapshot({ showDiffSinceLastCall: true });
+return delta;
+```
+
+**Success looks like:** You only see what changed after each action, and fewer wrong clicks happen.
+
+### 2) Watch Your Competitor Pricing Page
+
+**Goal:** Detect only meaningful pricing-card changes.
+
+**Use:**
+- `cleanHTML('.pricing', { showDiffSinceLastCall: true })`
+
+**Example execute flow:**
+
+```javascript
+const first = await cleanHTML('.pricing', { showDiffSinceLastCall: true });
+const second = await cleanHTML('.pricing', { showDiffSinceLastCall: true });
+return { firstPreview: first.slice(0, 300), secondPreview: second.slice(0, 300) };
+```
+
+**Success looks like:** Second run returns either a compact diff or no-change guidance instead of full repeated markup.
+
+### 3) Track Policy/Terms Changes On Services You Use
+
+**Goal:** Be notified when legal/terms wording changes.
+
+**Use:**
+- `pageMarkdown({ showDiffSinceLastCall: true })`
+
+**Example execute flow:**
+
+```javascript
+await state.page.goto('https://example.com/terms');
+await waitForPageLoad();
+const baseline = await pageMarkdown({ showDiffSinceLastCall: true });
+const next = await pageMarkdown({ showDiffSinceLastCall: true });
+return { baselineLen: baseline.length, next };
+```
+
+**Success looks like:** You get concise change output only when terms changed.
+
+## Developer (High Impact)
+
+### 1) Debug “Action Sent But Nothing Happened”
+
+**Goal:** Find where command flow failed.
+
+**Use:**
+- CDP JSONL log (`~/.browserforce/cdp.jsonl`)
+
+**Run:**
+
+```bash
+jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
+```
+
+**Success looks like:** You can confirm whether the command reached extension and whether response/event returned to Playwright.
+
+### 2) Reproduce Flaky Interaction Deterministically
+
+**Goal:** Replace brittle selectors and stale refs.
+
+**Use:**
+- `snapshot({ showDiffSinceLastCall: false })`
+- `refToLocator({ ref })`
+
+**Example execute flow:**
+
+```javascript
+const snap = await snapshot({ showDiffSinceLastCall: false });
+const locator = refToLocator({ ref: 'e3' });
+if (!locator) throw new Error('ref e3 not available');
+await state.page.locator(locator).click();
+return await snapshot({ showDiffSinceLastCall: true });
+```
+
+**Success looks like:** Fewer flaky failures from stale `nth()`/deep CSS paths.
+
+### 3) Raw CDP Verification In Relay Context
+
+**Goal:** Inspect browser/network behavior beyond normal locator APIs.
+
+**Use:**
+- `getCDPSession({ page })`
+
+**Example execute flow:**
+
+```javascript
+const cdp = await getCDPSession({ page: state.page });
+await cdp.send('Network.enable');
+const result = await cdp.send('Runtime.evaluate', { expression: 'document.readyState' });
+return result;
+```
+
+**Success looks like:** You can run low-level checks without breaking relay compatibility.
+
+## QA / Automation Engineer
+
+### 1) Regression Diff Between Test Steps
+
+**Goal:** Catch unexpected UI changes early.
+
+**Use:**
+- `snapshot({ showDiffSinceLastCall: true })`
+
+**Example:** Run snapshot diff after each core step (`login -> cart -> checkout -> confirmation`) and fail test if unexpected controls appear/disappear.
+
+**Success looks like:** Smaller, reviewable diffs in CI logs.
+
+### 2) Validate Article/Release Notes Updates
+
+**Goal:** Verify content releases actually changed required sections.
+
+**Use:**
+- `pageMarkdown({ search: /feature-x|deprecation|breaking/i })`
+
+**Example execute flow:**
+
+```javascript
+await state.page.goto('https://example.com/changelog');
+await waitForPageLoad();
+return await pageMarkdown({ search: /feature-x|deprecation|breaking/i });
+```
+
+**Success looks like:** You immediately see whether required terms exist in published content.
+
+## Support / Incident Response
+
+### 1) Triaging User Reports Quickly
+
+**Goal:** Determine whether issue is UI, extension, or relay routing.
+
+**Use:**
+- `snapshot({ showDiffSinceLastCall: false })`
+- `getLogs({ count: 30 })`
+- `~/.browserforce/cdp.jsonl`
+
+**Flow:**
+1. Capture full snapshot.
+2. Capture console logs.
+3. Check CDP direction flow in JSONL.
+
+**Success looks like:** Clear fault domain in minutes, not guesswork.
+
+### 2) Verify Page-Load Deadlocks
+
+**Goal:** Confirm whether page is stuck vs automation issue.
+
+**Use:**
+- `waitForPageLoad({ timeout: ... })`
+- `snapshot({ showDiffSinceLastCall: true })`
+
+**Success looks like:** You can prove if the page state is unchanged over time and isolate blocker overlays quickly.
+
+## Compliance / Risk
+
+### 1) Continuous Monitoring Of Disclosures
+
+**Goal:** Alert on modifications in legal disclosures/policy text.
+
+**Use:**
+- `cleanHTML('main', { showDiffSinceLastCall: true })`
+- `pageMarkdown({ showDiffSinceLastCall: true })`
+
+**Success looks like:** Only meaningful textual/structural changes trigger review tickets.
+
+### 2) Local Audit Trail For Automation
+
+**Goal:** Keep evidence of what automation asked and what browser returned.
+
+**Use:**
+- `~/.browserforce/cdp.jsonl`
+
+**Run:**
+
+```bash
+tail -n 200 ~/.browserforce/cdp.jsonl
+```
+
+**Success looks like:** Actionable timeline for audits and postmortems.
+
+## Rollout Pattern For Teams
+
+1. Start with one workflow in `diff` mode (`showDiffSinceLastCall: true`).
+2. Keep one “escape hatch” call in full mode (`showDiffSinceLastCall: false`) for debugging.
+3. Add JSONL checks to incident runbooks.
+4. Standardize around `snapshot -> refToLocator -> action -> snapshot diff`.

From 3040db1749da06e2ac0e25d91ff88c098b554ade Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 20:38:54 +0530
Subject: [PATCH 049/192] docs: update positioning to parallel ai agents in
 your chrome

---
 GUIDE.md  | 2 ++
 README.md | 2 +-
 2 files changed, 3 insertions(+), 1 deletion(-)

diff --git a/GUIDE.md b/GUIDE.md
index bcae7bc..7dd06bd 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -1,5 +1,7 @@
 # BrowserForce — User Guide
 
+**BrowserForce // Parallel AI Agents in "your" Chrome!**
+
 ## What is this?
 
 BrowserForce gives AI agents — like [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-compatible tool — access to **your real Chrome browser**. The one you're already logged into. No headless browser, no fake profiles. The AI sees your actual tabs and can interact with any website using your existing sessions.
diff --git a/README.md b/README.md
index 488e651..0df7634 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,4 @@
-# BrowserForce // 
+# BrowserForce // Parallel AI Agents in "your" Chrome!
 
 > "a lion doesn't concern itself with token counting" — [@steipete](https://x.com/steipete), creator of [OpenClaw](https://github.com/openclaw/openclaw)
 >

From f2cd8da3577877f9727149c3e7b65db4fdbb12d0 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 20:59:22 +0530
Subject: [PATCH 050/192] feat(relay): add client arbitration config defaults

---
 relay/src/index.js              | 10 ++++++++++
 relay/test/relay-server.test.js |  6 ++++++
 2 files changed, 16 insertions(+)

diff --git a/relay/src/index.js b/relay/src/index.js
index 5326072..c2f2cf1 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -16,6 +16,8 @@ const BF_DIR = path.join(os.homedir(), '.browserforce');
 const TOKEN_FILE = path.join(BF_DIR, 'auth-token');
 const CDP_URL_FILE = path.join(BF_DIR, 'cdp-url');
 const BF_PLUGINS_DIR = path.join(BF_DIR, 'plugins');
+const CLIENT_MODE_SINGLE = 'single-active';
+const CLIENT_MODE_MULTI = 'multi-client';
 
 // ─── Logging ─────────────────────────────────────────────────────────────────
 
@@ -48,6 +50,11 @@ function writeCdpUrlFile(cdpUrl) {
   }
 }
 
+function getClientMode() {
+  const mode = (process.env.BF_CLIENT_MODE || CLIENT_MODE_SINGLE).trim();
+  return mode === CLIENT_MODE_MULTI ? CLIENT_MODE_MULTI : CLIENT_MODE_SINGLE;
+}
+
 // ─── RelayServer ─────────────────────────────────────────────────────────────
 
 const DEFAULT_BROWSER_CONTEXT_ID = 'bf-default-context';
@@ -130,6 +137,9 @@ class RelayServer {
     this.port = port;
     this.pluginsDir = pluginsDir;
     this.authToken = getOrCreateAuthToken();
+    this.clientMode = getClientMode();
+    this.activeClient = null; // { id, ws, connectedAt, lastSeenAt }
+    this.clientSeq = 0;
 
     // Extension connection (single slot)
     this.ext = null;
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index 3b36b2a..bc8a02c 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -101,6 +101,12 @@ describe('Token Persistence', () => {
   const tmpDir = path.join(os.tmpdir(), `bf-test-${crypto.randomBytes(4).toString('hex')}`);
   const origBfDir = BF_DIR;
 
+  it('defaults to single-active client mode', () => {
+    delete process.env.BF_CLIENT_MODE;
+    const relay = new RelayServer(getRandomPort());
+    assert.equal(relay.clientMode, 'single-active');
+  });
+
   it('creates auth token file on first run', () => {
     // RelayServer reads token from the global BF_DIR.
     // We just verify the token is a non-empty string.

From b6fce07b695060960bd119d1a43014dfed318a4d Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:07:55 +0530
Subject: [PATCH 051/192] feat(relay): enforce single active cdp client mode

---
 relay/src/index.js              | 26 ++++++++++++++++++++++++--
 relay/test/relay-server.test.js | 22 ++++++++++++++++++++++
 2 files changed, 46 insertions(+), 2 deletions(-)

diff --git a/relay/src/index.js b/relay/src/index.js
index c2f2cf1..472fd63 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -182,7 +182,7 @@ class RelayServer {
 
     server.on('upgrade', (req, socket, head) => this._handleUpgrade(req, socket, head));
     this.extWss.on('connection', (ws) => this._onExtConnect(ws));
-    this.cdpWss.on('connection', (ws) => this._onCdpConnect(ws));
+    this.cdpWss.on('connection', (ws, req) => this._onCdpConnect(ws, req));
 
     this.server = server;
 
@@ -419,6 +419,16 @@ class RelayServer {
         socket.destroy();
         return;
       }
+      if (this.clientMode === CLIENT_MODE_SINGLE) {
+        if (this.activeClient && this.activeClient.ws.readyState === WebSocket.OPEN) {
+          const body = JSON.stringify({ error: 'Another CDP client is already connected' });
+          socket.write(
+            `HTTP/1.1 409 Conflict\r\nContent-Type: application/json\r\nContent-Length: ${Buffer.byteLength(body)}\r\nConnection: close\r\n\r\n${body}`
+          );
+          socket.destroy();
+          return;
+        }
+      }
       this.cdpWss.handleUpgrade(req, socket, head, (ws) => {
         this.cdpWss.emit('connection', ws, req);
       });
@@ -644,11 +654,20 @@ class RelayServer {
 
   // ─── CDP Client Connection ──────────────────────────────────────────────
 
-  _onCdpConnect(ws) {
+  _onCdpConnect(ws, req) {
+    const clientId = `bf-cdp-${++this.clientSeq}`;
+    ws._bfClientId = clientId;
+    if (this.clientMode === CLIENT_MODE_SINGLE) {
+      const now = Date.now();
+      this.activeClient = { id: clientId, ws, connectedAt: now, lastSeenAt: now };
+    }
     log('[relay] CDP client connected');
     this.clients.add(ws);
 
     ws.on('message', (data) => {
+      if (this.clientMode === CLIENT_MODE_SINGLE && this.activeClient?.id === clientId) {
+        this.activeClient.lastSeenAt = Date.now();
+      }
       try {
         const msg = JSON.parse(data.toString());
         this._handleCdpClientMessage(ws, msg);
@@ -660,6 +679,9 @@ class RelayServer {
     ws.on('close', () => {
       log('[relay] CDP client disconnected');
       this.clients.delete(ws);
+      if (this.activeClient?.id === clientId) {
+        this.activeClient = null;
+      }
     });
 
     ws.on('error', (err) => {
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index bc8a02c..ab21dac 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -439,6 +439,28 @@ describe('WebSocket Security', () => {
     ws.close();
   });
 
+  it('rejects second /cdp client in single-active mode', async () => {
+    const prevMode = process.env.BF_CLIENT_MODE;
+    process.env.BF_CLIENT_MODE = 'single-active';
+    let c1;
+    let c2;
+    try {
+      c1 = await connectWs(`ws://127.0.0.1:${port}/cdp?token=${relay.authToken}`);
+      await assert.rejects(
+        (async () => {
+          c2 = await connectWs(`ws://127.0.0.1:${port}/cdp?token=${relay.authToken}`);
+          c2.close();
+        })(),
+        /409|Unexpected/
+      );
+    } finally {
+      if (c1 && c1.readyState === WebSocket.OPEN) c1.close();
+      if (c2 && c2.readyState === WebSocket.OPEN) c2.close();
+      if (prevMode === undefined) delete process.env.BF_CLIENT_MODE;
+      else process.env.BF_CLIENT_MODE = prevMode;
+    }
+  });
+
   it('rejects second extension connection (single slot)', async () => {
     const ws1 = await connectWs(`ws://127.0.0.1:${port}/extension`, {
       headers: { Origin: 'chrome-extension://first' },

From b7b5093d036246908b997cf1510c43935b800fec Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:14:56 +0530
Subject: [PATCH 052/192] test(relay): make single-active contention test
 deterministic

---
 relay/test/relay-server.test.js | 8 ++++++--
 1 file changed, 6 insertions(+), 2 deletions(-)

diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index ab21dac..d232244 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -442,13 +442,16 @@ describe('WebSocket Security', () => {
   it('rejects second /cdp client in single-active mode', async () => {
     const prevMode = process.env.BF_CLIENT_MODE;
     process.env.BF_CLIENT_MODE = 'single-active';
+    const singleRelay = new RelayServer(getRandomPort());
+    await singleRelay.start({ writeCdpUrl: false });
     let c1;
     let c2;
     try {
-      c1 = await connectWs(`ws://127.0.0.1:${port}/cdp?token=${relay.authToken}`);
+      assert.equal(singleRelay.clientMode, 'single-active');
+      c1 = await connectWs(`ws://127.0.0.1:${singleRelay.port}/cdp?token=${singleRelay.authToken}`);
       await assert.rejects(
         (async () => {
-          c2 = await connectWs(`ws://127.0.0.1:${port}/cdp?token=${relay.authToken}`);
+          c2 = await connectWs(`ws://127.0.0.1:${singleRelay.port}/cdp?token=${singleRelay.authToken}`);
           c2.close();
         })(),
         /409|Unexpected/
@@ -456,6 +459,7 @@ describe('WebSocket Security', () => {
     } finally {
       if (c1 && c1.readyState === WebSocket.OPEN) c1.close();
       if (c2 && c2.readyState === WebSocket.OPEN) c2.close();
+      singleRelay.stop();
       if (prevMode === undefined) delete process.env.BF_CLIENT_MODE;
       else process.env.BF_CLIENT_MODE = prevMode;
     }

From 5ab3d522d63bf0572bfb143cba447d74566477e7 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:26:50 +0530
Subject: [PATCH 053/192] feat(relay): release active slot on disconnect and
 expose slot status

---
 relay/src/index.js              | 14 ++++-
 relay/test/relay-server.test.js | 97 +++++++++++++++++++++++++++++++++
 2 files changed, 110 insertions(+), 1 deletion(-)

diff --git a/relay/src/index.js b/relay/src/index.js
index 472fd63..d2c8611 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -233,6 +233,18 @@ class RelayServer {
       return;
     }
 
+    if (url.pathname === '/client-slot') {
+      const activeWsOpen = this.activeClient?.ws?.readyState === WebSocket.OPEN;
+      const busy = this.clientMode === CLIENT_MODE_SINGLE && activeWsOpen;
+      res.end(JSON.stringify({
+        mode: this.clientMode,
+        busy,
+        activeClientId: busy ? this.activeClient.id : null,
+        connectedAt: busy ? this.activeClient.connectedAt : null,
+      }));
+      return;
+    }
+
     if (url.pathname === '/json/version') {
       res.end(JSON.stringify({
         Browser: 'BrowserForce/1.0',
@@ -679,7 +691,7 @@ class RelayServer {
     ws.on('close', () => {
       log('[relay] CDP client disconnected');
       this.clients.delete(ws);
-      if (this.activeClient?.id === clientId) {
+      if (this.activeClient?.ws === ws) {
         this.activeClient = null;
       }
     });
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index d232244..2da74b4 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -465,6 +465,103 @@ describe('WebSocket Security', () => {
     }
   });
 
+  it('allows standby client after active client disconnects', async () => {
+    const prevMode = process.env.BF_CLIENT_MODE;
+    process.env.BF_CLIENT_MODE = 'single-active';
+    const singleRelay = new RelayServer(getRandomPort());
+    await singleRelay.start({ writeCdpUrl: false });
+
+    let activeClient;
+    let standbyClient;
+    let rejectedClient;
+    try {
+      activeClient = await connectWs(`ws://127.0.0.1:${singleRelay.port}/cdp?token=${singleRelay.authToken}`);
+      const slotWhileActive = await httpGet(`http://127.0.0.1:${singleRelay.port}/client-slot`);
+      assert.equal(slotWhileActive.status, 200);
+      assert.equal(slotWhileActive.body.busy, true);
+
+      await assert.rejects(
+        (async () => {
+          rejectedClient = await connectWs(`ws://127.0.0.1:${singleRelay.port}/cdp?token=${singleRelay.authToken}`);
+          rejectedClient.close();
+        })(),
+        /409|Unexpected/
+      );
+
+      const activeClosed = new Promise((resolve) => activeClient.once('close', resolve));
+      activeClient.close();
+      await activeClosed;
+
+      await waitForCondition(() => singleRelay.activeClient === null, {
+        description: 'active client slot release',
+      });
+
+      const slotAfterDisconnect = await httpGet(`http://127.0.0.1:${singleRelay.port}/client-slot`);
+      assert.equal(slotAfterDisconnect.status, 200);
+      assert.equal(slotAfterDisconnect.body.busy, false);
+
+      standbyClient = await connectWs(`ws://127.0.0.1:${singleRelay.port}/cdp?token=${singleRelay.authToken}`);
+      assert.equal(standbyClient.readyState, WebSocket.OPEN);
+    } finally {
+      if (activeClient && activeClient.readyState === WebSocket.OPEN) activeClient.close();
+      if (standbyClient && standbyClient.readyState === WebSocket.OPEN) standbyClient.close();
+      if (rejectedClient && rejectedClient.readyState === WebSocket.OPEN) rejectedClient.close();
+      singleRelay.stop();
+      if (prevMode === undefined) delete process.env.BF_CLIENT_MODE;
+      else process.env.BF_CLIENT_MODE = prevMode;
+    }
+  });
+
+  it('GET /client-slot returns mode and active status', async () => {
+    const prevMode = process.env.BF_CLIENT_MODE;
+    process.env.BF_CLIENT_MODE = 'single-active';
+    const singleRelay = new RelayServer(getRandomPort());
+    await singleRelay.start({ writeCdpUrl: false });
+
+    let activeClient;
+    try {
+      const before = await httpGet(`http://127.0.0.1:${singleRelay.port}/client-slot`);
+      assert.equal(before.status, 200);
+      assert.deepEqual(before.body, {
+        mode: 'single-active',
+        busy: false,
+        activeClientId: null,
+        connectedAt: null,
+      });
+
+      activeClient = await connectWs(`ws://127.0.0.1:${singleRelay.port}/cdp?token=${singleRelay.authToken}`);
+
+      const during = await httpGet(`http://127.0.0.1:${singleRelay.port}/client-slot`);
+      assert.equal(during.status, 200);
+      assert.equal(during.body.mode, 'single-active');
+      assert.equal(during.body.busy, true);
+      assert.equal(typeof during.body.activeClientId, 'string');
+      assert.equal(typeof during.body.connectedAt, 'number');
+
+      const activeClosed = new Promise((resolve) => activeClient.once('close', resolve));
+      activeClient.close();
+      await activeClosed;
+
+      await waitForCondition(() => singleRelay.activeClient === null, {
+        description: 'active client slot release',
+      });
+
+      const after = await httpGet(`http://127.0.0.1:${singleRelay.port}/client-slot`);
+      assert.equal(after.status, 200);
+      assert.deepEqual(after.body, {
+        mode: 'single-active',
+        busy: false,
+        activeClientId: null,
+        connectedAt: null,
+      });
+    } finally {
+      if (activeClient && activeClient.readyState === WebSocket.OPEN) activeClient.close();
+      singleRelay.stop();
+      if (prevMode === undefined) delete process.env.BF_CLIENT_MODE;
+      else process.env.BF_CLIENT_MODE = prevMode;
+    }
+  });
+
   it('rejects second extension connection (single slot)', async () => {
     const ws1 = await connectWs(`ws://127.0.0.1:${port}/extension`, {
       headers: { Origin: 'chrome-extension://first' },

From a9cd31e3ff0f22059b10b4aac4930e205e7b2ba1 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:34:45 +0530
Subject: [PATCH 054/192] feat(relay): preserve multi-client fallback mode

---
 relay/test/relay-server.test.js | 21 +++++++++++++++++++++
 1 file changed, 21 insertions(+)

diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index 2da74b4..ef3c581 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -465,6 +465,27 @@ describe('WebSocket Security', () => {
     }
   });
 
+  it('allows multiple /cdp clients when BF_CLIENT_MODE=multi-client', async () => {
+    const prevMode = process.env.BF_CLIENT_MODE;
+    process.env.BF_CLIENT_MODE = 'multi-client';
+    const multiRelay = new RelayServer(getRandomPort());
+    await multiRelay.start({ writeCdpUrl: false });
+    let c1;
+    let c2;
+    try {
+      c1 = await connectWs(`ws://127.0.0.1:${multiRelay.port}/cdp?token=${multiRelay.authToken}`);
+      c2 = await connectWs(`ws://127.0.0.1:${multiRelay.port}/cdp?token=${multiRelay.authToken}`);
+      assert.equal(c1.readyState, WebSocket.OPEN);
+      assert.equal(c2.readyState, WebSocket.OPEN);
+    } finally {
+      if (c1 && c1.readyState === WebSocket.OPEN) c1.close();
+      if (c2 && c2.readyState === WebSocket.OPEN) c2.close();
+      multiRelay.stop();
+      if (prevMode === undefined) delete process.env.BF_CLIENT_MODE;
+      else process.env.BF_CLIENT_MODE = prevMode;
+    }
+  });
+
   it('allows standby client after active client disconnects', async () => {
     const prevMode = process.env.BF_CLIENT_MODE;
     process.env.BF_CLIENT_MODE = 'single-active';

From d1793491c8e9d3036f751bcd24b9add961174144 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:39:59 +0530
Subject: [PATCH 055/192] docs(mcp): add parallel-first tab swarm policy and
 real-world examples

---
 GUIDE.md                   |  83 +++++++++++
 README.md                  | 277 ++++++++++++++++++++++++-------------
 mcp/src/index.js           |  22 ++-
 mcp/test/mcp-tools.test.js |  28 ++++
 4 files changed, 313 insertions(+), 97 deletions(-)

diff --git a/GUIDE.md b/GUIDE.md
index 7dd06bd..990d484 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -323,6 +323,89 @@ Need concrete persona-based workflows? See [Actionable Use Cases](docs/USE_CASES
 
 The `execute` tool gives the agent full Playwright access — it can navigate, click, type, screenshot, read accessibility trees, and run JavaScript in the page context. All within your real browser session.
 
+### BrowserForce Tab Swarms // Parallel Tabs Processing
+
+Use this for read-only count/list/extraction tasks where each target is independent (different pages, dates, or items).
+
+- Start parallel-first with `Promise.all` and a concurrency cap (`3-8`, usually start at `5`).
+- If you hit `429`, anti-bot pages, or repeated timeout failures, automatically retry with reduced concurrency.
+- If reduced concurrency still fails, fall back to sequential processing.
+- Return telemetry on every swarm run: `peakConcurrentTasks`, `wallClockMs`, `sumTaskDurationsMs`, `failures`, `retries`.
+
+Example execute pattern:
+
+```javascript
+const items = state.items ?? [];
+const startedAt = Date.now();
+let peakConcurrentTasks = 0;
+let sumTaskDurationsMs = 0;
+let failures = 0;
+let retries = 0;
+
+async function runTask(item, page) {
+  const t0 = Date.now();
+  try {
+    await page.goto(item.url);
+    await waitForPageLoad({ timeout: 15000 });
+    const value = await page.locator(item.selector).first().textContent();
+    return { ok: true, item, value };
+  } catch (error) {
+    const msg = String(error?.message || error);
+    const retryable = /429|timeout|captcha|challenge|blocked/i.test(msg);
+    return { ok: false, item, retryable, error: msg };
+  } finally {
+    sumTaskDurationsMs += Date.now() - t0;
+  }
+}
+
+async function runWithCap(targetItems, cap) {
+  const results = [];
+  for (let i = 0; i < targetItems.length; i += cap) {
+    const batch = targetItems.slice(i, i + cap);
+    peakConcurrentTasks = Math.max(peakConcurrentTasks, batch.length);
+    const tabs = await Promise.all(batch.map(() => context.newPage()));
+    const batchResults = await Promise.all(batch.map((item, idx) => runTask(item, tabs[idx])));
+    await Promise.all(tabs.map((p) => p.close().catch(() => {})));
+    results.push(...batchResults);
+  }
+  return results;
+}
+
+let results = await runWithCap(items, 5);
+let retryable = results.filter((r) => !r.ok && r.retryable).map((r) => r.item);
+
+if (retryable.length) {
+  retries += 1;
+  const retried = await runWithCap(retryable, 2); // reduced concurrency fallback
+  const settled = new Map(results.filter((r) => r.ok).map((r) => [r.item.url, r]));
+  for (const r of retried) settled.set(r.item.url, r);
+  results = [...settled.values()];
+  retryable = results.filter((r) => !r.ok && r.retryable).map((r) => r.item);
+}
+
+if (retryable.length) {
+  retries += 1;
+  for (const item of retryable) {
+    const tab = await context.newPage();
+    const r = await runTask(item, tab); // sequential fallback
+    await tab.close().catch(() => {});
+    results.push(r);
+  }
+}
+
+failures = results.filter((r) => !r.ok).length;
+return {
+  results,
+  telemetry: {
+    peakConcurrentTasks,
+    wallClockMs: Date.now() - startedAt,
+    sumTaskDurationsMs,
+    failures,
+    retries,
+  },
+};
+```
+
 ## Examples
 
 These prompts show how 10x users work with BrowserForce. The AI generates the code and handles the work — you just describe what you need.
diff --git a/README.md b/README.md
index 0df7634..a9a81ae 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,6 @@
-# BrowserForce // Parallel AI Agents in "your" Chrome!
+# BrowserForce // Parallel AI Agents in "your" Browser!
+
+Give AI agents controlled access to the browser you already use.
 
 > "a lion doesn't concern itself with token counting" — [@steipete](https://x.com/steipete), creator of [OpenClaw](https://github.com/openclaw/openclaw)
 >
@@ -12,16 +14,18 @@ Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-
 
 ## Comparison
 
-| | Playwright MCP | OpenClaw Browser | Playwriter | Claude Extension | BrowserForce |
-|---|---|---|---|---|---|
-| Browser | Spawns new Chrome | Separate profile | Your Chrome | Your Chrome | **Your Chrome** |
-| Login state | Fresh | Fresh (isolated) | Yours | Yours | **Yours** |
-| Tab access | N/A (new browser) | Managed by agent | Click each tab | Click each tab | **Auto mode + manual attached tabs** |
-| Autonomous | Yes | Yes | No (manual click) | No (manual click) | **Yes (fully autonomous)** |
-| Context method | Screenshots (100KB+) | Screenshots + snapshots | A11y snapshots (5-20KB) | Screenshots (100KB+) | **A11y snapshots (5-20KB)** |
-| Tools | Many dedicated | 1 `browser` tool | 1 `execute` tool | Built-in | **2 tools: `execute`, `reset`** |
-| Agent support | Any MCP client | OpenClaw only | Any MCP client | Claude only | **Any MCP client** |
-| Playwright API | Partial | No | Full | No | **Full** |
+
+|                | Playwright MCP       | OpenClaw Browser        | Playwriter              | Claude Extension     | BrowserForce                         |
+| -------------- | -------------------- | ----------------------- | ----------------------- | -------------------- | ------------------------------------ |
+| Browser        | Spawns new Chrome    | Separate profile        | Your Chrome             | Your Chrome          | **Your Chrome**                      |
+| Login state    | Fresh                | Fresh (isolated)        | Yours                   | Yours                | **Yours**                            |
+| Tab access     | N/A (new browser)    | Managed by agent        | Click each tab          | Click each tab       | **Auto mode + manual attached tabs** |
+| Autonomous     | Yes                  | Yes                     | No (manual click)       | No (manual click)    | **Yes (fully autonomous)**           |
+| Context method | Screenshots (100KB+) | Screenshots + snapshots | A11y snapshots (5-20KB) | Screenshots (100KB+) | **A11y snapshots (5-20KB)**          |
+| Tools          | Many dedicated       | 1 `browser` tool        | 1 `execute` tool        | Built-in             | **2 tools: `execute`, `reset`**      |
+| Agent support  | Any MCP client       | OpenClaw only           | Any MCP client          | Claude only          | **Any MCP client**                   |
+| Playwright API | Partial              | No                      | Full                    | No                   | **Full**                             |
+
 
 ## Your Credentials Stay Yours
 
@@ -30,6 +34,7 @@ Every other approach asks you to hand over something: an API key, an OAuth token
 **Why?** Because you're already logged in. BrowserForce talks to your running Chrome — it doesn't extract credentials, store cookies, or replay tokens. The browser handles auth exactly as it always has. Your agent inherits your sessions the same way a new Chrome tab does.
 
 What you never need to provide:
+
 - No passwords
 - No API keys
 - No OAuth tokens
@@ -61,8 +66,8 @@ pnpm install
 2. Open `chrome://extensions/` in Chrome
 3. Enable **Developer mode** (top-right toggle)
 4. Click **Load unpacked** → a file picker opens
-   - **macOS**: press `Cmd+Shift+G`, paste the path from step 1, press Enter
-   - **Windows/Linux**: paste the path directly into the address bar of the dialog
+  - **macOS**: press `Cmd+Shift+G`, paste the path from step 1, press Enter
+  - **Windows/Linux**: paste the path directly into the address bar of the dialog
 
 ❗ After every BrowserForce update, re-run `browserforce install-extension`, then reload the extension in `chrome://extensions/` (click the ↺ icon next to BrowserForce).
 
@@ -104,14 +109,13 @@ browserforce serve
 
 **Verify it works** — send this to your agent:
 
-> Go to https://x.com and give me top tweets
+> Go to [https://x.com](https://x.com) and give me top tweets
 
 If your agent browses to the page and responds with the title, you're all set.
 
 **MCP setup (advanced):**
 
-<details>
-<summary><b>OpenClaw (MCP adapter)</b></summary>
+**OpenClaw (MCP adapter)**
 
 Add to `~/.openclaw/openclaw.json`:
 
@@ -137,10 +141,9 @@ Add to `~/.openclaw/openclaw.json`:
 }
 ```
 
-</details>
 
-<details>
-<summary><b>Claude Desktop</b></summary>
+
+**Claude Desktop**
 
 Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 
@@ -155,10 +158,9 @@ Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 }
 ```
 
-</details>
 
-<details>
-<summary><b>Claude Code</b></summary>
+
+**Claude Code**
 
 Add to `~/.claude/mcp.json`:
 
@@ -173,10 +175,9 @@ Add to `~/.claude/mcp.json`:
 }
 ```
 
-</details>
 
-<details>
-<summary><b>Codex</b></summary>
+
+**Codex**
 
 Add to `~/.codex/config.toml`:
 
@@ -186,10 +187,9 @@ command = "npx"
 args = ["-y", "browserforce@latest", "mcp"]
 ```
 
-</details>
 
-<details>
-<summary><b>Cursor</b></summary>
+
+**Cursor**
 
 Add to `~/.cursor/mcp.json`:
 
@@ -204,10 +204,9 @@ Add to `~/.cursor/mcp.json`:
 }
 ```
 
-</details>
 
-<details>
-<summary><b>Antigravity</b></summary>
+
+**Antigravity**
 
 In Antigravity: Agent panel -> `...` -> `Manage MCP Servers` -> `View raw config`.
 Add the same `mcpServers` entry:
@@ -223,7 +222,7 @@ Add the same `mcpServers` entry:
 }
 ```
 
-</details>
+
 
 If MCP startup fails with `connection closed: initialize response`:
 
@@ -268,10 +267,12 @@ That's it. Restart MCP (or Claude Desktop) and `highlight()` is available in eve
 
 ### Official plugins
 
-| Plugin | What it adds | Install |
-|--------|-------------|---------|
+
+| Plugin      | What it adds                                                                                   | Install                                 |
+| ----------- | ---------------------------------------------------------------------------------------------- | --------------------------------------- |
 | `highlight` | `highlight(selector, color?)` — outlines matching elements; `clearHighlights()` — removes them | `browserforce plugin install highlight` |
 
+
 ### Use an installed plugin
 
 After installing `highlight`, your agent can call it directly:
@@ -367,10 +368,12 @@ state.results = await page.evaluate(() => document.title);
 
 ### MCP Tools
 
-| Tool | Description |
-|------|-------------|
+
+| Tool      | Description                                                                                                                                                                                                                    |
+| --------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
 | `execute` | Run Playwright JavaScript in your real Chrome. Access `page`, `context`, `state`, `snapshot()`, `waitForPageLoad()`, `getLogs()`, `screenshotWithAccessibilityLabels()`, `cleanHTML()`, `pageMarkdown()`, and Node.js globals. |
-| `reset` | Reconnect to the relay and clear state. Use when the connection drops. |
+| `reset`   | Reconnect to the relay and clear state. Use when the connection drops.                                                                                                                                                         |
+
 
 ### Diff-Aware Helpers
 
@@ -383,133 +386,204 @@ await cleanHTML('body', { showDiffSinceLastCall: false });
 await pageMarkdown({ showDiffSinceLastCall: true });
 ```
 
+### BrowserForce Tab Swarms // Parallel Tabs Processing
+
+BrowserForce uses a parallel-first policy for independent extraction jobs, so agents finish list/count/scrape tasks faster with bounded risk.
+
+- Rule: For count/list/extraction across independent pages, dates, or items, run parallel tabs first using `Promise.all` with a concurrency cap (`3-8`, typically start at `5`).
+- Fallback: If the site starts rate-limiting (`429`), anti-bot challenges appear, or timeouts repeat, automatically retry with reduced concurrency and then sequential as a final fallback.
+- Safety: This swarm exception is for read-only bulk extraction only; no user-tab mutation (checkout/purchase/send/delete/settings changes) during swarm runs.
+- Required telemetry return: `peakConcurrentTasks`, `wallClockMs`, `sumTaskDurationsMs`, `failures`, `retries`.
+
 Need role-based, real workflows? See [Actionable Use Cases](docs/USE_CASES.md).
 
 ## Examples
 
 Get started with simple prompts. The AI generates code and does the work.
 
-<details>
-<summary><b>Example 1: Read page content (X.com search)</b></summary>
+**Example 1: Read page content (X.com search)**
 
 **Prompt to AI:**
+
 > Go to x.com/search and search for "browserforce". Show me the top 5 tweets you find.
 
 **What the AI does:** Navigates to X, searches the term, extracts top tweets, returns them to you.
 
 **Use case:** Quick research, trend tracking, social listening.
 
-</details>
 
-<details>
-<summary><b>Example 2: Interact with a form (GitHub search)</b></summary>
+
+**Example 2: Interact with a form (GitHub search)**
 
 **Prompt to AI:**
+
 > Go to GitHub and search for "ai agents". Show me the top 3 repositories and their star counts.
 
 **What the AI does:** Fills GitHub search, waits for results, extracts repo names + stars, returns them.
 
 **Use case:** Finding libraries, competitive research, project discovery.
 
-</details>
+
 
 ### Multi-Tab Workflows
 
-<details>
-<summary><b>Example 3: Search → Extract → Return</b></summary>
+**Example 3: Search → Extract → Return**
 
 **Prompt to AI:**
+
 > Search ProductHunt for "AI tools" and give me the top 5 products with their taglines and upvote counts.
 
 **What the AI does:** Navigates ProductHunt, searches, extracts product info, returns structured data.
 
 **Use case:** Market research, finding tools, competitive analysis.
 
-</details>
 
-<details>
-<summary><b>Example 4: Open result in new tab, process there</b></summary>
+
+**Example 4: Open result in new tab, process there**
 
 **Prompt to AI:**
+
 > Find the #1 product from your last ProductHunt search, click into it, and read the full description. Tell me what it does.
 
 **What the AI does:** Opens the product page from previous results, reads the description, summarizes it.
 
 **Use case:** Deep-dive research, understanding competitors, due diligence.
 
-</details>
 
-<details>
-<summary><b>Example 5: Debugging workflow (inspect + verify)</b></summary>
+
+**Example 5: Debugging workflow (inspect + verify)**
 
 **Prompt to AI:**
+
 > Go to my staging site at staging.myapp.com/checkout and take a labeled screenshot. Tell me if the "Complete Purchase" button is visible and what's around it.
 
 **What the AI does:** Navigates, takes screenshot with interactive labels, analyzes button state and layout.
 
 **Use case:** Visual debugging, QA checks, spotting broken elements.
 
-</details>
 
-<details>
-<summary><b>Example 6: Test form with data</b></summary>
+
+**Example 6: Test form with data**
 
 **Prompt to AI:**
-> Sign up for Substack using the email test.user@example.com. Tell me if the signup completes successfully.
+
+> Sign up for Substack using the email [test.user@example.com](mailto:test.user@example.com). Tell me if the signup completes successfully.
 
 **What the AI does:** Fills the form, submits, waits for confirmation, reports success/failure.
 
 **Use case:** Testing sign-up flows, QA automation, form validation.
 
-</details>
 
-<details>
-<summary><b>Example 7: Content pipeline (search → extract → compare)</b></summary>
+
+**Example 7: Content pipeline (search → extract → compare)**
 
 **Prompt to AI:**
+
 > Search for "AI regulation" on both X.com and LinkedIn. Give me the top 5 trending posts from each and tell me which topics overlap.
 
 **What the AI does:** Searches both platforms, extracts posts, compares content, returns analysis.
 
 **Use case:** Multi-source research, trend analysis, market sentiment.
 
-</details>
 
-<details>
-<summary><b>Example 8: Data extraction → CSV pipeline</b></summary>
+
+**Example 8: Data extraction → CSV pipeline**
 
 **Prompt to AI:**
+
 > Go to Hacker News and extract the top 10 stories with their titles and vote counts. Format as CSV so I can import into a spreadsheet.
 
 **What the AI does:** Navigates HN, extracts story data, formats as CSV, returns it ready to paste.
 
 **Use case:** Data workflows, trend tracking, content curation.
 
-</details>
 
-<details>
-<summary><b>Example 9: A/B testing across variants</b></summary>
+
+**Example 9: A/B testing across variants**
 
 **Prompt to AI:**
+
 > Visit myapp.com/?variant=red and myapp.com/?variant=blue. Compare the two designs and tell me which button color is more prominent and what other differences exist.
 
 **What the AI does:** Opens both variants, compares layouts/colors/text, reports visual differences.
 
 **Use case:** Design QA, A/B testing, variant comparison.
 
-</details>
 
-<details>
-<summary><b>Example 10: Monitor + alert workflow</b></summary>
+
+**Example 10: Monitor + alert workflow**
 
 **Prompt to AI:**
+
 > Check our status page at status.myapp.com every few minutes. Tell me the current status of the API and database. Alert me if anything changes from green to red.
 
 **What the AI does:** Monitors status page, reads indicators, alerts on degradation.
 
 **Use case:** Uptime monitoring, incident detection, SLA tracking.
 
-</details>
+
+
+### Parallel Tab Swarms: Real-World Use Cases
+
+**Example 11: Retail price swarm (SKU × store matrix)**
+
+**Prompt to AI:**
+
+> For these 25 SKUs, check Amazon, Walmart, Target, and Best Buy in parallel tabs. Return the best price, in-stock status, and fastest delivery ETA per SKU.
+
+**What the AI does:** Runs independent `(sku, store)` checks in capped parallel tab batches, retries with reduced concurrency on `429`/timeouts, then falls back sequentially if needed.
+
+**Use case:** Pricing intelligence, buy-box monitoring, merchandising ops.
+
+
+
+**Example 12: Travel fare grid (date × route sweep)**
+
+**Prompt to AI:**
+
+> For SFO → JFK, scan the next 14 Fridays and Sundays across Google Flights, Kayak, and Expedia. Return the cheapest refundable option for each date.
+
+**What the AI does:** Opens independent `(date, site)` tasks in parallel, extracts fare + refundability, and returns a normalized comparison table.
+
+**Use case:** Travel operations, procurement, rapid itinerary optimization.
+
+
+
+**Example 13: Competitor launch radar (company × source)**
+
+**Prompt to AI:**
+
+> Track the last 7 days of updates for these 30 competitors across release notes, changelogs, docs, and blog posts. Group findings by feature category.
+
+**What the AI does:** Parallelizes `(company, source)` extraction, deduplicates announcements, and returns a launch digest with links.
+
+**Use case:** Product strategy, PM intelligence, competitive monitoring.
+
+
+
+**Example 14: Lead qualification swarm (account × signal source)**
+
+**Prompt to AI:**
+
+> For this account list, check careers pages, LinkedIn jobs, pricing pages, and press/news for expansion signals. Score each account and rank top opportunities.
+
+**What the AI does:** Executes independent account-source checks in parallel tabs, extracts signal evidence, and returns ranked lead scores with rationale.
+
+**Use case:** Sales research, outbound prioritization, RevOps signal mining.
+
+
+
+**Example 15: Security exposure triage (domain × surface)**
+
+**Prompt to AI:**
+
+> For these domains, inspect login pages, robots.txt, status pages, public docs, and likely staging links. Flag suspicious exposures with evidence links.
+
+**What the AI does:** Runs read-only `(domain, surface)` checks in a swarm, retries degraded paths safely, and returns a risk-prioritized findings report.
+
+**Use case:** Security reviews, surface mapping, pre-audit triage.
+
+
 
 **More examples** and detailed walkthrough available in the [User Guide](GUIDE.md#examples).
 
@@ -544,16 +618,18 @@ In **Auto mode**, the agent can create and control tabs it opens. In **Manual mo
 
 Click the extension icon to configure restrictions. Your browser, your rules:
 
-| Setting | What it does |
-|---------|-------------|
-| **Auto / Manual mode** | Let the agent create tabs freely, or hand-pick which tabs it can access |
-| **Lock URL** | Prevent the agent from navigating away from the current page |
-| **No new tabs** | Block the agent from opening new tabs |
-| **Read-only** | Observe only — no clicks, no typing, no interactions |
-| **Auto-detach** | Automatically detach inactive tabs after 5-60 minutes |
-| **Auto-close** | Automatically close agent-created tabs after 5-60 minutes |
+
+| Setting                 | What it does                                                             |
+| ----------------------- | ------------------------------------------------------------------------ |
+| **Auto / Manual mode**  | Let the agent create tabs freely, or hand-pick which tabs it can access  |
+| **Lock URL**            | Prevent the agent from navigating away from the current page             |
+| **No new tabs**         | Block the agent from opening new tabs                                    |
+| **Read-only**           | Observe only — no clicks, no typing, no interactions                     |
+| **Auto-detach**         | Automatically detach inactive tabs after 5-60 minutes                    |
+| **Auto-close**          | Automatically close agent-created tabs after 5-60 minutes                |
 | **Custom instructions** | Pass text instructions to the agent (e.g. "don't click any buy buttons") |
 
+
 ### Controlled Tab Workflows
 
 - **Manually attach a tab:** Open the tab you want, click the extension popup, then click **+ Attach Current Tab**.
@@ -566,19 +642,22 @@ For step-by-step setups, see the [Controlled Tabs Playbook](GUIDE.md#controlled-
 
 ## Security
 
-| Layer | Control |
-|-------|---------|
-| **Network** | Relay binds to `127.0.0.1` only — never exposed to the internet |
-| **Auth** | Random token required for every CDP connection |
-| **Origin** | Extension only accepts connections from its own Chrome origin |
-| **Visibility** | Chrome shows "controlled by automated test software" on active tabs |
+
+| Layer            | Control                                                                 |
+| ---------------- | ----------------------------------------------------------------------- |
+| **Network**      | Relay binds to `127.0.0.1` only — never exposed to the internet         |
+| **Auth**         | Random token required for every CDP connection                          |
+| **Origin**       | Extension only accepts connections from its own Chrome origin           |
+| **Visibility**   | Chrome shows "controlled by automated test software" on active tabs     |
 | **Restrictions** | Lock URLs, block navigation, read-only mode — enforced at the CDP level |
 
+
 Everything runs on your machine. The auth token is stored at `~/.browserforce/auth-token` with owner-only permissions.
 
 ## Configuration
 
 **Custom relay port:**
+
 ```bash
 RELAY_PORT=19333 browserforce serve
 ```
@@ -586,6 +665,7 @@ RELAY_PORT=19333 browserforce serve
 **Extension relay URL:** Click the extension icon → change the URL → Save. Default: `ws://127.0.0.1:19222/extension`
 
 **Override CDP URL for MCP:**
+
 ```json
 {
   "env": {
@@ -596,23 +676,27 @@ RELAY_PORT=19333 browserforce serve
 
 ## API
 
-| Endpoint | Description |
-|----------|-------------|
-| `GET /` | Health check (extension status, target count) |
-| `GET /json/version` | CDP discovery |
-| `GET /json/list` | List attached targets |
-| `ws://.../extension` | Chrome extension WebSocket |
-| `ws://.../cdp?token=...` | Agent CDP connection |
+
+| Endpoint                 | Description                                   |
+| ------------------------ | --------------------------------------------- |
+| `GET /`                  | Health check (extension status, target count) |
+| `GET /json/version`      | CDP discovery                                 |
+| `GET /json/list`         | List attached targets                         |
+| `ws://.../extension`     | Chrome extension WebSocket                    |
+| `ws://.../cdp?token=...` | Agent CDP connection                          |
+
 
 ## Troubleshooting
 
-| Problem | Fix |
-|---------|-----|
-| Extension stays gray | Is the relay running? Check `http://127.0.0.1:19222/` |
-| "Another debugger attached" | Close DevTools for that tab |
-| Agent sees 0 pages | Open at least one regular webpage (not `chrome://`) |
-| Extension keeps reconnecting | Normal — MV3 kills idle workers; it auto-recovers |
-| Port in use | `lsof -ti:19222 \| xargs kill -9` |
+
+| Problem                      | Fix                                                   |
+| ---------------------------- | ----------------------------------------------------- |
+| Extension stays gray         | Is the relay running? Check `http://127.0.0.1:19222/` |
+| "Another debugger attached"  | Close DevTools for that tab                           |
+| Agent sees 0 pages           | Open at least one regular webpage (not `chrome://`)   |
+| Extension keeps reconnecting | Normal — MV3 kills idle workers; it auto-recovers     |
+| Port in use                  | `lsof -ti:19222 | xargs kill -9`                      |
+
 
 CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay start). Summarize traffic by direction + method:
 
@@ -623,3 +707,4 @@ jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.
 For practical debugging and operations flows, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
 
 > **Want the full walkthrough?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for a plain-English explanation of what this does and how to get started.
+
diff --git a/mcp/src/index.js b/mcp/src/index.js
index b3bc12d..e523484 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -180,6 +180,7 @@ After every action, verify its result before proceeding:
 
 Never chain multiple actions blindly. If you click a button, verify it worked before clicking the next.
 Each execute call should do ONE meaningful action and return verification.
+Exception: Multi-step is allowed for read-only bulk extraction when actions are independent and no user-tab mutation occurs.
 
 When navigating:
   await state.page.goto(url);
@@ -294,6 +295,24 @@ snapshot vs cleanHTML vs pageMarkdown:
   3) Use pageMarkdown() for article/blog/news pages where nav/ads should be removed.
   4) Use screenshotWithAccessibilityLabels() only when layout/visual evidence is required.
 
+═══ BROWSERFORCE TAB SWARMS // PARALLEL TABS PROCESSING ═══
+
+Parallel-first policy for independent extraction:
+  1) For count/list/extraction across independent pages, dates, or items, start with parallel tabs first.
+  2) Use Promise.all with a concurrency cap (typically 3-8; start at 5 unless site limits are known).
+  3) Keep swarm runs read-only and isolated to agent-created tabs (no checkout/purchase/send/delete/profile changes).
+  4) If you hit 429, anti-bot challenges, or repeated timeouts, automatically retry with reduced concurrency.
+  5) If reduced concurrency still fails, retry sequentially.
+
+Always return telemetry for swarm runs:
+  {
+    peakConcurrentTasks,
+    wallClockMs,
+    sumTaskDurationsMs,
+    failures,
+    retries
+  }
+
 ═══ DEBUGGING WORKFLOW ═══
 
 Combine snapshot + logs:
@@ -375,7 +394,8 @@ When you need the full tree instead of diff output:
 ✗ Don't chain actions without verifying — observe after each action
 ✗ Don't use page.waitForTimeout() — use waitForPageLoad() or waitFor()
 ✗ Don't forget to return a value — every call should return verification
-✗ Don't write complex multi-step scripts — split into separate execute calls
+✗ Don't write complex multi-step scripts by default — split into separate execute calls
+✓ Exception: Multi-step is allowed for read-only bulk extraction when actions are independent and no user-tab mutation occurs
 ✗ Don't use page variable directly — use state.page after first call setup
 
 ═══ ERROR RECOVERY ═══
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index fcefc4b..da8f2c0 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -148,6 +148,34 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('Downloads'), 'should include download pattern');
   });
 
+  it('execute prompt includes parallel-first swarm policy and telemetry contract', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
+      'utf8'
+    );
+    const promptStart = source.indexOf('const EXECUTE_PROMPT');
+    const promptEnd = source.indexOf("server.tool(\n  'execute'");
+    const promptBlock = source.slice(promptStart, promptEnd);
+
+    assert.ok(
+      promptBlock.includes('BROWSERFORCE TAB SWARMS // PARALLEL TABS PROCESSING'),
+      'should include tab swarm policy section'
+    );
+    assert.ok(
+      promptBlock.includes('Promise.all with a concurrency cap'),
+      'should include parallel-first concurrency guidance'
+    );
+    assert.ok(
+      promptBlock.includes('Multi-step is allowed for read-only bulk extraction'),
+      'should include explicit anti-pattern exception for read-only bulk extraction'
+    );
+    assert.ok(promptBlock.includes('peakConcurrentTasks'), 'should require peakConcurrentTasks telemetry');
+    assert.ok(promptBlock.includes('wallClockMs'), 'should require wallClockMs telemetry');
+    assert.ok(promptBlock.includes('sumTaskDurationsMs'), 'should require sumTaskDurationsMs telemetry');
+    assert.ok(promptBlock.includes('failures'), 'should require failures telemetry');
+    assert.ok(promptBlock.includes('retries'), 'should require retries telemetry');
+  });
+
   it('execute tool has code and optional timeout params', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),

From 1915a008ae9b6226e810cd453c0ebe623437d33d Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:42:10 +0530
Subject: [PATCH 056/192] docs(readme): collapse advanced MCP setup into
 details block

---
 README.md | 18 ++++--------------
 1 file changed, 4 insertions(+), 14 deletions(-)

diff --git a/README.md b/README.md
index a9a81ae..bf71c1b 100644
--- a/README.md
+++ b/README.md
@@ -113,7 +113,8 @@ browserforce serve
 
 If your agent browses to the page and responds with the title, you're all set.
 
-**MCP setup (advanced):**
+<details>
+<summary><b>MCP setup (advanced)</b></summary>
 
 **OpenClaw (MCP adapter)**
 
@@ -141,8 +142,6 @@ Add to `~/.openclaw/openclaw.json`:
 }
 ```
 
-
-
 **Claude Desktop**
 
 Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
@@ -158,8 +157,6 @@ Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 }
 ```
 
-
-
 **Claude Code**
 
 Add to `~/.claude/mcp.json`:
@@ -175,8 +172,6 @@ Add to `~/.claude/mcp.json`:
 }
 ```
 
-
-
 **Codex**
 
 Add to `~/.codex/config.toml`:
@@ -187,8 +182,6 @@ command = "npx"
 args = ["-y", "browserforce@latest", "mcp"]
 ```
 
-
-
 **Cursor**
 
 Add to `~/.cursor/mcp.json`:
@@ -204,8 +197,6 @@ Add to `~/.cursor/mcp.json`:
 }
 ```
 
-
-
 **Antigravity**
 
 In Antigravity: Agent panel -> `...` -> `Manage MCP Servers` -> `View raw config`.
@@ -222,14 +213,14 @@ Add the same `mcpServers` entry:
 }
 ```
 
-
-
 If MCP startup fails with `connection closed: initialize response`:
 
 1. Ensure args include `"mcp"` (without it, BrowserForce prints help and exits).
 2. If running from a local clone, install deps first: `pnpm install`.
 3. Validate the launch command manually: `npx -y browserforce@latest mcp`
 
+</details>
+
 ### CLI
 
 ```bash
@@ -707,4 +698,3 @@ jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.
 For practical debugging and operations flows, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
 
 > **Want the full walkthrough?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for a plain-English explanation of what this does and how to get started.
-

From edf2bb0b6afc49473f27c56bccdb88150dcbf2de Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:42:11 +0530
Subject: [PATCH 057/192] feat(mcp): retry when relay slot is busy instead of
 failing immediately

---
 mcp/src/exec-engine.js     | 37 +++++++++++++++++++++++++++++++++++++
 mcp/src/index.js           | 29 +++++++++++++++++++++++++++--
 mcp/test/mcp-tools.test.js | 10 ++++++++++
 3 files changed, 74 insertions(+), 2 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index c4f2e86..a2848c6 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -45,6 +45,43 @@ export function getRelayHttpUrl() {
   }
 }
 
+export function isCdpBusyError(err) {
+  const message = String(err?.message || '').toLowerCase();
+  return (
+    message.includes('409') ||
+    message.includes('slot busy') ||
+    message.includes('slot is busy') ||
+    message.includes('busy') ||
+    message.includes('already connected') ||
+    message.includes('already in use') ||
+    message.includes('another cdp client')
+  );
+}
+
+export async function waitForFreeClientSlot({ timeoutMs = 30000, baseUrl } = {}) {
+  const start = Date.now();
+  const resolvedBaseUrl = String(baseUrl || getRelayHttpUrl()).replace(/\/+$/, '');
+  const slotUrl = `${resolvedBaseUrl}/client-slot`;
+
+  while (Date.now() - start < timeoutMs) {
+    try {
+      const res = await fetch(slotUrl, { signal: AbortSignal.timeout(2000) });
+      if (res.ok) {
+        const data = await res.json();
+        if (data && data.busy === false) return true;
+      }
+    } catch { /* keep polling until timeout */ }
+
+    const elapsed = Date.now() - start;
+    const remaining = timeoutMs - elapsed;
+    if (remaining <= 0) break;
+    const jitteredDelayMs = 200 + Math.floor(Math.random() * 200);
+    await new Promise((r) => globalThis.setTimeout(r, Math.min(jitteredDelayMs, remaining)));
+  }
+
+  return false;
+}
+
 // ─── Auto-start relay ───────────────────────────────────────────────────────
 
 function getRelayPort() {
diff --git a/mcp/src/index.js b/mcp/src/index.js
index b3bc12d..c5ce9ab 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -7,7 +7,8 @@ import { StdioServerTransport } from '@modelcontextprotocol/sdk/server/stdio.js'
 import { z } from 'zod';
 import { chromium } from 'playwright-core';
 import {
-  getCdpUrl, ensureRelay, CodeExecutionTimeoutError, buildExecContext, runCode, formatResult,
+  getCdpUrl, getRelayHttpUrl, ensureRelay, isCdpBusyError, waitForFreeClientSlot,
+  CodeExecutionTimeoutError, buildExecContext, runCode, formatResult,
 } from './exec-engine.js';
 import { loadPlugins, buildPluginHelpers, buildPluginSkillAppendix } from './plugin-loader.js';
 import { checkForUpdate } from './update-check.js';
@@ -63,12 +64,36 @@ function ensureAllPagesCapture() {
 // ─── Browser Connection ──────────────────────────────────────────────────────
 
 let browser = null;
+const CONNECT_RETRY_TIMEOUT_MS = 30000;
 
 async function ensureBrowser() {
   if (browser?.isConnected()) return;
   await ensureRelay();
   const cdpUrl = getCdpUrl();
-  browser = await chromium.connectOverCDP(cdpUrl);
+  const baseUrl = getRelayHttpUrl();
+  const deadline = Date.now() + CONNECT_RETRY_TIMEOUT_MS;
+  let lastBusyError = null;
+
+  while (!browser && Date.now() < deadline) {
+    try {
+      browser = await chromium.connectOverCDP(cdpUrl);
+    } catch (err) {
+      if (!isCdpBusyError(err)) throw err;
+      lastBusyError = err;
+      const remainingMs = deadline - Date.now();
+      if (remainingMs <= 0) break;
+      const slotFreed = await waitForFreeClientSlot({
+        timeoutMs: remainingMs,
+        baseUrl,
+      });
+      if (!slotFreed) break;
+    }
+  }
+
+  if (!browser) {
+    throw lastBusyError || new Error('Failed to connect to CDP relay');
+  }
+
   browser.on('disconnected', () => {
     browser = null;
     contextListenerAttached = false;
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index fcefc4b..32a5ad8 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -544,3 +544,13 @@ describe('smartWaitForPageLoad', () => {
     assert.equal(expectedShape.timedOut, false);
   });
 });
+
+// ─── CDP Busy Helpers ───────────────────────────────────────────────────────
+
+describe('CDP Busy Helpers', () => {
+  it('detects relay slot contention errors', async () => {
+    const { isCdpBusyError } = await import('../src/exec-engine.js');
+    assert.equal(isCdpBusyError(new Error('Unexpected server response: 409')), true);
+    assert.equal(isCdpBusyError(new Error('ECONNREFUSED')), false);
+  });
+});

From c2d8011bb44cb986b943867f80c2107f45a761c3 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:47:47 +0530
Subject: [PATCH 058/192] test(mcp): cover busy retry path for CDP connection

---
 mcp/src/exec-engine.js     | 26 ++++++++++++++++++
 mcp/src/index.js           | 31 +++++-----------------
 mcp/test/mcp-tools.test.js | 54 ++++++++++++++++++++++++++++++++++++++
 3 files changed, 87 insertions(+), 24 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index a2848c6..b593861 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -82,6 +82,32 @@ export async function waitForFreeClientSlot({ timeoutMs = 30000, baseUrl } = {})
   return false;
 }
 
+export async function connectOverCdpWithBusyRetry({
+  connect,
+  cdpUrl,
+  baseUrl = getRelayHttpUrl(),
+  timeoutMs = 30000,
+  waitForFreeSlot = waitForFreeClientSlot,
+} = {}) {
+  const deadline = Date.now() + timeoutMs;
+  let lastBusyError = null;
+
+  while (Date.now() < deadline) {
+    try {
+      return await connect(cdpUrl);
+    } catch (err) {
+      if (!isCdpBusyError(err)) throw err;
+      lastBusyError = err;
+      const remainingMs = deadline - Date.now();
+      if (remainingMs <= 0) break;
+      const slotFreed = await waitForFreeSlot({ timeoutMs: remainingMs, baseUrl });
+      if (!slotFreed) break;
+    }
+  }
+
+  throw lastBusyError || new Error('Failed to connect to CDP relay');
+}
+
 // ─── Auto-start relay ───────────────────────────────────────────────────────
 
 function getRelayPort() {
diff --git a/mcp/src/index.js b/mcp/src/index.js
index c5ce9ab..8bd7fb1 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -7,7 +7,7 @@ import { StdioServerTransport } from '@modelcontextprotocol/sdk/server/stdio.js'
 import { z } from 'zod';
 import { chromium } from 'playwright-core';
 import {
-  getCdpUrl, getRelayHttpUrl, ensureRelay, isCdpBusyError, waitForFreeClientSlot,
+  getCdpUrl, getRelayHttpUrl, ensureRelay, connectOverCdpWithBusyRetry,
   CodeExecutionTimeoutError, buildExecContext, runCode, formatResult,
 } from './exec-engine.js';
 import { loadPlugins, buildPluginHelpers, buildPluginSkillAppendix } from './plugin-loader.js';
@@ -70,29 +70,12 @@ async function ensureBrowser() {
   if (browser?.isConnected()) return;
   await ensureRelay();
   const cdpUrl = getCdpUrl();
-  const baseUrl = getRelayHttpUrl();
-  const deadline = Date.now() + CONNECT_RETRY_TIMEOUT_MS;
-  let lastBusyError = null;
-
-  while (!browser && Date.now() < deadline) {
-    try {
-      browser = await chromium.connectOverCDP(cdpUrl);
-    } catch (err) {
-      if (!isCdpBusyError(err)) throw err;
-      lastBusyError = err;
-      const remainingMs = deadline - Date.now();
-      if (remainingMs <= 0) break;
-      const slotFreed = await waitForFreeClientSlot({
-        timeoutMs: remainingMs,
-        baseUrl,
-      });
-      if (!slotFreed) break;
-    }
-  }
-
-  if (!browser) {
-    throw lastBusyError || new Error('Failed to connect to CDP relay');
-  }
+  browser = await connectOverCdpWithBusyRetry({
+    connect: (url) => chromium.connectOverCDP(url),
+    cdpUrl,
+    baseUrl: getRelayHttpUrl(),
+    timeoutMs: CONNECT_RETRY_TIMEOUT_MS,
+  });
 
   browser.on('disconnected', () => {
     browser = null;
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 32a5ad8..ddf64e6 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -553,4 +553,58 @@ describe('CDP Busy Helpers', () => {
     assert.equal(isCdpBusyError(new Error('Unexpected server response: 409')), true);
     assert.equal(isCdpBusyError(new Error('ECONNREFUSED')), false);
   });
+
+  it('retries busy connect and succeeds after slot is free', async () => {
+    const { connectOverCdpWithBusyRetry } = await import('../src/exec-engine.js');
+
+    let connectCalls = 0;
+    const expectedBrowser = { connected: true };
+    const connect = async () => {
+      connectCalls += 1;
+      if (connectCalls === 1) {
+        throw new Error('Unexpected server response: 409');
+      }
+      return expectedBrowser;
+    };
+
+    let waitCalls = 0;
+    const waitForFreeSlot = async () => {
+      waitCalls += 1;
+      return true;
+    };
+
+    const browser = await connectOverCdpWithBusyRetry({
+      connect,
+      cdpUrl: 'ws://127.0.0.1:19222/cdp?token=test',
+      baseUrl: 'http://127.0.0.1:19222',
+      timeoutMs: 5000,
+      waitForFreeSlot,
+    });
+
+    assert.equal(browser, expectedBrowser);
+    assert.equal(connectCalls, 2);
+    assert.equal(waitCalls, 1);
+  });
+
+  it('does not retry non-busy connect errors', async () => {
+    const { connectOverCdpWithBusyRetry } = await import('../src/exec-engine.js');
+
+    let waitCalls = 0;
+    const error = new Error('ECONNREFUSED');
+
+    await assert.rejects(
+      () => connectOverCdpWithBusyRetry({
+        connect: async () => { throw error; },
+        cdpUrl: 'ws://127.0.0.1:19222/cdp?token=test',
+        timeoutMs: 5000,
+        waitForFreeSlot: async () => {
+          waitCalls += 1;
+          return true;
+        },
+      }),
+      /ECONNREFUSED/
+    );
+
+    assert.equal(waitCalls, 0);
+  });
 });

From 1d90a9830674d1d35a868d2d9d17a1d362cb89f6 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:49:09 +0530
Subject: [PATCH 059/192] Revert "docs(readme): collapse advanced MCP setup
 into details block"

This reverts commit 1915a008ae9b6226e810cd453c0ebe623437d33d.
---
 README.md | 18 ++++++++++++++----
 1 file changed, 14 insertions(+), 4 deletions(-)

diff --git a/README.md b/README.md
index bf71c1b..a9a81ae 100644
--- a/README.md
+++ b/README.md
@@ -113,8 +113,7 @@ browserforce serve
 
 If your agent browses to the page and responds with the title, you're all set.
 
-<details>
-<summary><b>MCP setup (advanced)</b></summary>
+**MCP setup (advanced):**
 
 **OpenClaw (MCP adapter)**
 
@@ -142,6 +141,8 @@ Add to `~/.openclaw/openclaw.json`:
 }
 ```
 
+
+
 **Claude Desktop**
 
 Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
@@ -157,6 +158,8 @@ Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 }
 ```
 
+
+
 **Claude Code**
 
 Add to `~/.claude/mcp.json`:
@@ -172,6 +175,8 @@ Add to `~/.claude/mcp.json`:
 }
 ```
 
+
+
 **Codex**
 
 Add to `~/.codex/config.toml`:
@@ -182,6 +187,8 @@ command = "npx"
 args = ["-y", "browserforce@latest", "mcp"]
 ```
 
+
+
 **Cursor**
 
 Add to `~/.cursor/mcp.json`:
@@ -197,6 +204,8 @@ Add to `~/.cursor/mcp.json`:
 }
 ```
 
+
+
 **Antigravity**
 
 In Antigravity: Agent panel -> `...` -> `Manage MCP Servers` -> `View raw config`.
@@ -213,14 +222,14 @@ Add the same `mcpServers` entry:
 }
 ```
 
+
+
 If MCP startup fails with `connection closed: initialize response`:
 
 1. Ensure args include `"mcp"` (without it, BrowserForce prints help and exits).
 2. If running from a local clone, install deps first: `pnpm install`.
 3. Validate the launch command manually: `npx -y browserforce@latest mcp`
 
-</details>
-
 ### CLI
 
 ```bash
@@ -698,3 +707,4 @@ jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.
 For practical debugging and operations flows, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
 
 > **Want the full walkthrough?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for a plain-English explanation of what this does and how to get started.
+

From 7d902cf0c492edcb77d8039d7495ee1e2332c60f Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:49:41 +0530
Subject: [PATCH 060/192] docs(readme): make advanced MCP providers
 individually collapsible

---
 README.md | 31 +++++++++++++++++++------------
 1 file changed, 19 insertions(+), 12 deletions(-)

diff --git a/README.md b/README.md
index a9a81ae..a9416ff 100644
--- a/README.md
+++ b/README.md
@@ -115,7 +115,8 @@ If your agent browses to the page and responds with the title, you're all set.
 
 **MCP setup (advanced):**
 
-**OpenClaw (MCP adapter)**
+<details>
+<summary><b>OpenClaw (MCP adapter)</b></summary>
 
 Add to `~/.openclaw/openclaw.json`:
 
@@ -141,9 +142,10 @@ Add to `~/.openclaw/openclaw.json`:
 }
 ```
 
+</details>
 
-
-**Claude Desktop**
+<details>
+<summary><b>Claude Desktop</b></summary>
 
 Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 
@@ -158,9 +160,10 @@ Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
 }
 ```
 
+</details>
 
-
-**Claude Code**
+<details>
+<summary><b>Claude Code</b></summary>
 
 Add to `~/.claude/mcp.json`:
 
@@ -175,9 +178,10 @@ Add to `~/.claude/mcp.json`:
 }
 ```
 
+</details>
 
-
-**Codex**
+<details>
+<summary><b>Codex</b></summary>
 
 Add to `~/.codex/config.toml`:
 
@@ -187,9 +191,10 @@ command = "npx"
 args = ["-y", "browserforce@latest", "mcp"]
 ```
 
+</details>
 
-
-**Cursor**
+<details>
+<summary><b>Cursor</b></summary>
 
 Add to `~/.cursor/mcp.json`:
 
@@ -204,9 +209,10 @@ Add to `~/.cursor/mcp.json`:
 }
 ```
 
+</details>
 
-
-**Antigravity**
+<details>
+<summary><b>Antigravity</b></summary>
 
 In Antigravity: Agent panel -> `...` -> `Manage MCP Servers` -> `View raw config`.
 Add the same `mcpServers` entry:
@@ -222,6 +228,8 @@ Add the same `mcpServers` entry:
 }
 ```
 
+</details>
+
 
 
 If MCP startup fails with `connection closed: initialize response`:
@@ -707,4 +715,3 @@ jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.
 For practical debugging and operations flows, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
 
 > **Want the full walkthrough?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for a plain-English explanation of what this does and how to get started.
-

From 00652e23f17348ae6a4010e6d0e24b5115e5778f Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:51:49 +0530
Subject: [PATCH 061/192] docs: add single-active arbitration mode and fallback
 behavior

---
 AGENTS.md | 18 ++++++++++++++++++
 README.md | 20 ++++++++++++++++++++
 2 files changed, 38 insertions(+)

diff --git a/AGENTS.md b/AGENTS.md
index 2a6a8c3..326958e 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -157,6 +157,18 @@ When a user clicks "Cancel" on Chrome's automation infobar, Chrome detaches the
 
 `RelayServer.start()` accepts `{ writeCdpUrl: false }` to prevent test instances from clobbering `~/.browserforce/cdp-url`. **All test `relay.start()` calls must pass `{ writeCdpUrl: false }`** or the production cdp-url file gets overwritten with random test ports.
 
+### Client Arbitration: BF_CLIENT_MODE
+
+`BF_CLIENT_MODE` controls agent-side CDP arbitration:
+- `single-active` (default): only one active `/cdp` client connection at a time.
+- `multi-client`: fallback mode that allows concurrent `/cdp` clients.
+
+In `single-active`, contention returns HTTP `409 Conflict` for additional `/cdp` connects while the slot is busy. Slot state is exposed at `GET /client-slot` (`mode`, `busy`, `activeClientId`, `connectedAt`).
+
+### MCP Standby Polling
+
+MCP handles `409`/busy connect errors by entering standby and polling `GET /client-slot` with short jittered intervals (~200-400ms), then reconnecting when `busy: false` (up to a 30s connect timeout).
+
 ## Security Rules
 
 - Relay binds to `127.0.0.1` ONLY. Never `0.0.0.0`.
@@ -165,6 +177,12 @@ When a user clicks "Cancel" on Chrome's automation infobar, Chrome detaches the
 - Token file permissions: `0o600` (owner read/write only).
 - Single extension slot. Second extension connection gets HTTP 409.
 
+## Operational Non-Goals
+
+- No new dependencies for client arbitration or standby behavior.
+- No per-tab ownership model; arbitration is one relay-level client slot.
+- No extension protocol changes for this feature area.
+
 ## Development Workflow
 
 ### Commands
diff --git a/README.md b/README.md
index 0df7634..fa77b02 100644
--- a/README.md
+++ b/README.md
@@ -594,11 +594,31 @@ RELAY_PORT=19333 browserforce serve
 }
 ```
 
+**Client arbitration mode (`BF_CLIENT_MODE`):**
+
+```bash
+# default: one active /cdp client at a time
+BF_CLIENT_MODE=single-active browserforce serve
+
+# fallback: allow concurrent /cdp clients
+BF_CLIENT_MODE=multi-client browserforce serve
+```
+
+In `single-active` mode, the relay enforces one active client slot. A second `/cdp` connection receives HTTP `409 Conflict` (busy). In `multi-client` mode, slot arbitration is disabled.
+
+**MCP standby polling (single-active mode):** if MCP sees a busy/`409` connect error, it enters standby and polls `GET /client-slot` until `busy: false` (about every 200-400ms, up to 30s), then retries connect.
+
+**Operational non-goals:**
+- No new dependencies for arbitration or standby logic.
+- No per-tab ownership complexity; arbitration is process-level client-slot control.
+- No extension protocol changes (no new extension↔relay message types).
+
 ## API
 
 | Endpoint | Description |
 |----------|-------------|
 | `GET /` | Health check (extension status, target count) |
+| `GET /client-slot` | Client-slot state: `{ mode, busy, activeClientId, connectedAt }` |
 | `GET /json/version` | CDP discovery |
 | `GET /json/list` | List attached targets |
 | `ws://.../extension` | Chrome extension WebSocket |

From 11da90cf783001ac8d4d48e3d42ca07991ceacbf Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 24 Feb 2026 21:56:34 +0530
Subject: [PATCH 062/192] docs: dedupe non-goals and point README to AGENTS

---
 README.md | 5 +----
 1 file changed, 1 insertion(+), 4 deletions(-)

diff --git a/README.md b/README.md
index fa77b02..0ffd10f 100644
--- a/README.md
+++ b/README.md
@@ -608,10 +608,7 @@ In `single-active` mode, the relay enforces one active client slot. A second `/c
 
 **MCP standby polling (single-active mode):** if MCP sees a busy/`409` connect error, it enters standby and polls `GET /client-slot` until `busy: false` (about every 200-400ms, up to 30s), then retries connect.
 
-**Operational non-goals:**
-- No new dependencies for arbitration or standby logic.
-- No per-tab ownership complexity; arbitration is process-level client-slot control.
-- No extension protocol changes (no new extension↔relay message types).
+**Operational non-goals:** canonical list is maintained in [AGENTS.md](AGENTS.md#operational-non-goals).
 
 ## API
 

From a87579df1f94cd9785896897390e40eb26c631dc Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 00:01:55 +0530
Subject: [PATCH 063/192] docs(guide): revise user guide to focus on advanced
 workflows and controlled tab management

---
 GUIDE.md  | 571 +++++++++++-------------------------------------------
 README.md |   2 +-
 2 files changed, 112 insertions(+), 461 deletions(-)

diff --git a/GUIDE.md b/GUIDE.md
index 990d484..26563ea 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -1,345 +1,95 @@
-# BrowserForce — User Guide
+# BrowserForce - Advanced Guide
 
-**BrowserForce // Parallel AI Agents in "your" Chrome!**
+This guide is an extension of README, not a duplicate.
 
-## What is this?
+Use README for onboarding and baseline usage:
+- Install and extension setup: [README Setup](README.md#setup)
+- Agent connection and MCP snippets: [README Connect Your Agent](README.md#connect-your-agent)
+- CLI commands: [README CLI](README.md#cli)
+- Core examples: [README Examples](README.md#examples)
+- Security model: [README Security](README.md#security)
 
-BrowserForce gives AI agents — like [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-compatible tool — access to **your real Chrome browser**. The one you're already logged into. No headless browser, no fake profiles. The AI sees your actual tabs and can interact with any website using your existing sessions.
-
-**Example:** You tell your agent "go to my Gmail and summarize my latest emails" — and it actually opens your Gmail (already logged in), reads the page, and gives you a summary. No passwords, no login flows.
-
-## What can it do?
-
-### Browse the web as you
-
-| Capability | What it means |
-|------------|---------------|
-| **See your tabs** | AI sees all your open Chrome tabs instantly |
-| **Navigate** | Open any URL in your real browser (with your cookies) |
-| **Open new tabs** | Create tabs that inherit all your sessions |
-| **Close tabs** | Clean up when done |
-
-### Interact with pages
-
-| Capability | What it means |
-|------------|---------------|
-| **Click** | Click buttons, links, menus — anything |
-| **Type** | Type text into any input, search box, or contenteditable field |
-| **Fill forms** | Fill input fields (clears existing value first) |
-| **Press keys** | Enter, Tab, Escape, Ctrl+C, any key combo |
-| **Scroll** | Scroll pages or specific elements |
-| **Hover** | Trigger hover menus and tooltips |
-| **Select dropdowns** | Pick options from `<select>` elements |
-
-### Observe and extract
-
-| Capability | What it means |
-|------------|---------------|
-| **Accessibility snapshots** | Read structured page content as text (fast, cheap — preferred over screenshots) |
-| **Screenshots** | Take a screenshot of any tab (viewport or full page) |
-| **Run JavaScript** | Execute any JS in the page context and get the result |
-| **Wait for elements** | Wait until a specific element appears, URL changes, or page loads |
-
-## How does it work?
-
-Three pieces work together:
-
-```
-  YOU tell AI:                AI sends commands:          Extension executes:
-  "check my Gmail"  →  [MCP/Playwright]  →  [Relay Server]  →  [Chrome Extension]  →  YOUR BROWSER
-                         (AI agent)         (localhost proxy)   (debugger bridge)      (real Chrome)
-```
-
-### Step by step
-
-1. **Chrome Extension** — Lives in your browser. Connects to the relay server. When asked, it attaches Chrome's built-in debugger to your tabs. This is how it can click, type, and screenshot — exactly like Chrome DevTools does.
-
-2. **Relay Server** — Runs on your computer (localhost only, never exposed to the internet). It's the middleman. It speaks CDP (Chrome DevTools Protocol) to the AI agent on one side, and WebSocket to the extension on the other. Think of it as a translator.
-
-3. **AI Agent** — Connects to the relay using standard tools (Playwright or MCP). It sees your browser tabs as controllable pages and can interact with them programmatically.
-
-### Why not just use a headless browser?
-
-| | Headless browser | BrowserForce |
-|---|---|---|
-| **Logged-in sessions** | No — starts fresh every time | Yes — uses YOUR cookies |
-| **2FA/captchas** | Blocked — can't pass them | Already passed (you did it) |
-| **Browser profile** | Empty/sandboxed | Your real profile |
-| **Extensions** | None | Your installed extensions |
-| **Bot detection** | Easily flagged | Runs in your real profile |
-
-## Quick Start
-
-### 1. Install
-
-```bash
-npm install -g browserforce
-```
-
-Or from source:
-
-```bash
-git clone https://github.com/ivalsaraj/browserforce.git
-cd browserforce
-pnpm install
-```
-
-### 2. Load the Chrome extension
-
-1. Open Chrome and go to `chrome://extensions/`
-2. Turn on **Developer mode** (toggle in top-right corner)
-3. Click **Load unpacked**
-4. Select the `extension/` folder from this project
-5. You'll see the extension icon in your toolbar (gray = disconnected)
-
-### 3. Done
-
-The relay auto-starts when you run any command or connect via MCP — no manual step needed. The extension icon turns green once connected.
-
-To run the relay manually (optional):
-
-```bash
-browserforce serve
-```
-
-### 4. Connect an AI agent
-
-**Option A: OpenClaw**
-
-Most OpenClaw users chat with their agent from Telegram or WhatsApp. Copy-paste this into your terminal:
-
-```bash
-npm install -g browserforce && npx -y skills add ivalsaraj/browserforce
-```
-
-This installs BrowserForce and teaches your OpenClaw agent how to use it. Then start the relay (keep it running):
-
-```bash
-browserforce serve
-```
-
-**Verify it works** — send this to your agent:
-
-> Go to https://example.com and tell me the page title
-
-If your agent browses to the page and responds with the title, you're all set. Your agent can now browse as you — no login flows, no captchas — even from WhatsApp or Telegram.
-
-<details>
-<summary><b>Alternative: MCP server</b> (advanced)</summary>
-
-If you prefer MCP over the skill, add to `~/.openclaw/openclaw.json`:
-
-```json
-{
-  "plugins": {
-    "entries": {
-      "mcp-adapter": {
-        "enabled": true,
-        "config": {
-          "servers": [
-            {
-              "name": "browserforce",
-              "transport": "stdio",
-              "command": "npx",
-              "args": ["-y", "browserforce", "mcp"]
-            }
-          ]
-        }
-      }
-    }
-  }
-}
-```
-
-</details>
-
-**Option B: Claude Desktop / Claude Code (via MCP)**
-
-Add to your Claude config:
-
-```json
-{
-  "mcpServers": {
-    "browserforce": {
-      "command": "npx",
-      "args": ["-y", "browserforce", "mcp"]
-    }
-  }
-}
-```
-
-**Option B.1: Codex (via MCP)**
-
-Add to `~/.codex/config.toml`:
-
-```toml
-[mcp_servers.browserforce]
-command = "npx"
-args = ["-y", "browserforce@latest", "mcp"]
-```
-
-**Option B.2: Cursor (via MCP)**
-
-Add to `~/.cursor/mcp.json`:
-
-```json
-{
-  "mcpServers": {
-    "browserforce": {
-      "command": "npx",
-      "args": ["-y", "browserforce@latest", "mcp"]
-    }
-  }
-}
-```
-
-If startup fails with `connection closed: initialize response`:
-
-1. Ensure args include `"mcp"` (without it, BrowserForce exits after printing help).
-2. If launching from a local clone, run `pnpm install` first.
-3. Verify manually: `npx -y browserforce@latest mcp`
-
-Then just talk to Claude: *"Open twitter.com and take a screenshot"*
-
-**Option C: Custom Playwright script**
-
-```javascript
-const { chromium } = require('playwright');
-
-const browser = await chromium.connectOverCDP(
-  'ws://127.0.0.1:19222/cdp?token=<YOUR_TOKEN>'
-);
-
-const pages = browser.contexts()[0].pages();
-console.log('Your open tabs:');
-for (const page of pages) {
-  console.log(' -', page.url());
-}
-```
+Use this guide for operator workflows, strict tab control, parallel extraction strategy, and production diagnostics.
 
 ## Controlled Tabs Playbook
 
-Use this section when you want strict control over what the agent can touch.
+Use this when you need hard boundaries on what an agent can touch.
 
-### 1) Manually Attach A Tab
+### 1) Manual attach (single trusted page)
 
-1. Open the exact tab you want the agent to use.
+1. Open the exact target tab.
 2. Click the BrowserForce extension icon.
-3. In the popup, click **+ Attach Current Tab**.
-4. Confirm it appears under **Controlled Tabs**.
+3. Click `+ Attach Current Tab`.
+4. Confirm it appears in `Controlled Tabs`.
 
-This is the fastest way to grant access to an already logged-in page without exposing other tabs.
+This is the safest default for logged-in or sensitive pages.
 
-### 2) Single-Tab Locked Workflow
+### 2) Locked single-tab workflow
 
-For high-safety tasks (admin pages, billing pages, production dashboards):
+For admin, billing, or production dashboards:
 
-1. Set **Mode** to `Manual`.
-2. Attach only one tab using **+ Attach Current Tab**.
-3. Enable **No new tabs**.
-4. Optionally enable **Lock URL** and **Read-only** depending on the task.
+1. Set `Mode = Manual`.
+2. Attach only one tab.
+3. Enable `No new tabs`.
+4. Optionally enable `Lock URL` and `Read-only`.
 
-Result: the agent is constrained to one attached tab and cannot open additional tabs.
+Result: one-tab sandbox with no lateral movement.
 
-### 3) Multi-Tab Controlled Workflow
+### 3) Controlled multi-tab workflow
 
-If the task needs a few trusted tabs:
+For tasks that need a small trusted set:
 
-1. Keep **Mode** on `Manual`.
-2. Switch to each required tab and click **+ Attach Current Tab**.
-3. Keep **No new tabs** on if you want to block any extra tab creation.
+1. Keep `Mode = Manual`.
+2. Attach each required tab explicitly.
+3. Keep `No new tabs` enabled to prevent expansion.
 
-Result: the agent can work only across the tabs you explicitly attached.
+Result: the agent can operate only in your approved set.
 
-### 4) Restriction Modes (How To Combine Them)
+### 4) Restriction presets
 
-- **Lock URL**: blocks navigation away from the current page (reload is still possible).
-- **No new tabs**: blocks agent-driven tab creation.
-- **Read-only**: blocks interaction methods (click/type/edit); useful for inspection-only runs.
+- Audit preset: `Manual + No new tabs + Read-only`
+- Form-test preset: `Manual + No new tabs`
+- Pinned-page preset: `Manual + Lock URL + No new tabs`
 
-Common presets:
+### 5) Cleanup and session hygiene
 
-- **Audit preset**: `Manual + No new tabs + Read-only`
-- **Form testing preset**: `Manual + No new tabs` (leave Read-only off)
-- **Pinned page preset**: `Manual + Lock URL + No new tabs`
+- `Auto-detach inactive tabs`: remove debugger from stale tabs (recommended `10-15 min`).
+- `Auto-close agent tabs`: close exploration tabs after runs.
 
-### 5) Auto-Cleanup After Use
+Use both in long-running sessions to limit drift and memory growth.
 
-- **Auto-detach inactive tabs**: detaches tabs after 5-60 minutes of inactivity.
-- **Auto-close agent tabs**: closes tabs created by the agent after 5-60 minutes.
+## BrowserForce Tab Swarms // Parallel Tabs Processing
 
-Recommended:
+This is the operating policy for independent read-only extraction at scale.
 
-- Use `10-15 min` auto-detach for normal sessions.
-- Use auto-close when running broad exploration tasks that open many tabs.
+### When to use a swarm
 
-## CLI
+Use parallel tabs when each task is independent:
+- count/list/extract over many pages
+- date sweeps
+- item matrices (SKU x store, company x source, domain x surface)
 
-Once installed globally (`npm install -g browserforce`), the CLI is available:
+Avoid swarms for stateful flows (checkout, purchases, sends, profile changes).
 
-```bash
-browserforce serve              # Start the relay server
-browserforce status             # Check relay and extension status
-browserforce tabs               # List open browser tabs
-browserforce snapshot [n]       # Accessibility tree of tab n (default: 0)
-browserforce screenshot [n]     # Screenshot tab n (default: 0, PNG to stdout)
-browserforce navigate <url>     # Open URL in a new tab
-browserforce -e "<code>"        # Run Playwright JavaScript (one-shot)
-```
+### Parallel-first policy
 
-Each `-e` command is one-shot — state does not persist between calls. For persistent state, use the MCP server.
+1. Start with `Promise.all` and concurrency cap `3-8` (default start: `5`).
+2. On `429`, anti-bot pages, or repeated timeouts: retry with reduced concurrency.
+3. If still unstable: fall back to sequential.
+4. Always return telemetry:
+- `peakConcurrentTasks`
+- `wallClockMs`
+- `sumTaskDurationsMs`
+- `failures`
+- `retries`
 
-**Examples:**
-
-```bash
-browserforce tabs
-browserforce -e "return await snapshot()"
-browserforce -e "await page.goto('https://github.com'); return await snapshot()"
-browserforce screenshot 0 > page.png
-browserforce navigate https://gmail.com
-```
-
-## MCP Tools Reference
-
-When connected via MCP (OpenClaw, Claude Desktop, Claude Code), the AI has two tools:
-
-| Tool | What it does |
-|------|-------------|
-| `execute` | Run Playwright JavaScript in your real Chrome. Access `page`, `context`, `state`, `snapshot()`, `waitForPageLoad()`, `getLogs()`, `screenshotWithAccessibilityLabels()`, `cleanHTML()`, `pageMarkdown()`, and Node.js globals. |
-| `reset` | Reconnect to the relay and clear state. Use when the connection drops. |
-
-### Diff-Aware Helpers
-
-Use `showDiffSinceLastCall` to control diff output vs full output in execute helper calls:
-
-```javascript
-await snapshot({ showDiffSinceLastCall: true });
-await snapshot({ showDiffSinceLastCall: false });
-await cleanHTML('body', { showDiffSinceLastCall: false });
-await pageMarkdown({ showDiffSinceLastCall: true });
-```
-
-Need concrete persona-based workflows? See [Actionable Use Cases](docs/USE_CASES.md).
-
-The `execute` tool gives the agent full Playwright access — it can navigate, click, type, screenshot, read accessibility trees, and run JavaScript in the page context. All within your real browser session.
-
-### BrowserForce Tab Swarms // Parallel Tabs Processing
-
-Use this for read-only count/list/extraction tasks where each target is independent (different pages, dates, or items).
-
-- Start parallel-first with `Promise.all` and a concurrency cap (`3-8`, usually start at `5`).
-- If you hit `429`, anti-bot pages, or repeated timeout failures, automatically retry with reduced concurrency.
-- If reduced concurrency still fails, fall back to sequential processing.
-- Return telemetry on every swarm run: `peakConcurrentTasks`, `wallClockMs`, `sumTaskDurationsMs`, `failures`, `retries`.
-
-Example execute pattern:
+### Minimal swarm template
 
 ```javascript
 const items = state.items ?? [];
 const startedAt = Date.now();
 let peakConcurrentTasks = 0;
 let sumTaskDurationsMs = 0;
-let failures = 0;
 let retries = 0;
 
 async function runTask(item, page) {
@@ -351,24 +101,23 @@ async function runTask(item, page) {
     return { ok: true, item, value };
   } catch (error) {
     const msg = String(error?.message || error);
-    const retryable = /429|timeout|captcha|challenge|blocked/i.test(msg);
-    return { ok: false, item, retryable, error: msg };
+    return { ok: false, item, retryable: /429|timeout|captcha|challenge|blocked/i.test(msg), error: msg };
   } finally {
     sumTaskDurationsMs += Date.now() - t0;
   }
 }
 
 async function runWithCap(targetItems, cap) {
-  const results = [];
+  const out = [];
   for (let i = 0; i < targetItems.length; i += cap) {
     const batch = targetItems.slice(i, i + cap);
     peakConcurrentTasks = Math.max(peakConcurrentTasks, batch.length);
-    const tabs = await Promise.all(batch.map(() => context.newPage()));
-    const batchResults = await Promise.all(batch.map((item, idx) => runTask(item, tabs[idx])));
-    await Promise.all(tabs.map((p) => p.close().catch(() => {})));
-    results.push(...batchResults);
+    const pages = await Promise.all(batch.map(() => context.newPage()));
+    const results = await Promise.all(batch.map((item, idx) => runTask(item, pages[idx])));
+    await Promise.all(pages.map((p) => p.close().catch(() => {})));
+    out.push(...results);
   }
-  return results;
+  return out;
 }
 
 let results = await runWithCap(items, 5);
@@ -376,205 +125,107 @@ let retryable = results.filter((r) => !r.ok && r.retryable).map((r) => r.item);
 
 if (retryable.length) {
   retries += 1;
-  const retried = await runWithCap(retryable, 2); // reduced concurrency fallback
-  const settled = new Map(results.filter((r) => r.ok).map((r) => [r.item.url, r]));
-  for (const r of retried) settled.set(r.item.url, r);
-  results = [...settled.values()];
+  const retried = await runWithCap(retryable, 2);
+  results = [...results.filter((r) => r.ok), ...retried];
   retryable = results.filter((r) => !r.ok && r.retryable).map((r) => r.item);
 }
 
 if (retryable.length) {
   retries += 1;
   for (const item of retryable) {
-    const tab = await context.newPage();
-    const r = await runTask(item, tab); // sequential fallback
-    await tab.close().catch(() => {});
-    results.push(r);
+    const p = await context.newPage();
+    results.push(await runTask(item, p));
+    await p.close().catch(() => {});
   }
 }
 
-failures = results.filter((r) => !r.ok).length;
 return {
   results,
   telemetry: {
     peakConcurrentTasks,
     wallClockMs: Date.now() - startedAt,
     sumTaskDurationsMs,
-    failures,
+    failures: results.filter((r) => !r.ok).length,
     retries,
   },
 };
 ```
 
-## Examples
-
-These prompts show how 10x users work with BrowserForce. The AI generates the code and handles the work — you just describe what you need.
-
-### Foundation: Simple, Single-Tab
-
-<details>
-<summary><b>Example 1: Read page content</b></summary>
-
-**Prompt to AI:**
-> Go to x.com/search and search for "browserforce". Show me the top 5 tweets you find.
+## MCP Execution Patterns
 
-**What the AI does:** Navigates to X, searches the term, extracts top tweets, returns them to you.
+Use this split to reduce flakiness and context bloat:
 
-**Use case:** Quick research, trend tracking, social listening.
-
-</details>
-
-<details>
-<summary><b>Example 2: Interact with a form</b></summary>
-
-**Prompt to AI:**
-> Go to GitHub and search for "ai agents". Show me the top 3 repositories and their star counts.
-
-**What the AI does:** Fills GitHub search, waits for results, extracts repo names + stars, returns them.
-
-**Use case:** Finding libraries, competitive research, project discovery.
-
-</details>
-
-### Multi-Tab Workflows: Coordination + State
-
-<details>
-<summary><b>Example 3: Search → Extract → Return</b></summary>
-
-**Prompt to AI:**
-> Search ProductHunt for "AI tools" and give me the top 5 products with their taglines and upvote counts.
-
-**What the AI does:** Navigates ProductHunt, searches, extracts product info, returns structured data.
-
-**Use case:** Market research, finding tools, competitive analysis.
-
-</details>
-
-<details>
-<summary><b>Example 4: Open result in new tab, process there</b></summary>
+- One execute call: one meaningful action plus verification.
+- Exception: multi-step is allowed for read-only independent bulk extraction (swarm runs).
+- Prefer `snapshot()` over screenshots for text/structure extraction.
+- Use `showDiffSinceLastCall: false` when you need full tree output.
+- Use `reset` on connection/page lifecycle failures, not for normal task errors.
 
-**Prompt to AI:**
-> Find the #1 product from your last ProductHunt search, click into it, and read the full description. Tell me what it does.
-
-**What the AI does:** Opens the product page from previous results, reads the description, summarizes it.
-
-**Use case:** Deep-dive research, understanding competitors, due diligence.
-
-</details>
-
-<details>
-<summary><b>Example 5: Debugging workflow (inspect + verify)</b></summary>
-
-**Prompt to AI:**
-> Go to my staging site at staging.myapp.com/checkout and take a labeled screenshot. Tell me if the "Complete Purchase" button is visible and what's around it.
-
-**What the AI does:** Navigates, takes screenshot with interactive labels, analyzes button state and layout.
-
-**Use case:** Visual debugging, QA checks, spotting broken elements.
-
-</details>
-
-<details>
-<summary><b>Example 6: Test form with data</b></summary>
-
-**Prompt to AI:**
-> Sign up for Substack using the email test.user@example.com. Tell me if the signup completes successfully.
-
-**What the AI does:** Fills the form, submits, waits for confirmation, reports success/failure.
-
-**Use case:** Testing sign-up flows, QA automation, form validation.
-
-</details>
-
-<details>
-<summary><b>Example 7: Content pipeline (search → extract → compare)</b></summary>
-
-**Prompt to AI:**
-> Search for "AI regulation" on both X.com and LinkedIn. Give me the top 5 trending posts from each and tell me which topics overlap.
-
-**What the AI does:** Searches both platforms, extracts posts, compares content, returns analysis.
-
-**Use case:** Multi-source research, trend analysis, market sentiment.
+## Examples
 
-</details>
+These are advanced examples that complement (not repeat) README examples.
 
 <details>
-<summary><b>Example 8: Data extraction → CSV pipeline</b></summary>
+<summary><b>Example A: Retail price swarm (SKU x store)</b></summary>
 
 **Prompt to AI:**
-> Go to Hacker News and extract the top 10 stories with their titles and vote counts. Format as CSV so I can import into a spreadsheet.
-
-**What the AI does:** Navigates HN, extracts story data, formats as CSV, returns it ready to paste.
+> For these 25 SKUs, check Amazon, Walmart, Target, and Best Buy in parallel tabs. Return best price, in-stock status, and fastest delivery ETA per SKU, with swarm telemetry.
 
-**Use case:** Data workflows, trend tracking, content curation.
+**Expected output:**
+- Per-SKU normalized comparison table
+- Cheapest source per SKU
+- Swarm telemetry block
 
 </details>
 
 <details>
-<summary><b>Example 9: A/B testing across variants</b></summary>
+<summary><b>Example B: Competitor launch radar (company x source)</b></summary>
 
 **Prompt to AI:**
-> Visit myapp.com/?variant=red and myapp.com/?variant=blue. Compare the two designs and tell me which button color is more prominent and what other differences exist.
+> For these 30 competitors, scan release notes, changelogs, docs, and blogs for the last 7 days. Group launches by category and include links.
 
-**What the AI does:** Opens both variants, compares layouts/colors/text, reports visual differences.
-
-**Use case:** Design QA, A/B testing, variant comparison.
+**Expected output:**
+- Deduped launch digest
+- Category breakdown
+- Swarm telemetry block
 
 </details>
 
 <details>
-<summary><b>Example 10: Monitor + alert workflow</b></summary>
+<summary><b>Example C: Security surface triage (domain x surface)</b></summary>
 
 **Prompt to AI:**
-> Check our status page at status.myapp.com every few minutes. Tell me the current status of the API and database. Alert me if anything changes from green to red.
-
-**What the AI does:** Monitors status page, reads indicators, alerts on degradation.
+> For these domains, inspect login pages, robots.txt, status pages, public docs, and likely staging links. Return prioritized findings with evidence links.
 
-**Use case:** Uptime monitoring, incident detection, SLA tracking.
+**Expected output:**
+- Risk-prioritized findings list
+- Evidence URLs
+- Swarm telemetry block
 
 </details>
 
-## Security
+Need broader persona workflows? See [Actionable Use Cases](docs/USE_CASES.md).
 
-- **Local only** — The relay server binds to `127.0.0.1`. Nothing is exposed to the network.
-- **Auth token** — Every connection requires a random token (auto-generated, stored locally).
-- **Your machine only** — The extension, relay, and agent all run on your computer.
-- **Visible** — Chrome shows a "controlled by automated test software" bar so you always know when automation is active.
-
-## Common Questions
-
-**Q: Can the AI see my passwords?**
-A: The AI can see whatever is visible on the page, just like a screenshot. It cannot access saved passwords in Chrome's password manager.
-
-**Q: Can someone else control my browser?**
-A: No. The relay only accepts connections from `127.0.0.1` (your own machine) with a secret token.
-
-**Q: Does it work with any AI?**
-A: Any AI that supports MCP (OpenClaw, Claude Desktop, Claude Code) or any tool that speaks CDP (Playwright, Puppeteer scripts).
-
-**Q: What happens if Chrome kills the extension?**
-A: Chrome aggressively kills MV3 extensions after 30 seconds of inactivity. The relay sends keepalive pings every 5 seconds to prevent this. If the extension does restart, it auto-reconnects.
-
-**Q: Can I control which tabs the AI accesses?**
-A: Yes. In Auto mode the agent can create and control its own tabs. In Manual mode, you explicitly attach tabs with **+ Attach Current Tab**. You can also lock URLs, block new tabs, or enable read-only mode.
-
-**Q: Does it work with multiple windows?**
-A: Yes. All tabs across all Chrome windows are visible.
-
-## Troubleshooting
+## Troubleshooting and Diagnostics
 
 | Problem | Fix |
-|---------|-----|
-| Extension icon stays gray | Is the relay running? Run `browserforce serve` |
-| "Another debugger is attached" | Close DevTools for that tab |
-| AI sees 0 pages | Open at least one regular webpage (not `chrome://`) |
-| Extension keeps disconnecting | Normal MV3 behavior — it auto-reconnects |
-| Port already in use | Run `lsof -ti:19222 \| xargs kill -9` to kill stale process |
+|---|---|
+| Extension stays gray | Start relay: `browserforce serve` |
+| `Another debugger is attached` | Close DevTools for that tab |
+| Agent sees 0 pages | Open at least one normal webpage (not `chrome://`) |
+| Frequent disconnections | MV3 worker churn is expected; relay keepalive should reconnect |
+| Port collision on `19222` | `lsof -ti:19222 | xargs kill -9` |
 
-CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay start). Summarize traffic by direction + method:
+CDP traffic log: `~/.browserforce/cdp.jsonl` (recreated each relay start).
+
+Summarize CDP traffic by direction + method:
 
 ```bash
 jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
 ```
 
-For incident/debug playbooks, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
+If MCP startup fails with `connection closed: initialize response`:
+
+1. Ensure args include `"mcp"`.
+2. If running from local clone, run `pnpm install`.
+3. Validate manually: `npx -y browserforce@latest mcp`.
diff --git a/README.md b/README.md
index 817d0dc..a1fc6bf 100644
--- a/README.md
+++ b/README.md
@@ -729,4 +729,4 @@ jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.
 
 For practical debugging and operations flows, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
 
-> **Want the full walkthrough?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for a plain-English explanation of what this does and how to get started.
+> **Need advanced operator playbooks?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for controlled-tab workflows, parallel swarm patterns, and production diagnostics.

From 2c01d8d46ebc57abcc57cc3e2fc0c75d755e8121 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 13:23:28 +0530
Subject: [PATCH 064/192] feat(relay): add extension-gated logs APIs and client
 labeling

---
 mcp/src/index.js                |  18 ++-
 relay/src/index.js              | 253 ++++++++++++++++++++++++++++++--
 relay/test/relay-server.test.js | 143 ++++++++++++++++++
 3 files changed, 397 insertions(+), 17 deletions(-)

diff --git a/mcp/src/index.js b/mcp/src/index.js
index b0ab274..ef6f256 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -66,17 +66,31 @@ function ensureAllPagesCapture() {
 let browser = null;
 const CONNECT_RETRY_TIMEOUT_MS = 30000;
 
+function withClientLabel(cdpUrl) {
+  try {
+    const url = new URL(cdpUrl);
+    if (!url.searchParams.get('label')) {
+      url.searchParams.set(
+        'label',
+        process.env.BROWSERFORCE_CDP_CLIENT_LABEL || 'browserforce-mcp',
+      );
+    }
+    return url.toString();
+  } catch {
+    return cdpUrl;
+  }
+}
+
 async function ensureBrowser() {
   if (browser?.isConnected()) return;
   await ensureRelay();
-  const cdpUrl = getCdpUrl();
+  const cdpUrl = withClientLabel(getCdpUrl());
   browser = await connectOverCdpWithBusyRetry({
     connect: (url) => chromium.connectOverCDP(url),
     cdpUrl,
     baseUrl: getRelayHttpUrl(),
     timeoutMs: CONNECT_RETRY_TIMEOUT_MS,
   });
-
   browser.on('disconnected', () => {
     browser = null;
     contextListenerAttached = false;
diff --git a/relay/src/index.js b/relay/src/index.js
index d2c8611..ae13a3f 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -11,6 +11,7 @@ const { createCdpLogger } = require('./cdp-log.js');
 const DEFAULT_PORT = 19222;
 const COMMAND_TIMEOUT_MS = 30000;
 const PING_INTERVAL_MS = 5000;
+const DEFAULT_CDP_LOG_BUFFER_LIMIT = 10000;
 
 const BF_DIR = path.join(os.homedir(), '.browserforce');
 const TOKEN_FILE = path.join(BF_DIR, 'auth-token');
@@ -25,6 +26,21 @@ function ts() { return new Date().toTimeString().slice(0, 8); }
 function log(...args) { console.log(`[${ts()}]`, ...args); }
 function logErr(...args) { console.error(`[${ts()}]`, ...args); }
 
+function resolvePositiveInt(value, fallback) {
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed <= 0) {
+    return fallback;
+  }
+  return Math.floor(parsed);
+}
+
+function sanitizeClientLabel(label) {
+  if (typeof label !== 'string') return null;
+  const cleaned = label.trim().replace(/[^\w .:@/-]/g, '');
+  if (!cleaned) return null;
+  return cleaned.slice(0, 80);
+}
+
 // ─── Token Persistence ──────────────────────────────────────────────────────
 
 function getOrCreateAuthToken() {
@@ -149,6 +165,8 @@ class RelayServer {
 
     // CDP clients
     this.clients = new Set();
+    this.clientMeta = new WeakMap();
+    this.clientById = new Map();
 
     // Target tracking
     this.targets = new Map();      // sessionId -> { tabId, targetId, targetInfo }
@@ -165,9 +183,23 @@ class RelayServer {
 
     // CDP traffic logger, initialized on start.
     this.cdpLogger = null;
+
+    // In-memory log buffer for options UI polling.
+    this.cdpLogEntries = [];
+    this.cdpLogSeq = 0;
+    this.cdpLogBufferLimit = resolvePositiveInt(
+      process.env.BROWSERFORCE_CDP_LOG_BUFFER_LIMIT,
+      DEFAULT_CDP_LOG_BUFFER_LIMIT,
+    );
+
+    this.startedAt = Date.now();
   }
 
   start({ writeCdpUrl = true } = {}) {
+    this.startedAt = Date.now();
+    this.cdpLogEntries = [];
+    this.cdpLogSeq = 0;
+    this.clientById.clear();
     try {
       this.cdpLogger = createCdpLogger();
     } catch (err) {
@@ -181,7 +213,7 @@ class RelayServer {
     this.cdpWss = new WebSocketServer({ noServer: true });
 
     server.on('upgrade', (req, socket, head) => this._handleUpgrade(req, socket, head));
-    this.extWss.on('connection', (ws) => this._onExtConnect(ws));
+    this.extWss.on('connection', (ws, req) => this._onExtConnect(ws, req));
     this.cdpWss.on('connection', (ws, req) => this._onCdpConnect(ws, req));
 
     this.server = server;
@@ -207,13 +239,29 @@ class RelayServer {
   }
 
   _logCdp(entry) {
+    const withClientLabel = { ...entry };
+    if (withClientLabel.clientId && !withClientLabel.clientLabel) {
+      const meta = this.clientById.get(withClientLabel.clientId);
+      if (meta?.label) {
+        withClientLabel.clientLabel = meta.label;
+      }
+    }
+
+    const withTimestamp = {
+      timestamp: new Date().toISOString(),
+      ...withClientLabel,
+    };
+    this.cdpLogSeq += 1;
+    const bufferedEntry = { seq: this.cdpLogSeq, ...withTimestamp };
+    this.cdpLogEntries.push(bufferedEntry);
+    if (this.cdpLogEntries.length > this.cdpLogBufferLimit) {
+      this.cdpLogEntries.shift();
+    }
+
     if (!this.cdpLogger || typeof this.cdpLogger.log !== 'function') {
       return;
     }
-    this.cdpLogger.log({
-      timestamp: new Date().toISOString(),
-      ...entry,
-    });
+    this.cdpLogger.log(withTimestamp);
   }
 
   // ─── HTTP ────────────────────────────────────────────────────────────────
@@ -281,6 +329,20 @@ class RelayServer {
       return;
     }
 
+    if (url.pathname === '/logs/status' && req.method === 'GET') {
+      if (!this._requireExtensionOrigin(req, res)) return;
+      res.end(JSON.stringify(this._logsStatus()));
+      return;
+    }
+
+    if (url.pathname === '/logs/cdp' && req.method === 'GET') {
+      if (!this._requireExtensionOrigin(req, res)) return;
+      const after = resolvePositiveInt(url.searchParams.get('after'), 0);
+      const limit = Math.min(resolvePositiveInt(url.searchParams.get('limit'), 300), 1000);
+      res.end(JSON.stringify(this._logsSlice({ after, limit })));
+      return;
+    }
+
     // ─── Plugin Routes ───────────────────────────────────────────────────────
 
     if (url.pathname === '/plugins' && req.method === 'GET') {
@@ -399,6 +461,140 @@ class RelayServer {
     return true;
   }
 
+  _extensionOriginFromReq(req) {
+    const origin = req?.headers?.origin || '';
+    if (!origin.startsWith('chrome-extension://')) {
+      return null;
+    }
+    return origin;
+  }
+
+  _deriveClientLabel(req) {
+    try {
+      const url = new URL(req.url, `http://${req.headers.host || '127.0.0.1'}`);
+      const fromQuery = sanitizeClientLabel(
+        url.searchParams.get('label')
+          || url.searchParams.get('clientLabel')
+          || url.searchParams.get('client')
+          || '',
+      );
+      if (fromQuery) return fromQuery;
+    } catch {
+      // Ignore malformed request URL; fall back to header-based label.
+    }
+
+    const origin = req?.headers?.origin || '';
+    if (origin.startsWith('chrome-extension://')) {
+      const extensionId = origin.replace('chrome-extension://', '');
+      return `extension:${extensionId.slice(0, 12)}`;
+    }
+
+    const ua = (req?.headers?.['user-agent'] || '').toLowerCase();
+    if (ua.includes('claude')) return 'claude-client';
+    if (ua.includes('openai')) return 'openai-client';
+    if (ua.includes('playwright')) return 'playwright-client';
+    if (ua.includes('node')) return 'node-client';
+
+    return 'cdp-client';
+  }
+
+  _requireExtensionOrigin(req, res) {
+    const origin = this._extensionOriginFromReq(req);
+    if (!origin) {
+      res.statusCode = 403;
+      res.end(JSON.stringify({ error: 'Forbidden — extension origin required' }));
+      return false;
+    }
+
+    const trustedOrigin = this.ext?.origin;
+    if (!trustedOrigin) {
+      res.statusCode = 503;
+      res.end(JSON.stringify({ error: 'Extension not connected' }));
+      return false;
+    }
+    if (trustedOrigin && origin !== trustedOrigin) {
+      res.statusCode = 403;
+      res.end(JSON.stringify({ error: 'Forbidden — extension origin mismatch' }));
+      return false;
+    }
+    return true;
+  }
+
+  _logsStatus() {
+    const clients = [];
+    for (const client of this.clients) {
+      const meta = this.clientMeta.get(client);
+      if (!meta) continue;
+      clients.push({
+        id: meta.id,
+        label: meta.label,
+        connectedAt: meta.connectedAt,
+        origin: meta.origin,
+        userAgent: meta.userAgent,
+        remoteAddress: meta.remoteAddress,
+      });
+    }
+
+    const counts = {
+      fromPlaywright: 0,
+      toPlaywright: 0,
+      fromExtension: 0,
+      toExtension: 0,
+    };
+    for (const entry of this.cdpLogEntries) {
+      if (entry.direction === 'from-playwright') counts.fromPlaywright += 1;
+      if (entry.direction === 'to-playwright') counts.toPlaywright += 1;
+      if (entry.direction === 'from-extension') counts.fromExtension += 1;
+      if (entry.direction === 'to-extension') counts.toExtension += 1;
+    }
+
+    return {
+      relay: {
+        connectedSince: new Date(this.startedAt).toISOString(),
+        uptimeMs: Date.now() - this.startedAt,
+      },
+      extension: this.ext
+        ? {
+            connected: true,
+            connectedAt: this.ext.connectedAt,
+            origin: this.ext.origin,
+            userAgent: this.ext.userAgent,
+            remoteAddress: this.ext.remoteAddress,
+          }
+        : { connected: false },
+      clients: {
+        count: this.clients.size,
+        items: clients,
+      },
+      targets: this.targets.size,
+      logs: {
+        entriesBuffered: this.cdpLogEntries.length,
+        latestSeq: this.cdpLogSeq,
+        directionCounts: counts,
+      },
+    };
+  }
+
+  _logsSlice({ after = 0, limit = 300 } = {}) {
+    const oldestSeq = this.cdpLogEntries.length > 0
+      ? this.cdpLogEntries[0].seq
+      : this.cdpLogSeq + 1;
+    const tooOld = after > 0 && after < oldestSeq - 1;
+
+    const newer = this.cdpLogEntries.filter((entry) => entry.seq > after);
+    const skipped = Math.max(0, newer.length - limit);
+    const entries = skipped > 0 ? newer.slice(skipped) : newer;
+
+    return {
+      after,
+      latestSeq: this.cdpLogSeq,
+      oldestSeq,
+      resetRequired: tooOld,
+      skipped,
+      entries,
+    };
+  }
+
   // ─── WebSocket Upgrade ───────────────────────────────────────────────────
 
   _handleUpgrade(req, socket, head) {
@@ -406,8 +602,8 @@ class RelayServer {
 
     if (url.pathname === '/extension') {
       // Validate origin
-      const origin = req.headers.origin || '';
-      if (!origin.startsWith('chrome-extension://')) {
+      const origin = this._extensionOriginFromReq(req);
+      if (!origin) {
         socket.write('HTTP/1.1 403 Forbidden\r\n\r\n');
         socket.destroy();
         return;
@@ -453,9 +649,16 @@ class RelayServer {
 
   // ─── Extension Connection ────────────────────────────────────────────────
 
-  _onExtConnect(ws) {
+  _onExtConnect(ws, req) {
     log('[relay] Extension connected');
-    this.ext = { ws };
+    const origin = this._extensionOriginFromReq(req);
+    this.ext = {
+      ws,
+      connectedAt: new Date().toISOString(),
+      origin: origin || null,
+      userAgent: req?.headers?.['user-agent'] || null,
+      remoteAddress: req?.socket?.remoteAddress || null,
+    };
 
     ws.on('message', (data) => {
       try {
@@ -673,7 +876,17 @@ class RelayServer {
       const now = Date.now();
       this.activeClient = { id: clientId, ws, connectedAt: now, lastSeenAt: now };
     }
-    log('[relay] CDP client connected');
+    const clientMeta = {
+      id: clientId,
+      label: this._deriveClientLabel(req),
+      connectedAt: new Date().toISOString(),
+      origin: req?.headers?.origin || null,
+      userAgent: req?.headers?.['user-agent'] || null,
+      remoteAddress: req?.socket?.remoteAddress || null,
+    };
+    this.clientMeta.set(ws, clientMeta);
+    this.clientById.set(clientId, clientMeta);
+    log(`[relay] CDP client connected (${clientId})`);
     this.clients.add(ws);
 
     ws.on('message', (data) => {
@@ -689,7 +902,9 @@ class RelayServer {
     });
 
     ws.on('close', () => {
-      log('[relay] CDP client disconnected');
+      const meta = this.clientMeta.get(ws);
+      log(`[relay] CDP client disconnected (${meta?.id || 'unknown'})`);
+      if (meta?.id) this.clientById.delete(meta.id);
       this.clients.delete(ws);
       if (this.activeClient?.ws === ws) {
         this.activeClient = null;
@@ -702,24 +917,27 @@ class RelayServer {
   }
 
   async _handleCdpClientMessage(ws, msg) {
+    const clientId = this.clientMeta.get(ws)?.id || null;
     const { id, method, params, sessionId } = msg;
     this._logCdp({
       direction: 'from-playwright',
+      clientId,
       message: { id, method, params, sessionId },
     });
 
     try {
       let result;
       if (sessionId) {
-        result = await this._forwardToTab(sessionId, method, params, id);
+        result = await this._forwardToTab(sessionId, method, params, id, clientId);
       } else {
-        result = await this._handleBrowserCommand(ws, id, method, params);
+        result = await this._handleBrowserCommand(ws, id, method, params, clientId);
       }
       if (result !== undefined) {
         const response = { id, result };
         if (sessionId) response.sessionId = sessionId;
         this._logCdp({
           direction: 'to-playwright',
+          clientId,
           message: response,
         });
         ws.send(JSON.stringify(response));
@@ -732,13 +950,14 @@ class RelayServer {
       if (sessionId) response.sessionId = sessionId;
       this._logCdp({
         direction: 'to-playwright',
+        clientId,
         message: response,
       });
       ws.send(JSON.stringify(response));
     }
   }
 
-  async _handleBrowserCommand(ws, msgId, method, params) {
+  async _handleBrowserCommand(ws, msgId, method, params, clientId) {
     switch (method) {
       case 'Browser.getVersion':
         return {
@@ -766,6 +985,7 @@ class RelayServer {
           };
           this._logCdp({
             direction: 'to-playwright',
+            clientId,
             message: event,
           });
           ws.send(JSON.stringify(event));
@@ -778,6 +998,7 @@ class RelayServer {
         // Respond immediately, then attach tabs asynchronously
         this._logCdp({
           direction: 'to-playwright',
+          clientId,
           message: { id: msgId, result: {} },
         });
         ws.send(JSON.stringify({ id: msgId, result: {} }));
@@ -993,7 +1214,7 @@ class RelayServer {
 
   // ─── CDP Command Forwarding ─────────────────────────────────────────────
 
-  async _forwardToTab(sessionId, method, params, id) {
+  async _forwardToTab(sessionId, method, params, id, clientId) {
     // Main session
     const target = this.targets.get(sessionId);
     if (target) {
@@ -1010,6 +1231,7 @@ class RelayServer {
       }
       this._logCdp({
         direction: 'to-extension',
+        clientId,
         message: {
           id,
           method,
@@ -1036,6 +1258,7 @@ class RelayServer {
       }
       this._logCdp({
         direction: 'to-extension',
+        clientId,
         message: {
           id,
           method,
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index ef3c581..3cab703 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -32,6 +32,32 @@ function httpGet(url) {
   });
 }
 
+/** HTTP GET with custom headers */
+function httpGetWithHeaders(url, headers = {}) {
+  return new Promise((resolve, reject) => {
+    const opts = new URL(url);
+    const req = http.request({
+      hostname: opts.hostname,
+      port: opts.port,
+      path: opts.pathname + opts.search,
+      method: 'GET',
+      headers,
+    }, (res) => {
+      let body = '';
+      res.on('data', (d) => (body += d));
+      res.on('end', () => {
+        try {
+          resolve({ status: res.statusCode, body: JSON.parse(body) });
+        } catch {
+          resolve({ status: res.statusCode, body });
+        }
+      });
+    });
+    req.on('error', reject);
+    req.end();
+  });
+}
+
 /** Connect a WebSocket and wait for open */
 function connectWs(url, options = {}) {
   return new Promise((resolve, reject) => {
@@ -175,6 +201,121 @@ describe('HTTP Endpoints', () => {
   });
 });
 
+// ─── Logs Viewer Endpoints ───────────────────────────────────────────────────
+
+describe('Logs Viewer Endpoints', () => {
+  let relay;
+  let port;
+
+  before(async () => {
+    port = getRandomPort();
+    relay = new RelayServer(port);
+    relay.start({ writeCdpUrl: false });
+    await sleep(200);
+  });
+
+  after(() => {
+    relay.stop();
+  });
+
+  it('GET /logs/status requires chrome-extension origin', async () => {
+    const { status, body } = await httpGet(`http://127.0.0.1:${port}/logs/status`);
+    assert.equal(status, 403);
+    assert.match(body.error, /extension origin required/);
+  });
+
+  it('GET /logs/cdp requires chrome-extension origin', async () => {
+    const { status, body } = await httpGet(`http://127.0.0.1:${port}/logs/cdp`);
+    assert.equal(status, 403);
+    assert.match(body.error, /extension origin required/);
+  });
+
+  it('GET /logs/status returns active client metadata and direction counters', async () => {
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') ext.send(JSON.stringify({ method: 'pong' }));
+    });
+
+    const cdp = await connectWs(`ws://127.0.0.1:${port}/cdp?token=${relay.authToken}`);
+    cdp.send(JSON.stringify({ id: 1, method: 'Browser.getVersion' }));
+    await readMessage(cdp, 3000);
+
+    const { status, body } = await httpGetWithHeaders(`http://127.0.0.1:${port}/logs/status`, {
+      Origin: 'chrome-extension://test',
+    });
+    assert.equal(status, 200);
+    assert.equal(body.clients.count, 1);
+    assert.ok(Array.isArray(body.clients.items));
+    assert.equal(body.clients.items.length, 1);
+    assert.match(body.clients.items[0].id, /^bf-(cdp|client)-\d+$/);
+    assert.ok(body.clients.items[0].label, 'client label should be present');
+    assert.ok(body.logs.directionCounts.fromPlaywright >= 1);
+    assert.ok(body.logs.directionCounts.toPlaywright >= 1);
+
+    cdp.close();
+    ext.close();
+    await sleep(50);
+  });
+
+  it('GET /logs/cdp supports incremental polling with after/limit', async () => {
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') ext.send(JSON.stringify({ method: 'pong' }));
+    });
+
+    const cdp = await connectWs(`ws://127.0.0.1:${port}/cdp?token=${relay.authToken}`);
+    cdp.send(JSON.stringify({ id: 10, method: 'Browser.getVersion' }));
+    await readMessage(cdp, 3000);
+
+    const first = await httpGetWithHeaders(`http://127.0.0.1:${port}/logs/cdp?after=0&limit=200`, {
+      Origin: 'chrome-extension://test',
+    });
+    assert.equal(first.status, 200);
+    assert.ok(Array.isArray(first.body.entries));
+    assert.ok(first.body.entries.length > 0);
+    const newestSeq = first.body.latestSeq;
+    const hasBrowserGetVersion = first.body.entries.some((entry) => entry.message?.method === 'Browser.getVersion');
+    assert.equal(hasBrowserGetVersion, true, 'Should include Browser.getVersion CDP entry');
+
+    const second = await httpGetWithHeaders(`http://127.0.0.1:${port}/logs/cdp?after=${newestSeq}&limit=200`, {
+      Origin: 'chrome-extension://test',
+    });
+    assert.equal(second.status, 200);
+    assert.equal(second.body.entries.length, 0);
+    assert.equal(second.body.after, newestSeq);
+    assert.equal(second.body.resetRequired, false);
+
+    cdp.close();
+    ext.close();
+    await sleep(50);
+  });
+
+  it('GET /logs/status rejects extension origins that do not match connected extension', async () => {
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') ext.send(JSON.stringify({ method: 'pong' }));
+    });
+
+    const { status, body } = await httpGetWithHeaders(`http://127.0.0.1:${port}/logs/status`, {
+      Origin: 'chrome-extension://other',
+    });
+    assert.equal(status, 403);
+    assert.match(body.error, /origin mismatch/);
+
+    ext.close();
+    await sleep(50);
+  });
+});
+
 // ─── Plugin Endpoints ────────────────────────────────────────────────────────
 
 describe('Plugin API Endpoints', () => {
@@ -1226,6 +1367,7 @@ describe('CDP JSONL Logging', () => {
       const entries = readJsonlEntries(logFilePath);
       const directions = new Set(entries.map((entry) => entry.direction));
       const methods = entries.map((entry) => entry?.message?.method).filter(Boolean);
+      const labeledClientEntry = entries.find((entry) => entry.clientId);
 
       assert.ok(directions.has('from-playwright'), 'Should log from-playwright direction');
       assert.ok(directions.has('to-extension'), 'Should log to-extension direction');
@@ -1233,6 +1375,7 @@ describe('CDP JSONL Logging', () => {
       assert.ok(directions.has('to-playwright'), 'Should log to-playwright direction');
       assert.ok(methods.includes('Runtime.evaluate'), 'Should log Runtime.evaluate method');
       assert.ok(methods.includes('Page.loadEventFired'), 'Should log Page.loadEventFired method');
+      assert.ok(labeledClientEntry?.clientLabel, 'Client-labeled entries should include clientLabel');
     } finally {
       cdp?.close();
       ext?.close();

From 6152fa502e0cf1df75f5587917ba076bb63878a2 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 13:23:31 +0530
Subject: [PATCH 065/192] feat(extension): add dedicated options-page log
 viewer

---
 extension/manifest.json |   5 +
 extension/options.css   | 230 ++++++++++++++++++++++++++++++++++++
 extension/options.html  |  80 +++++++++++++
 extension/options.js    | 250 ++++++++++++++++++++++++++++++++++++++++
 extension/popup.css     |  20 ++++
 extension/popup.html    |   2 +
 extension/popup.js      |   5 +
 7 files changed, 592 insertions(+)
 create mode 100644 extension/options.css
 create mode 100644 extension/options.html
 create mode 100644 extension/options.js

diff --git a/extension/manifest.json b/extension/manifest.json
index 4234c97..6240fc6 100644
--- a/extension/manifest.json
+++ b/extension/manifest.json
@@ -10,9 +10,14 @@
     "storage",
     "alarms"
   ],
+  "host_permissions": [
+    "http://127.0.0.1/*",
+    "http://localhost/*"
+  ],
   "background": {
     "service_worker": "background.js"
   },
+  "options_page": "options.html",
   "icons": {
     "16": "icons/icon16.png",
     "48": "icons/icon48.png",
diff --git a/extension/options.css b/extension/options.css
new file mode 100644
index 0000000..b0a3721
--- /dev/null
+++ b/extension/options.css
@@ -0,0 +1,230 @@
+* {
+  box-sizing: border-box;
+}
+
+body {
+  margin: 0;
+  font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, sans-serif;
+  background: #f5f7fb;
+  color: #1c2333;
+}
+
+.layout {
+  max-width: 1200px;
+  margin: 0 auto;
+  padding: 20px;
+}
+
+.topbar {
+  display: flex;
+  align-items: flex-start;
+  justify-content: space-between;
+  gap: 16px;
+  margin-bottom: 16px;
+}
+
+h1 {
+  margin: 0;
+  font-size: 24px;
+}
+
+h2 {
+  margin: 0 0 10px;
+  font-size: 14px;
+  text-transform: uppercase;
+  letter-spacing: 0.04em;
+  color: #4c5770;
+}
+
+.subtitle {
+  margin: 6px 0 0;
+  font-size: 13px;
+  color: #5f6d8a;
+}
+
+.controls {
+  display: flex;
+  gap: 8px;
+}
+
+button {
+  border: 1px solid #2f6dff;
+  border-radius: 8px;
+  background: #2f6dff;
+  color: #fff;
+  height: 36px;
+  padding: 0 14px;
+  font-size: 13px;
+  cursor: pointer;
+}
+
+button:hover {
+  background: #1f5cff;
+}
+
+button.ghost {
+  border-color: #c5ccda;
+  background: #fff;
+  color: #27314a;
+}
+
+button.ghost:hover {
+  background: #f2f4f9;
+}
+
+.cards {
+  display: grid;
+  grid-template-columns: repeat(3, minmax(0, 1fr));
+  gap: 12px;
+  margin-bottom: 12px;
+}
+
+.card {
+  border: 1px solid #d7ddea;
+  border-radius: 10px;
+  padding: 12px;
+  background: #fff;
+  min-height: 90px;
+}
+
+.card p {
+  margin: 0 0 6px;
+  font-size: 13px;
+}
+
+.client-list {
+  margin: 0;
+  padding-left: 16px;
+  max-height: 96px;
+  overflow: auto;
+}
+
+.client-list li {
+  font-size: 12px;
+  margin-bottom: 4px;
+}
+
+.notes {
+  display: flex;
+  gap: 6px;
+  padding: 10px 12px;
+  border: 1px solid #d7ddea;
+  border-radius: 10px;
+  background: #fff;
+  margin-bottom: 12px;
+  font-size: 13px;
+}
+
+.error {
+  margin: 0 0 12px;
+  border: 1px solid #f2bcc1;
+  background: #fff5f6;
+  color: #8a1d2f;
+  padding: 10px 12px;
+  border-radius: 8px;
+  font-size: 13px;
+}
+
+.logs-panel,
+.details-panel {
+  border: 1px solid #d7ddea;
+  border-radius: 10px;
+  background: #fff;
+  margin-bottom: 12px;
+}
+
+.logs-header {
+  display: flex;
+  align-items: center;
+  justify-content: space-between;
+  padding: 12px;
+  border-bottom: 1px solid #e6ebf5;
+}
+
+.logs-header h2 {
+  margin: 0;
+}
+
+.table-wrap {
+  max-height: 480px;
+  overflow: auto;
+}
+
+table {
+  width: 100%;
+  border-collapse: collapse;
+}
+
+th,
+td {
+  font-size: 12px;
+  text-align: left;
+  padding: 8px 10px;
+  border-bottom: 1px solid #edf1f8;
+  vertical-align: top;
+}
+
+th {
+  position: sticky;
+  top: 0;
+  background: #f8faff;
+  z-index: 1;
+  color: #4c5770;
+  font-weight: 600;
+}
+
+tr.clickable {
+  cursor: pointer;
+}
+
+tr.clickable:hover {
+  background: #f7f9ff;
+}
+
+tr.active {
+  background: #eef3ff;
+}
+
+.empty {
+  color: #74809b;
+  text-align: center;
+}
+
+.details-panel {
+  padding: 12px;
+}
+
+pre {
+  margin: 0;
+  max-height: 280px;
+  overflow: auto;
+  font-size: 12px;
+  line-height: 1.45;
+  white-space: pre-wrap;
+  word-break: break-word;
+}
+
+.mono {
+  font-family: ui-monospace, SFMono-Regular, Menlo, Monaco, Consolas, 'Liberation Mono', monospace;
+}
+
+@media (max-width: 960px) {
+  .cards {
+    grid-template-columns: 1fr;
+  }
+
+  .topbar {
+    flex-direction: column;
+    align-items: stretch;
+  }
+
+  .controls {
+    width: 100%;
+    flex-wrap: wrap;
+  }
+
+  button {
+    flex: 1;
+    min-width: 110px;
+  }
+}
diff --git a/extension/options.html b/extension/options.html
new file mode 100644
index 0000000..9232520
--- /dev/null
+++ b/extension/options.html
@@ -0,0 +1,80 @@
+<!DOCTYPE html>
+<html lang="en">
+<head>
+  <meta charset="UTF-8">
+  <meta name="viewport" content="width=device-width, initial-scale=1.0">
+  <title>BrowserForce Logs</title>
+  <link rel="stylesheet" href="options.css">
+</head>
+<body>
+  <main class="layout">
+    <header class="topbar">
+      <div>
+        <h1>BrowserForce Logs</h1>
+        <p class="subtitle">Live CDP traffic from relay, polled every second while this page is visible.</p>
+      </div>
+      <div class="controls">
+        <button id="bf-refresh" type="button">Refresh</button>
+        <button id="bf-pause" type="button">Pause</button>
+        <button id="bf-clear" type="button" class="ghost">Clear View</button>
+      </div>
+    </header>
+
+    <section class="cards">
+      <article class="card">
+        <h2>Relay</h2>
+        <p id="bf-relay-url" class="mono">-</p>
+        <p id="bf-relay-health">-</p>
+      </article>
+      <article class="card">
+        <h2>Connections</h2>
+        <p id="bf-conn-summary">-</p>
+        <ul id="bf-clients" class="client-list"></ul>
+      </article>
+      <article class="card">
+        <h2>Log Stats</h2>
+        <p id="bf-log-summary">-</p>
+        <p id="bf-last-updated">-</p>
+      </article>
+    </section>
+
+    <section class="notes">
+      <strong>Entry fields:</strong>
+      <span><code>seq</code>, <code>timestamp</code>, <code>direction</code>, optional <code>clientId</code>/<code>clientLabel</code>, and nested <code>message</code> payload.</span>
+    </section>
+
+    <p id="bf-error" class="error" hidden></p>
+
+    <section class="logs-panel">
+      <div class="logs-header">
+        <h2>CDP Entries</h2>
+        <span id="bf-entry-count">0 entries</span>
+      </div>
+      <div class="table-wrap">
+        <table>
+          <thead>
+            <tr>
+              <th>Seq</th>
+              <th>Time</th>
+              <th>Direction</th>
+              <th>Client</th>
+              <th>Method</th>
+              <th>Session</th>
+            </tr>
+          </thead>
+          <tbody id="bf-log-rows">
+            <tr><td colspan="6" class="empty">No logs yet.</td></tr>
+          </tbody>
+        </table>
+      </div>
+    </section>
+
+    <section class="details-panel">
+      <h2>Selected Entry</h2>
+      <pre id="bf-entry-details" class="mono">Select a row to inspect full JSON payload.</pre>
+    </section>
+  </main>
+
+  <script src="options.js"></script>
+</body>
+</html>
diff --git a/extension/options.js b/extension/options.js
new file mode 100644
index 0000000..4a2688c
--- /dev/null
+++ b/extension/options.js
@@ -0,0 +1,250 @@
+const RELAY_URL_DEFAULT = 'ws://127.0.0.1:19222/extension';
+const POLL_INTERVAL_MS = 1000;
+const MAX_RENDERED_ENTRIES = 10000;
+
+const relayUrlEl = document.getElementById('bf-relay-url');
+const relayHealthEl = document.getElementById('bf-relay-health');
+const connSummaryEl = document.getElementById('bf-conn-summary');
+const clientsEl = document.getElementById('bf-clients');
+const logSummaryEl = document.getElementById('bf-log-summary');
+const lastUpdatedEl = document.getElementById('bf-last-updated');
+const errorEl = document.getElementById('bf-error');
+const entryCountEl = document.getElementById('bf-entry-count');
+const rowsEl = document.getElementById('bf-log-rows');
+const detailsEl = document.getElementById('bf-entry-details');
+const refreshBtn = document.getElementById('bf-refresh');
+const pauseBtn = document.getElementById('bf-pause');
+const clearBtn = document.getElementById('bf-clear');
+
+const state = {
+  relayWsUrl: RELAY_URL_DEFAULT,
+  relayHttpBase: wsToHttpBase(RELAY_URL_DEFAULT),
+  timer: null,
+  inFlight: false,
+  paused: false,
+  lastSeq: 0,
+  entries: [],
+  selectedSeq: null,
+};
+
+chrome.storage.local.get(['relayUrl'], (stored) => {
+  const relayUrl = stored.relayUrl || RELAY_URL_DEFAULT;
+  state.relayWsUrl = relayUrl;
+  state.relayHttpBase = wsToHttpBase(relayUrl);
+  relayUrlEl.textContent = state.relayHttpBase;
+  pollOnce();
+});
+
+chrome.storage.onChanged.addListener((changes) => {
+  if (!changes.relayUrl) return;
+  const nextRelay = changes.relayUrl.newValue || RELAY_URL_DEFAULT;
+  state.relayWsUrl = nextRelay;
+  state.relayHttpBase = wsToHttpBase(nextRelay);
+  relayUrlEl.textContent = state.relayHttpBase;
+  state.lastSeq = 0;
+  state.entries = [];
+  state.selectedSeq = null;
+  renderEntries();
+  pollOnce();
+});
+
+refreshBtn.addEventListener('click', () => {
+  pollOnce();
+});
+
+pauseBtn.addEventListener('click', () => {
+  state.paused = !state.paused;
+  pauseBtn.textContent = state.paused ? 'Resume' : 'Pause';
+  if (state.paused) {
+    stopPolling();
+  } else if (!document.hidden) {
+    startPolling();
+    pollOnce();
+  }
+});
+
+clearBtn.addEventListener('click', () => {
+  state.entries = [];
+  state.selectedSeq = null;
+  renderEntries();
+  detailsEl.textContent = 'Select a row to inspect full JSON payload.';
+});
+
+document.addEventListener('visibilitychange', () => {
+  if (document.hidden) {
+    stopPolling();
+    return;
+  }
+  if (!state.paused) {
+    startPolling();
+    pollOnce();
+  }
+});
+
+window.addEventListener('beforeunload', () => {
+  stopPolling();
+});
+
+relayUrlEl.textContent = state.relayHttpBase;
+startPolling();
+pollOnce();
+
+function wsToHttpBase(wsUrl) {
+  try {
+    const parsed = new URL(wsUrl);
+    const protocol = parsed.protocol === 'wss:' ? 'https:' : 'http:';
+    return `${protocol}//${parsed.host}`;
+  } catch {
+    return 'http://127.0.0.1:19222';
+  }
+}
+
+function startPolling() {
+  if (state.timer || state.paused) return;
+  state.timer = setInterval(() => {
+    if (state.inFlight) return;
+    pollOnce();
+  }, POLL_INTERVAL_MS);
+}
+
+function stopPolling() {
+  if (!state.timer) return;
+  clearInterval(state.timer);
+  state.timer = null;
+}
+
+async function pollOnce() {
+  if (state.inFlight) return;
+  state.inFlight = true;
+
+  try {
+    const [status, logs] = await Promise.all([
+      fetchJson('/logs/status'),
+      fetchJson(`/logs/cdp?after=${state.lastSeq}&limit=500`),
+    ]);
+
+    if (logs.resetRequired) {
+      state.entries = [];
+      state.selectedSeq = null;
+      detailsEl.textContent = 'Log buffer rotated. Showing current buffered entries.';
+    }
+
+    if (Array.isArray(logs.entries) && logs.entries.length > 0) {
+      state.entries.push(...logs.entries);
+      if (state.entries.length > MAX_RENDERED_ENTRIES) {
+        state.entries.splice(0, state.entries.length - MAX_RENDERED_ENTRIES);
+      }
+    }
+
+    state.lastSeq = logs.latestSeq || state.lastSeq;
+    renderStatus(status);
+    renderEntries();
+    setError('');
+  } catch (err) {
+    setError(err.message || String(err));
+  } finally {
+    state.inFlight = false;
+  }
+}
+
+async function fetchJson(pathname) {
+  const response = await fetch(`${state.relayHttpBase}${pathname}`, {
+    method: 'GET',
+    cache: 'no-store',
+  });
+
+  if (!response.ok) {
+    const body = await response.text();
+    throw new Error(`Request failed (${response.status}): ${body || response.statusText}`);
+  }
+
+  return response.json();
+}
+
+function renderStatus(status) {
+  const ext = status.extension?.connected
+    ? `Extension connected (${status.extension.origin || 'unknown origin'})`
+    : 'Extension disconnected';
+  relayHealthEl.textContent = `${ext} • targets: ${status.targets}`;
+
+  connSummaryEl.textContent = `${status.clients.count} active CDP client(s)`;
+  clientsEl.innerHTML = '';
+  const clients = status.clients.items || [];
+  if (clients.length === 0) {
+    const li = document.createElement('li');
+    li.textContent = 'No active clients.';
+    clientsEl.appendChild(li);
+  } else {
+    for (const client of clients) {
+      const li = document.createElement('li');
+      const origin = client.origin || 'no origin';
+      const label = client.label || 'unlabeled';
+      li.textContent = `${label} (${client.id}) • ${origin}`;
+      clientsEl.appendChild(li);
+    }
+  }
+
+  const counts = status.logs.directionCounts;
+  logSummaryEl.textContent = `from-playwright ${counts.fromPlaywright} • to-playwright ${counts.toPlaywright} • from-extension ${counts.fromExtension} • to-extension ${counts.toExtension}`;
+  lastUpdatedEl.textContent = `Updated: ${new Date().toLocaleTimeString()}`;
+}
+
+function renderEntries() {
+  entryCountEl.textContent = `${state.entries.length} entries`;
+
+  if (state.entries.length === 0) {
+    rowsEl.innerHTML = '<tr><td colspan="6" class="empty">No logs yet.</td></tr>';
+    return;
+  }
+
+  rowsEl.innerHTML = '';
+  for (const entry of state.entries) {
+    const row = document.createElement('tr');
+    row.className = 'clickable';
+    if (state.selectedSeq === entry.seq) row.classList.add('active');
+
+    const method = entry.message?.method || 'response';
+    const sessionId = entry.message?.sessionId || '';
+    const time = formatTime(entry.timestamp);
+
+    row.innerHTML = [
+      `<td class="mono">${entry.seq}</td>`,
+      `<td class="mono">${time}</td>`,
+      `<td>${entry.direction}</td>`,
+      `<td class="mono">${escapeHtml(entry.clientLabel || entry.clientId || '-')}</td>`,
+      `<td class="mono">${escapeHtml(method)}</td>`,
+      `<td class="mono">${escapeHtml(sessionId)}</td>`,
+    ].join('');
+
+    row.addEventListener('click', () => {
+      state.selectedSeq = entry.seq;
+      detailsEl.textContent = JSON.stringify(entry, null, 2);
+      renderEntries();
+    });
+
+    rowsEl.appendChild(row);
+  }
+}
+
+function formatTime(iso) {
+  if (!iso) return '-';
+  const date = new Date(iso);
+  if (Number.isNaN(date.getTime())) return iso;
+  return date.toLocaleTimeString();
+}
+
+function setError(message) {
+  if (!message) {
+    errorEl.hidden = true;
+    errorEl.textContent = '';
+    return;
+  }
+  errorEl.hidden = false;
+  errorEl.textContent = message;
+}
+
+function escapeHtml(value) {
+  const div = document.createElement('div');
+  div.textContent = String(value || '');
+  return div.innerHTML;
+}
diff --git a/extension/popup.css b/extension/popup.css
index d31b4e8..512f6f3 100644
--- a/extension/popup.css
+++ b/extension/popup.css
@@ -237,6 +237,26 @@ button:active { background: #388e3c; }
 .attach-btn:hover { background: #eee; border-color: #aaa; }
 .attach-btn:active { background: #e0e0e0; }
 
+.logs-btn {
+  width: 100%;
+  margin-top: 8px;
+  padding: 9px;
+  background: #fff;
+  color: #333;
+  font-size: 12px;
+  font-weight: 500;
+  border: 1px solid #ddd;
+  border-radius: 6px;
+}
+
+.logs-btn:hover {
+  background: #f7f7f7;
+}
+
+.logs-btn:active {
+  background: #efefef;
+}
+
 /* Settings groups */
 .settings-group {
   border: 1px solid #eee;
diff --git a/extension/popup.html b/extension/popup.html
index ae19f9b..7f23d5c 100644
--- a/extension/popup.html
+++ b/extension/popup.html
@@ -38,6 +38,8 @@ <h1>BrowserForce</h1>
       </section>
 
       <button id="bf-attach-tab" class="attach-btn">+ Attach Current Tab</button>
+
+      <button id="bf-open-logs" class="logs-btn">View Full Logs</button>
     </div>
 
     <!-- Settings Tab -->
diff --git a/extension/popup.js b/extension/popup.js
index bf06c78..f6bc646 100644
--- a/extension/popup.js
+++ b/extension/popup.js
@@ -19,6 +19,7 @@ const tabCountEl = document.getElementById('bf-tab-count');
 const tabsListEl = document.getElementById('bf-tabs-list');
 const autoTimerEl = document.getElementById('bf-auto-timer');
 const attachBtn = document.getElementById('bf-attach-tab');
+const openLogsBtn = document.getElementById('bf-open-logs');
 const modeSelect = document.getElementById('bf-mode');
 const lockUrlCb = document.getElementById('bf-lock-url');
 const noNewTabsCb = document.getElementById('bf-no-new-tabs');
@@ -147,6 +148,10 @@ attachBtn.addEventListener('click', () => {
   });
 });
 
+openLogsBtn.addEventListener('click', () => {
+  chrome.runtime.openOptionsPage();
+});
+
 // --- Status Polling ---
 
 function refreshStatus() {

From a461f6a6444c278610aa62e3f5fdd1a0abe4de99 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 13:23:37 +0530
Subject: [PATCH 066/192] docs: document full logs viewer and logs endpoints

---
 GUIDE.md  | 18 ++++++++++--------
 README.md |  9 +++++++--
 2 files changed, 17 insertions(+), 10 deletions(-)

diff --git a/GUIDE.md b/GUIDE.md
index 26563ea..f87bf45 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -19,8 +19,9 @@ Use this when you need hard boundaries on what an agent can touch.
 
 1. Open the exact target tab.
 2. Click the BrowserForce extension icon.
-3. Click `+ Attach Current Tab`.
-4. Confirm it appears in `Controlled Tabs`.
+3. In the popup, click **+ Attach Current Tab**.
+4. Confirm it appears under **Controlled Tabs**.
+5. For full CDP traffic, click **View Full Logs** to open the dedicated logs page.
 
 This is the safest default for logged-in or sensitive pages.
 
@@ -209,12 +210,13 @@ Need broader persona workflows? See [Actionable Use Cases](docs/USE_CASES.md).
 ## Troubleshooting and Diagnostics
 
 | Problem | Fix |
-|---|---|
-| Extension stays gray | Start relay: `browserforce serve` |
-| `Another debugger is attached` | Close DevTools for that tab |
-| Agent sees 0 pages | Open at least one normal webpage (not `chrome://`) |
-| Frequent disconnections | MV3 worker churn is expected; relay keepalive should reconnect |
-| Port collision on `19222` | `lsof -ti:19222 | xargs kill -9` |
+|---------|-----|
+| Extension icon stays gray | Is the relay running? Run `browserforce serve` |
+| "Another debugger is attached" | Close DevTools for that tab |
+| AI sees 0 pages | Open at least one regular webpage (not `chrome://`) |
+| Extension keeps disconnecting | Normal MV3 behavior — it auto-reconnects |
+| Port already in use | Run `lsof -ti:19222 \| xargs kill -9` to kill stale process |
+| Need full traffic visibility | Popup → **View Full Logs** (polls relay logs while page is open) |
 
 CDP traffic log: `~/.browserforce/cdp.jsonl` (recreated each relay start).
 
diff --git a/README.md b/README.md
index a1fc6bf..abb4504 100644
--- a/README.md
+++ b/README.md
@@ -641,6 +641,7 @@ Click the extension icon to configure restrictions. Your browser, your rules:
 ### Controlled Tab Workflows
 
 - **Manually attach a tab:** Open the tab you want, click the extension popup, then click **+ Attach Current Tab**.
+- **Open full log viewer:** In the popup, click **View Full Logs** to open the extension options page.
 - **Restrict to one controlled tab:** Use **Manual mode**, attach one tab, and enable **No new tabs**.
 - **Allow multiple controlled tabs:** Stay in **Manual mode** and attach each tab you want the agent to access.
 - **Restriction modes:** Use **Lock URL** (no navigation), **No new tabs**, and **Read-only** (observe only) together or separately.
@@ -706,8 +707,12 @@ In `single-active` mode, the relay enforces one active client slot. A second `/c
 | `GET /client-slot`       | Client-slot state: `{ mode, busy, activeClientId, connectedAt }` |
 | `GET /json/version`      | CDP discovery                                 |
 | `GET /json/list`         | List attached targets                         |
-| `ws://.../extension`     | Chrome extension WebSocket                    |
-| `ws://.../cdp?token=...` | Agent CDP connection                          |
+| `GET /logs/status` | Logs viewer status (extension-only origin) |
+| `GET /logs/cdp?after=&limit=` | Incremental CDP log polling feed (extension-only origin) |
+| `ws://.../extension` | Chrome extension WebSocket |
+| `ws://.../cdp?token=...` | Agent CDP connection |
+
+Tip: add `&label=<name>` to the CDP URL to tag client connections in the logs viewer (MCP defaults to `browserforce-mcp`).
 
 ## Troubleshooting
 

From 773b4b3fba083fd980f851f428d97f39560f0d68 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 15:04:57 +0530
Subject: [PATCH 067/192] docs: add troubleshooting section for MCP errors in
 README

---
 README.md | 49 +++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 49 insertions(+)

diff --git a/README.md b/README.md
index abb4504..4794103 100644
--- a/README.md
+++ b/README.md
@@ -726,6 +726,55 @@ Tip: add `&label=<name>` to the CDP URL to tag client connections in the logs vi
 | Port in use                  | `lsof -ti:19222 | xargs kill -9`                      |
 
 
+### MCP Error: `Protocol error (Target.createTarget): Extension not connected`
+
+This usually means MCP is talking to a relay process that is not the one your extension is connected to.
+
+Quick checks:
+
+```bash
+curl -s http://127.0.0.1:19222/ | jq
+cat ~/.browserforce/cdp-url
+echo "${BF_CDP_URL:-<unset>}"
+```
+
+If MCP is reading a stale `~/.browserforce/cdp-url` (or `BF_CDP_URL` override), it may connect to the wrong port.
+
+Recovery steps (npm/npx workflow):
+
+```bash
+# 1) Stop stale relay listeners
+lsof -tiTCP:19222 -sTCP:LISTEN | xargs kill -9 2>/dev/null || true
+lsof -tiTCP:19888 -sTCP:LISTEN | xargs kill -9 2>/dev/null || true
+
+# 2) Clear stale override for this shell
+unset BF_CDP_URL
+
+# 3) Start one fresh relay on default port
+RELAY_PORT=19222 npx -y browserforce@latest serve
+```
+
+Then verify:
+
+```bash
+cat ~/.browserforce/cdp-url
+curl -s http://127.0.0.1:19222/client-slot | jq
+```
+
+Expected: `cdp-url` points to `ws://127.0.0.1:19222/...` and `/client-slot` returns `{ mode, busy, activeClientId, connectedAt }`.
+
+### MCP Error: `Unexpected server response: 409`
+
+This means single-active arbitration is working and another CDP client is currently holding the slot.
+
+Check:
+
+```bash
+curl -s http://127.0.0.1:19222/client-slot | jq
+```
+
+If `busy: true`, close the other MCP/CDP session or set `BF_CLIENT_MODE=multi-client` for explicit concurrent-client fallback.
+
 CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay start). Summarize traffic by direction + method:
 
 ```bash

From 1657a518149a0c14844f18fbe3dcef01885cd400 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 17:26:57 +0530
Subject: [PATCH 068/192] feat(mcp): make startup non-blocking with standby
 reconnect loop

---
 README.frontpage.md | 893 ++++++++++++++++++++++++++++++++++++++++++++
 README.md           |  33 ++
 mcp/src/index.js    | 107 ++++--
 3 files changed, 1005 insertions(+), 28 deletions(-)
 create mode 100644 README.frontpage.md

diff --git a/README.frontpage.md b/README.frontpage.md
new file mode 100644
index 0000000..4aed4f7
--- /dev/null
+++ b/README.frontpage.md
@@ -0,0 +1,893 @@
+# BrowserForce // Parallel AI Agents in "your" Browser!
+
+Give AI agents controlled access to the browser you already use.
+
+> "a lion doesn't concern itself with token counting" — [@steipete](https://x.com/steipete), creator of [OpenClaw](https://github.com/openclaw/openclaw)
+>
+> "a 10x user doesn't concern itself with sandboxed browsers // sandboxes are for kids" — BrowserForce, your friendly neighborhood power source.
+
+**You're giving an AI your real Chrome — your logins, cookies, and sessions. That takes conviction.**
+BrowserForce is built for people who use the best models and don't look back.
+
+**Autonomous when you want it, controlled when you need it.**
+Run hands-off in Auto mode, or switch to Manual mode and explicitly attach only the tabs you trust.
+
+Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, Codex, Cursor, or any MCP-compatible agent.
+
+## Why BrowserForce
+
+|                | Playwright MCP       | OpenClaw Browser        | Playwriter              | Claude Extension     | BrowserForce                         |
+| -------------- | -------------------- | ----------------------- | ----------------------- | -------------------- | ------------------------------------ |
+| Browser        | Spawns new Chrome    | Separate profile        | Your Chrome             | Your Chrome          | **Your Chrome**                      |
+| Login state    | Fresh                | Fresh (isolated)        | Yours                   | Yours                | **Yours**                            |
+| Tab access     | N/A (new browser)    | Managed by agent        | Click each tab          | Click each tab       | **Auto mode + manual attached tabs** |
+| Autonomous     | Yes                  | Yes                     | No (manual click)       | No (manual click)    | **Yes (fully autonomous)**           |
+| Context method | Screenshots (100KB+) | Screenshots + snapshots | A11y snapshots (5-20KB) | Screenshots (100KB+) | **A11y snapshots (5-20KB)**          |
+| Tools          | Many dedicated       | 1 `browser` tool        | 1 `execute` tool        | Built-in             | **2 tools: `execute`, `reset`**      |
+| Agent support  | Any MCP client       | OpenClaw only           | Any MCP client          | Claude only          | **Any MCP client**                   |
+| Playwright API | Partial              | No                      | Full                    | No                   | **Full**                             |
+
+## 60-Second Start (MCP-First)
+
+1. Install:
+
+```bash
+npm install -g browserforce
+```
+
+2. Install extension files:
+
+```bash
+browserforce install-extension
+```
+
+3. Load extension in `chrome://extensions` -> Developer mode -> Load unpacked -> use path printed by command.
+
+4. Start relay:
+
+```bash
+browserforce serve
+```
+
+5. In your MCP client config, run BrowserForce via npm:
+
+```json
+{
+  "command": "npx",
+  "args": ["-y", "browserforce@latest", "mcp"]
+}
+```
+
+## Critical Reliability Notes (for MCP users)
+
+- MCP reads CDP URL from `~/.browserforce/cdp-url` unless `BF_CDP_URL` is set.
+- If multiple relay processes run on different ports, MCP may connect to the wrong relay.
+- Single-active mode is default (`BF_CLIENT_MODE=single-active`): a second client can get `409` while slot is busy.
+
+Quick checks:
+
+```bash
+cat ~/.browserforce/cdp-url
+curl -s http://127.0.0.1:19222/ | jq
+curl -s http://127.0.0.1:19222/client-slot | jq
+echo "${BF_CDP_URL:-<unset>}"
+```
+
+If you hit `Protocol error (Target.createTarget): Extension not connected`:
+
+```bash
+lsof -tiTCP:19222 -sTCP:LISTEN | xargs kill -9 2>/dev/null || true
+lsof -tiTCP:19888 -sTCP:LISTEN | xargs kill -9 2>/dev/null || true
+unset BF_CDP_URL
+RELAY_PORT=19222 npx -y browserforce@latest serve
+```
+
+## Security in Plain English
+
+- Relay binds to `127.0.0.1` only.
+- Extension origin is validated (`chrome-extension://...`).
+- CDP uses auth token in URL query.
+- Token file permissions are owner-only.
+- You can lock URLs, block navigation, and run read-only workflows.
+
+## What This Draft Changes
+
+This file is a **front-page candidate** optimized for:
+- Primary audience: OpenClaw users actively running MCP browser workflows.
+- Secondary audience: developers doing deep debugging.
+
+To avoid dropping anything, the complete current README is preserved below as a collapsed appendix.
+
+---
+
+<details>
+<summary><b>Full Current README (Preserved, Unchanged Snapshot)</b></summary>
+
+# BrowserForce // Parallel AI Agents in "your" Browser!
+
+Give AI agents controlled access to the browser you already use.
+
+> "a lion doesn't concern itself with token counting" — [@steipete](https://x.com/steipete), creator of [OpenClaw](https://github.com/openclaw/openclaw)
+>
+> "a 10x user doesn't concern itself with sandboxed browsers // sandboxes are for kids" — BrowserForce, your friendly neighborhood power source.
+
+**You're giving an AI your real Chrome — your logins, cookies, and sessions. That takes conviction.** BrowserForce is built for people who use the best models and don't look back. Security is built in: lock URLs, block navigation, read-only mode, auto-cleanup — you stay in control.
+
+**Autonomous when you want it, controlled when you need it.** Your agent can run hands-off in Auto mode, or you can switch to Manual mode and explicitly attach only the tabs you trust. BrowserForce connects to **your running browser** with one Chrome extension and full Playwright API support.
+
+Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-compatible agent.
+
+## Comparison
+
+
+|                | Playwright MCP       | OpenClaw Browser        | Playwriter              | Claude Extension     | BrowserForce                         |
+| -------------- | -------------------- | ----------------------- | ----------------------- | -------------------- | ------------------------------------ |
+| Browser        | Spawns new Chrome    | Separate profile        | Your Chrome             | Your Chrome          | **Your Chrome**                      |
+| Login state    | Fresh                | Fresh (isolated)        | Yours                   | Yours                | **Yours**                            |
+| Tab access     | N/A (new browser)    | Managed by agent        | Click each tab          | Click each tab       | **Auto mode + manual attached tabs** |
+| Autonomous     | Yes                  | Yes                     | No (manual click)       | No (manual click)    | **Yes (fully autonomous)**           |
+| Context method | Screenshots (100KB+) | Screenshots + snapshots | A11y snapshots (5-20KB) | Screenshots (100KB+) | **A11y snapshots (5-20KB)**          |
+| Tools          | Many dedicated       | 1 `browser` tool        | 1 `execute` tool        | Built-in             | **2 tools: `execute`, `reset`**      |
+| Agent support  | Any MCP client       | OpenClaw only           | Any MCP client          | Claude only          | **Any MCP client**                   |
+| Playwright API | Partial              | No                      | Full                    | No                   | **Full**                             |
+
+
+## Your Credentials Stay Yours
+
+Every other approach asks you to hand over something: an API key, an OAuth token, stored passwords, session cookies in a config file. BrowserForce asks for none of it.
+
+**Why?** Because you're already logged in. BrowserForce talks to your running Chrome — it doesn't extract credentials, store cookies, or replay tokens. The browser handles auth exactly as it always has. Your agent inherits your sessions the same way a new Chrome tab does.
+
+What you never need to provide:
+
+- No passwords
+- No API keys
+- No OAuth tokens
+- No session cookies in env vars or config files
+
+It's a security win *and* a setup win — there are no secrets to rotate, leak, or manage. Your logins live in Chrome. They stay in Chrome.
+
+## Setup
+
+### 1. Install
+
+```bash
+npm install -g browserforce
+```
+
+Or from source:
+
+```bash
+git clone https://github.com/ivalsaraj/browserforce.git
+cd browserforce
+pnpm install
+```
+
+### 2. Load the Chrome extension
+
+**If you installed via npm:**
+
+1. Run: `browserforce install-extension` — note the path it prints (e.g. `/Users/you/.browserforce/extension`)
+2. Open `chrome://extensions/` in Chrome
+3. Enable **Developer mode** (top-right toggle)
+4. Click **Load unpacked** → a file picker opens
+  - **macOS**: press `Cmd+Shift+G`, paste the path from step 1, press Enter
+  - **Windows/Linux**: paste the path directly into the address bar of the dialog
+
+❗ After every BrowserForce update, re-run `browserforce install-extension`, then reload the extension in `chrome://extensions/` (click the ↺ icon next to BrowserForce).
+
+**If you cloned the repo:**
+
+1. Open `chrome://extensions/` in Chrome
+2. Enable **Developer mode** (top-right toggle)
+3. Click **Load unpacked** → select the `extension/` folder
+
+After loading, the extension icon appears in your toolbar (gray = disconnected).
+
+### 3. Done
+
+The relay auto-starts when you run any command or connect via MCP — no manual step needed. Extension icon turns green once connected.
+
+To run the relay manually (optional):
+
+```bash
+browserforce serve
+```
+
+## Connect Your Agent
+
+### OpenClaw
+
+Most OpenClaw users chat with their agent from Telegram or WhatsApp. BrowserForce lets your agent browse the web as you — no login flows, no captchas — even from a messaging app.
+
+**Quick setup** (copy-paste into your terminal):
+
+```bash
+npm install -g browserforce && browserforce install-extension && npx -y skills add ivalsaraj/browserforce
+```
+
+Then start the relay (keep this running):
+
+```bash
+browserforce serve
+```
+
+**Verify it works** — send this to your agent:
+
+> Go to [https://x.com](https://x.com) and give me top tweets
+
+If your agent browses to the page and responds with the title, you're all set.
+
+**MCP setup (advanced):**
+
+<details>
+<summary><b>OpenClaw (MCP adapter)</b></summary>
+
+Add to `~/.openclaw/openclaw.json`:
+
+```json
+{
+  "plugins": {
+    "entries": {
+      "mcp-adapter": {
+        "enabled": true,
+        "config": {
+          "servers": [
+            {
+              "name": "browserforce",
+              "transport": "stdio",
+              "command": "npx",
+              "args": ["-y", "browserforce@latest", "mcp"]
+            }
+          ]
+        }
+      }
+    }
+  }
+}
+```
+
+</details>
+
+<details>
+<summary><b>Claude Desktop</b></summary>
+
+Add to `~/Library/Application Support/Claude/claude_desktop_config.json`:
+
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "npx",
+      "args": ["-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
+
+</details>
+
+<details>
+<summary><b>Claude Code</b></summary>
+
+Add to `~/.claude/mcp.json`:
+
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "npx",
+      "args": ["-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
+
+</details>
+
+<details>
+<summary><b>Codex</b></summary>
+
+Add to `~/.codex/config.toml`:
+
+```toml
+[mcp_servers.browserforce]
+command = "npx"
+args = ["-y", "browserforce@latest", "mcp"]
+```
+
+</details>
+
+<details>
+<summary><b>Cursor</b></summary>
+
+Add to `~/.cursor/mcp.json`:
+
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "npx",
+      "args": ["-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
+
+</details>
+
+<details>
+<summary><b>Antigravity</b></summary>
+
+In Antigravity: Agent panel -> `...` -> `Manage MCP Servers` -> `View raw config`.
+Add the same `mcpServers` entry:
+
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "npx",
+      "args": ["-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
+
+</details>
+
+
+
+If MCP startup fails with `connection closed: initialize response`:
+
+1. Ensure args include `"mcp"` (without it, BrowserForce prints help and exits).
+2. If running from a local clone, install deps first: `pnpm install`.
+3. Validate the launch command manually: `npx -y browserforce@latest mcp`
+
+### CLI
+
+```bash
+npm install -g browserforce   # or: pnpm add -g browserforce
+```
+
+```bash
+browserforce serve              # Start the relay server
+browserforce status             # Check relay and extension status
+browserforce tabs               # List open browser tabs
+browserforce snapshot [n]       # Accessibility tree of tab n
+browserforce screenshot [n]     # Screenshot tab n (PNG to stdout)
+browserforce navigate <url>     # Open URL in a new tab
+browserforce -e "<code>"        # Run Playwright JavaScript (one-shot)
+browserforce plugin list        # List installed plugins
+browserforce plugin install <n> # Install a plugin from the registry
+browserforce plugin remove <n>  # Remove an installed plugin
+browserforce update             # Update to the latest version
+browserforce install-extension  # Copy extension to ~/.browserforce/extension/
+```
+
+Each `-e` command is one-shot — state does not persist between calls. For persistent state, use the MCP server.
+
+## Plugins
+
+Plugins add custom helpers directly into the `execute` tool scope. Install once — your agent calls them like built-in functions.
+
+### Install a plugin
+
+```bash
+browserforce plugin install highlight
+```
+
+That's it. Restart MCP (or Claude Desktop) and `highlight()` is available in every `execute` call.
+
+### Official plugins
+
+
+| Plugin      | What it adds                                                                                   | Install                                 |
+| ----------- | ---------------------------------------------------------------------------------------------- | --------------------------------------- |
+| `highlight` | `highlight(selector, color?)` — outlines matching elements; `clearHighlights()` — removes them | `browserforce plugin install highlight` |
+
+
+### Use an installed plugin
+
+After installing `highlight`, your agent can call it directly:
+
+```javascript
+// Outline all buttons in blue
+await highlight('button', 'blue');
+
+// Highlight the specific element you're about to click
+await highlight('[data-testid="submit"]', 'red');
+return await screenshotWithAccessibilityLabels();
+```
+
+The helper receives the active page, context, and state automatically — no plumbing needed.
+
+### Manage plugins
+
+```bash
+browserforce plugin list        # See what's installed
+browserforce plugin remove highlight   # Uninstall
+```
+
+Plugins are stored at `~/.browserforce/plugins/`. Each one is a folder with an `index.js`.
+
+### Write your own
+
+```javascript
+// ~/.browserforce/plugins/my-plugin/index.js
+export default {
+  name: 'my-plugin',
+  helpers: {
+    async scrollToBottom(page, ctx, state) {
+      await page.evaluate(() => window.scrollTo(0, document.body.scrollHeight));
+    },
+    async countLinks(page, ctx, state) {
+      return page.evaluate(() => document.querySelectorAll('a').length);
+    },
+  },
+};
+```
+
+Drop it in `~/.browserforce/plugins/my-plugin/`, restart MCP, and call `await scrollToBottom()` or `await countLinks()` from any `execute` call.
+
+Add a `SKILL.md` file alongside `index.js` and its content is automatically appended to the `execute` tool's description — so your agent knows the helpers exist without you having to explain them every time.
+
+### Any Playwright Script
+
+```javascript
+const { chromium } = require('playwright');
+
+const browser = await chromium.connectOverCDP(
+  'ws://127.0.0.1:19222/cdp?token=<TOKEN>'
+);
+
+const pages = browser.contexts()[0].pages();
+for (const page of pages) {
+  console.log(page.url());  // your real tabs!
+}
+
+// Gmail is already logged in
+const gmail = pages.find(p => p.url().includes('mail.google'));
+await gmail.screenshot({ path: 'gmail.png' });
+```
+
+No token config needed for MCP — the server reads it automatically from `~/.browserforce/cdp-url`.
+
+## What Your Agent Can Do
+
+Once connected, your agent has full Playwright access to your real browser:
+
+```javascript
+// Navigate (uses your cookies — no login needed)
+await page.goto('https://github.com');
+await waitForPageLoad();
+
+// Read pages with accessibility snapshots (10-100x cheaper than screenshots)
+return await snapshot();
+
+// Click, type, fill forms
+await page.locator('role=button[name="Sign in"]').click();
+await page.locator('role=textbox[name="Search"]').fill('query');
+
+// Screenshots when you need them
+return await page.screenshot();
+
+// Work with multiple tabs
+const pages = context.pages();
+const gmail = pages.find(p => p.url().includes('mail.google'));
+
+// Persist data across calls
+state.results = await page.evaluate(() => document.title);
+```
+
+### MCP Tools
+
+
+| Tool      | Description                                                                                                                                                                                                                    |
+| --------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
+| `execute` | Run Playwright JavaScript in your real Chrome. Access `page`, `context`, `state`, `snapshot()`, `waitForPageLoad()`, `getLogs()`, `screenshotWithAccessibilityLabels()`, `cleanHTML()`, `pageMarkdown()`, and Node.js globals. |
+| `reset`   | Reconnect to the relay and clear state. Use when the connection drops.                                                                                                                                                         |
+
+
+### Diff-Aware Helpers
+
+Use `showDiffSinceLastCall` to control diff output vs full output in execute helper calls:
+
+```javascript
+await snapshot({ showDiffSinceLastCall: true });
+await snapshot({ showDiffSinceLastCall: false });
+await cleanHTML('body', { showDiffSinceLastCall: false });
+await pageMarkdown({ showDiffSinceLastCall: true });
+```
+
+### BrowserForce Tab Swarms // Parallel Tabs Processing
+
+BrowserForce uses a parallel-first policy for independent extraction jobs, so agents finish list/count/scrape tasks faster with bounded risk.
+
+- Rule: For count/list/extraction across independent pages, dates, or items, run parallel tabs first using `Promise.all` with a concurrency cap (`3-8`, typically start at `5`).
+- Fallback: If the site starts rate-limiting (`429`), anti-bot challenges appear, or timeouts repeat, automatically retry with reduced concurrency and then sequential as a final fallback.
+- Safety: This swarm exception is for read-only bulk extraction only; no user-tab mutation (checkout/purchase/send/delete/settings changes) during swarm runs.
+- Required telemetry return: `peakConcurrentTasks`, `wallClockMs`, `sumTaskDurationsMs`, `failures`, `retries`.
+
+Need role-based, real workflows? See [Actionable Use Cases](docs/USE_CASES.md).
+
+## Examples
+
+Get started with simple prompts. The AI generates code and does the work.
+
+**Example 1: Read page content (X.com search)**
+
+**Prompt to AI:**
+
+> Go to x.com/search and search for "browserforce". Show me the top 5 tweets you find.
+
+**What the AI does:** Navigates to X, searches the term, extracts top tweets, returns them to you.
+
+**Use case:** Quick research, trend tracking, social listening.
+
+
+
+**Example 2: Interact with a form (GitHub search)**
+
+**Prompt to AI:**
+
+> Go to GitHub and search for "ai agents". Show me the top 3 repositories and their star counts.
+
+**What the AI does:** Fills GitHub search, waits for results, extracts repo names + stars, returns them.
+
+**Use case:** Finding libraries, competitive research, project discovery.
+
+
+
+### Multi-Tab Workflows
+
+**Example 3: Search → Extract → Return**
+
+**Prompt to AI:**
+
+> Search ProductHunt for "AI tools" and give me the top 5 products with their taglines and upvote counts.
+
+**What the AI does:** Navigates ProductHunt, searches, extracts product info, returns structured data.
+
+**Use case:** Market research, finding tools, competitive analysis.
+
+
+
+**Example 4: Open result in new tab, process there**
+
+**Prompt to AI:**
+
+> Find the #1 product from your last ProductHunt search, click into it, and read the full description. Tell me what it does.
+
+**What the AI does:** Opens the product page from previous results, reads the description, summarizes it.
+
+**Use case:** Deep-dive research, understanding competitors, due diligence.
+
+
+
+**Example 5: Debugging workflow (inspect + verify)**
+
+**Prompt to AI:**
+
+> Go to my staging site at staging.myapp.com/checkout and take a labeled screenshot. Tell me if the "Complete Purchase" button is visible and what's around it.
+
+**What the AI does:** Navigates, takes screenshot with interactive labels, analyzes button state and layout.
+
+**Use case:** Visual debugging, QA checks, spotting broken elements.
+
+
+
+**Example 6: Test form with data**
+
+**Prompt to AI:**
+
+> Sign up for Substack using the email [test.user@example.com](mailto:test.user@example.com). Tell me if the signup completes successfully.
+
+**What the AI does:** Fills the form, submits, waits for confirmation, reports success/failure.
+
+**Use case:** Testing sign-up flows, QA automation, form validation.
+
+
+
+**Example 7: Content pipeline (search → extract → compare)**
+
+**Prompt to AI:**
+
+> Search for "AI regulation" on both X.com and LinkedIn. Give me the top 5 trending posts from each and tell me which topics overlap.
+
+**What the AI does:** Searches both platforms, extracts posts, compares content, returns analysis.
+
+**Use case:** Multi-source research, trend analysis, market sentiment.
+
+
+
+**Example 8: Data extraction → CSV pipeline**
+
+**Prompt to AI:**
+
+> Go to Hacker News and extract the top 10 stories with their titles and vote counts. Format as CSV so I can import into a spreadsheet.
+
+**What the AI does:** Navigates HN, extracts story data, formats as CSV, returns it ready to paste.
+
+**Use case:** Data workflows, trend tracking, content curation.
+
+
+
+**Example 9: A/B testing across variants**
+
+**Prompt to AI:**
+
+> Visit myapp.com/?variant=red and myapp.com/?variant=blue. Compare the two designs and tell me which button color is more prominent and what other differences exist.
+
+**What the AI does:** Opens both variants, compares layouts/colors/text, reports visual differences.
+
+**Use case:** Design QA, A/B testing, variant comparison.
+
+
+
+**Example 10: Monitor + alert workflow**
+
+**Prompt to AI:**
+
+> Check our status page at status.myapp.com every few minutes. Tell me the current status of the API and database. Alert me if anything changes from green to red.
+
+**What the AI does:** Monitors status page, reads indicators, alerts on degradation.
+
+**Use case:** Uptime monitoring, incident detection, SLA tracking.
+
+
+
+### Parallel Tab Swarms: Real-World Use Cases
+
+**Example 11: Retail price swarm (SKU × store matrix)**
+
+**Prompt to AI:**
+
+> For these 25 SKUs, check Amazon, Walmart, Target, and Best Buy in parallel tabs. Return the best price, in-stock status, and fastest delivery ETA per SKU.
+
+**What the AI does:** Runs independent `(sku, store)` checks in capped parallel tab batches, retries with reduced concurrency on `429`/timeouts, then falls back sequentially if needed.
+
+**Use case:** Pricing intelligence, buy-box monitoring, merchandising ops.
+
+
+
+**Example 12: Travel fare grid (date × route sweep)**
+
+**Prompt to AI:**
+
+> For SFO → JFK, scan the next 14 Fridays and Sundays across Google Flights, Kayak, and Expedia. Return the cheapest refundable option for each date.
+
+**What the AI does:** Opens independent `(date, site)` tasks in parallel, extracts fare + refundability, and returns a normalized comparison table.
+
+**Use case:** Travel operations, procurement, rapid itinerary optimization.
+
+
+
+**Example 13: Competitor launch radar (company × source)**
+
+**Prompt to AI:**
+
+> Track the last 7 days of updates for these 30 competitors across release notes, changelogs, docs, and blog posts. Group findings by feature category.
+
+**What the AI does:** Parallelizes `(company, source)` extraction, deduplicates announcements, and returns a launch digest with links.
+
+**Use case:** Product strategy, PM intelligence, competitive monitoring.
+
+
+
+**Example 14: Lead qualification swarm (account × signal source)**
+
+**Prompt to AI:**
+
+> For this account list, check careers pages, LinkedIn jobs, pricing pages, and press/news for expansion signals. Score each account and rank top opportunities.
+
+**What the AI does:** Executes independent account-source checks in parallel tabs, extracts signal evidence, and returns ranked lead scores with rationale.
+
+**Use case:** Sales research, outbound prioritization, RevOps signal mining.
+
+
+
+**Example 15: Security exposure triage (domain × surface)**
+
+**Prompt to AI:**
+
+> For these domains, inspect login pages, robots.txt, status pages, public docs, and likely staging links. Flag suspicious exposures with evidence links.
+
+**What the AI does:** Runs read-only `(domain, surface)` checks in a swarm, retries degraded paths safely, and returns a risk-prioritized findings report.
+
+**Use case:** Security reviews, surface mapping, pre-audit triage.
+
+
+
+**More examples** and detailed walkthrough available in the [User Guide](GUIDE.md#examples).
+
+## How It Works
+
+```
+  Agent (OpenClaw, Claude, etc.)
+         │
+         ├─ MCP server (stdio)
+         ├─ CLI (browserforce -e)
+         │
+         │ CDP over WebSocket
+         ▼
+  Relay Server (localhost:19222)
+         │
+         │ WebSocket
+         ▼
+  Chrome Extension (MV3)
+         │
+         │ chrome.debugger API
+         ▼
+  Your Real Chrome Browser
+```
+
+The **relay server** runs on your machine (localhost only). It translates between the agent's CDP commands and the extension's debugger bridge.
+
+The **Chrome extension** lives in your browser. It attaches Chrome's built-in debugger to permitted tabs and forwards commands — exactly like DevTools does.
+
+In **Auto mode**, the agent can create and control tabs it opens. In **Manual mode**, you decide access by clicking **+ Attach Current Tab**.
+
+## You Stay in Control
+
+Click the extension icon to configure restrictions. Your browser, your rules:
+
+
+| Setting                 | What it does                                                             |
+| ----------------------- | ------------------------------------------------------------------------ |
+| **Auto / Manual mode**  | Let the agent create tabs freely, or hand-pick which tabs it can access  |
+| **Lock URL**            | Prevent the agent from navigating away from the current page             |
+| **No new tabs**         | Block the agent from opening new tabs                                    |
+| **Read-only**           | Observe only — no clicks, no typing, no interactions                     |
+| **Auto-detach**         | Automatically detach inactive tabs after 5-60 minutes                    |
+| **Auto-close**          | Automatically close agent-created tabs after 5-60 minutes                |
+| **Custom instructions** | Pass text instructions to the agent (e.g. "don't click any buy buttons") |
+
+
+### Controlled Tab Workflows
+
+- **Manually attach a tab:** Open the tab you want, click the extension popup, then click **+ Attach Current Tab**.
+- **Open full log viewer:** In the popup, click **View Full Logs** to open the extension options page.
+- **Restrict to one controlled tab:** Use **Manual mode**, attach one tab, and enable **No new tabs**.
+- **Allow multiple controlled tabs:** Stay in **Manual mode** and attach each tab you want the agent to access.
+- **Restriction modes:** Use **Lock URL** (no navigation), **No new tabs**, and **Read-only** (observe only) together or separately.
+- **Auto-cleanup:** Use **Auto-detach** for inactive attached tabs and **Auto-close** for agent-created tabs.
+
+For step-by-step setups, see the [Controlled Tabs Playbook](GUIDE.md#controlled-tabs-playbook).
+
+## Security
+
+
+| Layer            | Control                                                                 |
+| ---------------- | ----------------------------------------------------------------------- |
+| **Network**      | Relay binds to `127.0.0.1` only — never exposed to the internet         |
+| **Auth**         | Random token required for every CDP connection                          |
+| **Origin**       | Extension only accepts connections from its own Chrome origin           |
+| **Visibility**   | Chrome shows "controlled by automated test software" on active tabs     |
+| **Restrictions** | Lock URLs, block navigation, read-only mode — enforced at the CDP level |
+
+
+Everything runs on your machine. The auth token is stored at `~/.browserforce/auth-token` with owner-only permissions.
+
+## Configuration
+
+**Custom relay port:**
+
+```bash
+RELAY_PORT=19333 browserforce serve
+```
+
+**Extension relay URL:** Click the extension icon → change the URL → Save. Default: `ws://127.0.0.1:19222/extension`
+
+**Override CDP URL for MCP:**
+
+```json
+{
+  "env": {
+    "BF_CDP_URL": "ws://127.0.0.1:19333/cdp?token=your-token"
+  }
+}
+```
+
+**Client arbitration mode (`BF_CLIENT_MODE`):**
+
+```bash
+# default: one active /cdp client at a time
+BF_CLIENT_MODE=single-active browserforce serve
+
+# fallback: allow concurrent /cdp clients
+BF_CLIENT_MODE=multi-client browserforce serve
+```
+
+In `single-active` mode, the relay enforces one active client slot. A second `/cdp` connection receives HTTP `409 Conflict` (busy). In `multi-client` mode, slot arbitration is disabled.
+
+**MCP standby polling (single-active mode):** if MCP sees a busy/`409` connect error, it enters standby and polls `GET /client-slot` until `busy: false` (about every 200-400ms, up to 30s), then retries connect.
+
+**Operational non-goals:** canonical list is maintained in [AGENTS.md](AGENTS.md#operational-non-goals).
+
+## API
+
+| Endpoint                 | Description                                   |
+| ------------------------ | --------------------------------------------- |
+| `GET /`                  | Health check (extension status, target count) |
+| `GET /client-slot`       | Client-slot state: `{ mode, busy, activeClientId, connectedAt }` |
+| `GET /json/version`      | CDP discovery                                 |
+| `GET /json/list`         | List attached targets                         |
+| `GET /logs/status` | Logs viewer status (extension-only origin) |
+| `GET /logs/cdp?after=&limit=` | Incremental CDP log polling feed (extension-only origin) |
+| `ws://.../extension` | Chrome extension WebSocket |
+| `ws://.../cdp?token=...` | Agent CDP connection |
+
+Tip: add `&label=<name>` to the CDP URL to tag client connections in the logs viewer (MCP defaults to `browserforce-mcp`).
+
+## Troubleshooting
+
+
+| Problem                      | Fix                                                   |
+| ---------------------------- | ----------------------------------------------------- |
+| Extension stays gray         | Is the relay running? Check `http://127.0.0.1:19222/` |
+| "Another debugger attached"  | Close DevTools for that tab                           |
+| Agent sees 0 pages           | Open at least one regular webpage (not `chrome://`)   |
+| Extension keeps reconnecting | Normal — MV3 kills idle workers; it auto-recovers     |
+| Port in use                  | `lsof -ti:19222 | xargs kill -9`                      |
+
+
+### MCP Error: `Protocol error (Target.createTarget): Extension not connected`
+
+This usually means MCP is talking to a relay process that is not the one your extension is connected to.
+
+Quick checks:
+
+```bash
+curl -s http://127.0.0.1:19222/ | jq
+cat ~/.browserforce/cdp-url
+echo "${BF_CDP_URL:-<unset>}"
+```
+
+If MCP is reading a stale `~/.browserforce/cdp-url` (or `BF_CDP_URL` override), it may connect to the wrong port.
+
+Recovery steps (npm/npx workflow):
+
+```bash
+# 1) Stop stale relay listeners
+lsof -tiTCP:19222 -sTCP:LISTEN | xargs kill -9 2>/dev/null || true
+lsof -tiTCP:19888 -sTCP:LISTEN | xargs kill -9 2>/dev/null || true
+
+# 2) Clear stale override for this shell
+unset BF_CDP_URL
+
+# 3) Start one fresh relay on default port
+RELAY_PORT=19222 npx -y browserforce@latest serve
+```
+
+Then verify:
+
+```bash
+cat ~/.browserforce/cdp-url
+curl -s http://127.0.0.1:19222/client-slot | jq
+```
+
+Expected: `cdp-url` points to `ws://127.0.0.1:19222/...` and `/client-slot` returns `{ mode, busy, activeClientId, connectedAt }`.
+
+### MCP Error: `Unexpected server response: 409`
+
+This means single-active arbitration is working and another CDP client is currently holding the slot.
+
+Check:
+
+```bash
+curl -s http://127.0.0.1:19222/client-slot | jq
+```
+
+If `busy: true`, close the other MCP/CDP session or set `BF_CLIENT_MODE=multi-client` for explicit concurrent-client fallback.
+
+CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay start). Summarize traffic by direction + method:
+
+```bash
+jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
+```
+
+For practical debugging and operations flows, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
+
+> **Need advanced operator playbooks?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for controlled-tab workflows, parallel swarm patterns, and production diagnostics.
+
+</details>
diff --git a/README.md b/README.md
index 4794103..4692cfb 100644
--- a/README.md
+++ b/README.md
@@ -775,6 +775,39 @@ curl -s http://127.0.0.1:19222/client-slot | jq
 
 If `busy: true`, close the other MCP/CDP session or set `BF_CLIENT_MODE=multi-client` for explicit concurrent-client fallback.
 
+### MCP Error: `MCP client for "browserforce" timed out after 10 seconds`
+
+This can happen when a second MCP session starts while BrowserForce is still connecting/retrying in the background.
+
+Why: the MCP process currently attempts browser connection during startup, and Codex's default MCP startup timeout can be shorter than BrowserForce's connect retry window.
+
+Fix in Codex config (`~/.codex/config.toml`):
+
+```toml
+[mcp_servers.browserforce]
+command = "npx"
+args = ["-y", "browserforce@latest", "mcp"]
+startup_timeout_sec = 45
+```
+
+Recommended reliable workflow:
+
+```bash
+# Terminal A: run one shared relay
+npx -y browserforce@latest serve
+
+# MCP clients (Codex/Cursor/Claude): run mcp only
+npx -y browserforce@latest mcp
+```
+
+If startup still times out, verify the relay endpoint and slot state:
+
+```bash
+curl -s http://127.0.0.1:19222/ | jq
+curl -s http://127.0.0.1:19222/client-slot | jq
+cat ~/.browserforce/cdp-url
+```
+
 CDP traffic is logged to `~/.browserforce/cdp.jsonl` (recreated on each relay start). Summarize traffic by direction + method:
 
 ```bash
diff --git a/mcp/src/index.js b/mcp/src/index.js
index ef6f256..89cc9d6 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -65,6 +65,14 @@ function ensureAllPagesCapture() {
 
 let browser = null;
 const CONNECT_RETRY_TIMEOUT_MS = 30000;
+const BACKGROUND_CONNECT_RETRY_INTERVAL_MS = 1500;
+let browserConnectPromise = null;
+let backgroundConnectLoopStarted = false;
+let lastBackgroundConnectError = null;
+
+function sleep(ms) {
+  return new Promise((resolve) => globalThis.setTimeout(resolve, ms));
+}
 
 function withClientLabel(cdpUrl) {
   try {
@@ -83,30 +91,80 @@ function withClientLabel(cdpUrl) {
 
 async function ensureBrowser() {
   if (browser?.isConnected()) return;
-  await ensureRelay();
-  const cdpUrl = withClientLabel(getCdpUrl());
-  browser = await connectOverCdpWithBusyRetry({
-    connect: (url) => chromium.connectOverCDP(url),
-    cdpUrl,
-    baseUrl: getRelayHttpUrl(),
-    timeoutMs: CONNECT_RETRY_TIMEOUT_MS,
-  });
-  browser.on('disconnected', () => {
-    browser = null;
-    contextListenerAttached = false;
-    consoleLogs.clear();
-  });
+  if (browserConnectPromise) {
+    await browserConnectPromise;
+    return;
+  }
+
+  browserConnectPromise = (async () => {
+    await ensureRelay();
+    const cdpUrl = withClientLabel(getCdpUrl());
+    const nextBrowser = await connectOverCdpWithBusyRetry({
+      connect: (url) => chromium.connectOverCDP(url),
+      cdpUrl,
+      baseUrl: getRelayHttpUrl(),
+      timeoutMs: CONNECT_RETRY_TIMEOUT_MS,
+    });
+    browser = nextBrowser;
+    browser.on('disconnected', () => {
+      browser = null;
+      contextListenerAttached = false;
+      consoleLogs.clear();
+    });
+
+    try {
+      const ctx = browser.contexts()[0];
+      if (ctx && !contextListenerAttached) {
+        ctx.on('page', (page) => setupConsoleCapture(page));
+        contextListenerAttached = true;
+        for (const page of ctx.pages()) {
+          setupConsoleCapture(page);
+        }
+      }
+    } catch { /* context not ready yet — capture will attach lazily */ }
+  })();
 
   try {
-    const ctx = browser.contexts()[0];
-    if (ctx && !contextListenerAttached) {
-      ctx.on('page', (page) => setupConsoleCapture(page));
-      contextListenerAttached = true;
-      for (const page of ctx.pages()) {
-        setupConsoleCapture(page);
+    await browserConnectPromise;
+  } finally {
+    browserConnectPromise = null;
+  }
+}
+
+function startBackgroundConnectionLoop() {
+  if (backgroundConnectLoopStarted) return;
+  backgroundConnectLoopStarted = true;
+
+  (async () => {
+    while (true) {
+      if (browser?.isConnected()) {
+        lastBackgroundConnectError = null;
+        await sleep(BACKGROUND_CONNECT_RETRY_INTERVAL_MS);
+        continue;
+      }
+
+      try {
+        await ensureBrowser();
+        if (lastBackgroundConnectError !== null) {
+          process.stderr.write('[bf-mcp] Relay slot available; connected\n');
+          lastBackgroundConnectError = null;
+        } else {
+          process.stderr.write('[bf-mcp] Connected to relay\n');
+        }
+      } catch (err) {
+        const message = err?.message || String(err);
+        if (message !== lastBackgroundConnectError) {
+          process.stderr.write(`[bf-mcp] Waiting for relay/browser: ${message}\n`);
+          process.stderr.write('[bf-mcp] MCP is running; tools will connect when slot is available\n');
+          lastBackgroundConnectError = message;
+        }
       }
+
+      await sleep(BACKGROUND_CONNECT_RETRY_INTERVAL_MS);
     }
-  } catch { /* context not ready yet — capture will attach lazily */ }
+  })().catch((err) => {
+    process.stderr.write(`[bf-mcp] Background connect loop error: ${err?.message || String(err)}\n`);
+  });
 }
 
 function getContext() {
@@ -544,17 +602,10 @@ async function main() {
   // Fire update check in background — result stored in pendingUpdate for execute handler
   checkForUpdate().then(info => { pendingUpdate = info; }).catch(() => {});
 
-  try {
-    await ensureBrowser();
-    process.stderr.write('[bf-mcp] Connected to relay\n');
-  } catch (err) {
-    process.stderr.write(`[bf-mcp] Warning: ${err.message}\n`);
-    process.stderr.write('[bf-mcp] Tools will attempt to connect on first use\n');
-  }
-
   const transport = new StdioServerTransport();
   await server.connect(transport);
   process.stderr.write('[bf-mcp] MCP server running\n');
+  startBackgroundConnectionLoop();
 }
 
 main().catch((err) => {

From 93c045e138c843d73c328ff81bd8b00873288003 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 17:28:46 +0530
Subject: [PATCH 069/192] feat(relay): enhance origin parsing to support
 extension referers

---
 relay/src/index.js              | 21 ++++++++++++++++-----
 relay/test/relay-server.test.js | 19 +++++++++++++++++++
 2 files changed, 35 insertions(+), 5 deletions(-)

diff --git a/relay/src/index.js b/relay/src/index.js
index ae13a3f..3e7692f 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -462,11 +462,22 @@ class RelayServer {
   }
 
   _extensionOriginFromReq(req) {
-    const origin = req?.headers?.origin || '';
-    if (!origin.startsWith('chrome-extension://')) {
-      return null;
-    }
-    return origin;
+    const parseExtensionOrigin = (value) => {
+      if (!value || !value.startsWith('chrome-extension://')) return null;
+      try {
+        const parsed = new URL(value);
+        if (parsed.protocol !== 'chrome-extension:' || !parsed.host) return null;
+        return `chrome-extension://${parsed.host}`;
+      } catch {
+        return null;
+      }
+    };
+
+    const origin = parseExtensionOrigin(req?.headers?.origin || '');
+    if (origin) return origin;
+
+    const referer = req?.headers?.referer || req?.headers?.referrer || '';
+    return parseExtensionOrigin(referer);
   }
 
   _deriveClientLabel(req) {
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index 3cab703..fb1cebf 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -260,6 +260,25 @@ describe('Logs Viewer Endpoints', () => {
     await sleep(50);
   });
 
+  it('GET /logs/status accepts extension referer when Origin is absent', async () => {
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') ext.send(JSON.stringify({ method: 'pong' }));
+    });
+
+    const { status, body } = await httpGetWithHeaders(`http://127.0.0.1:${port}/logs/status`, {
+      Referer: 'chrome-extension://test/options.html',
+    });
+    assert.equal(status, 200);
+    assert.equal(body.extension?.connected, true);
+
+    ext.close();
+    await sleep(50);
+  });
+
   it('GET /logs/cdp supports incremental polling with after/limit', async () => {
     const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
       headers: { Origin: 'chrome-extension://test' },

From 0e58f96cd0b5360ab41fe79d28261a1aa335aed5 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 17:29:36 +0530
Subject: [PATCH 070/192] chore: bump version to 1.0.13 in package.json

---
 package.json | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/package.json b/package.json
index 51c77d4..3df4b5e 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce",
-  "version": "1.0.12",
+  "version": "1.0.13",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",

From ffd2bce2391ae7184db3102bf5ce71abf1c56ec7 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 21:14:33 +0530
Subject: [PATCH 071/192] docs: update README to clarify BrowserForce features
 and add deep dive sections for plugins, agent capabilities, examples,
 architecture, and troubleshooting

---
 README.md | 40 +++++++++++++++++++++++++++++++++++++---
 1 file changed, 37 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index 4692cfb..18ff6bf 100644
--- a/README.md
+++ b/README.md
@@ -12,7 +12,7 @@ Give AI agents controlled access to the browser you already use.
 
 Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-compatible agent.
 
-## Comparison
+## Why BrowserForce?
 
 
 |                | Playwright MCP       | OpenClaw Browser        | Playwriter              | Claude Extension     | BrowserForce                         |
@@ -21,8 +21,8 @@ Works with [OpenClaw](https://github.com/openclaw/openclaw), Claude, or any MCP-
 | Login state    | Fresh                | Fresh (isolated)        | Yours                   | Yours                | **Yours**                            |
 | Tab access     | N/A (new browser)    | Managed by agent        | Click each tab          | Click each tab       | **Auto mode + manual attached tabs** |
 | Autonomous     | Yes                  | Yes                     | No (manual click)       | No (manual click)    | **Yes (fully autonomous)**           |
-| Context method | Screenshots (100KB+) | Screenshots + snapshots | A11y snapshots (5-20KB) | Screenshots (100KB+) | **A11y snapshots (5-20KB)**          |
-| Tools          | Many dedicated       | 1 `browser` tool        | 1 `execute` tool        | Built-in             | **2 tools: `execute`, `reset`**      |
+| Context method | Screenshots (100KB+) | Screenshots + snapshots | A11y snapshots (5-20KB) | Screenshots (100KB+) | **A11y snapshots (5-20KB), also support screenshots**          |
+| Tools          | Many dedicated       | 1 `browser` tool        | 1 `execute` tool        | Built-in             | **2 tools: `execute`, `reset` + extend via plugins**      |
 | Agent support  | Any MCP client       | OpenClaw only           | Any MCP client          | Claude only          | **Any MCP client**                   |
 | Playwright API | Partial              | No                      | Full                    | No                   | **Full**                             |
 
@@ -261,6 +261,14 @@ browserforce install-extension  # Copy extension to ~/.browserforce/extension/
 
 Each `-e` command is one-shot — state does not persist between calls. For persistent state, use the MCP server.
 
+
+## Deep Dive Sections
+
+The core onboarding above stays visible. The sections below keep full detail, organized with progressive disclosure.
+
+<details>
+<summary><b>Plugins (Install, Official, Usage, Manage, Authoring)</b></summary>
+
 ## Plugins
 
 Plugins add custom helpers directly into the `execute` tool scope. Install once — your agent calls them like built-in functions.
@@ -347,6 +355,12 @@ await gmail.screenshot({ path: 'gmail.png' });
 
 No token config needed for MCP — the server reads it automatically from `~/.browserforce/cdp-url`.
 
+
+</details>
+
+<details>
+<summary><b>Agent Capabilities (Tools, Helpers, Swarms)</b></summary>
+
 ## What Your Agent Can Do
 
 Once connected, your agent has full Playwright access to your real browser:
@@ -405,6 +419,12 @@ BrowserForce uses a parallel-first policy for independent extraction jobs, so ag
 
 Need role-based, real workflows? See [Actionable Use Cases](docs/USE_CASES.md).
 
+
+</details>
+
+<details>
+<summary><b>Examples and Multi-Tab Workflows</b></summary>
+
 ## Examples
 
 Get started with simple prompts. The AI generates code and does the work.
@@ -595,6 +615,12 @@ Get started with simple prompts. The AI generates code and does the work.
 
 **More examples** and detailed walkthrough available in the [User Guide](GUIDE.md#examples).
 
+
+</details>
+
+<details>
+<summary><b>Architecture, Control, Security, Configuration, API</b></summary>
+
 ## How It Works
 
 ```
@@ -714,6 +740,12 @@ In `single-active` mode, the relay enforces one active client slot. A second `/c
 
 Tip: add `&label=<name>` to the CDP URL to tag client connections in the logs viewer (MCP defaults to `browserforce-mcp`).
 
+
+</details>
+
+<details>
+<summary><b>Troubleshooting and Diagnostics</b></summary>
+
 ## Troubleshooting
 
 
@@ -817,3 +849,5 @@ jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.
 For practical debugging and operations flows, see [Actionable Use Cases](docs/USE_CASES.md#developer-high-impact).
 
 > **Need advanced operator playbooks?** Read the [User Guide](https://github.com/ivalsaraj/browserforce/blob/main/GUIDE.md) for controlled-tab workflows, parallel swarm patterns, and production diagnostics.
+
+</details>

From 222cb3ceecf2a464ed382c18d0ce72a5f12d290f Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 21:40:34 +0530
Subject: [PATCH 072/192] feat(relay): default client arbitration to
 multi-client

---
 relay/src/index.js              | 5 +++--
 relay/test/relay-server.test.js | 4 ++--
 2 files changed, 5 insertions(+), 4 deletions(-)

diff --git a/relay/src/index.js b/relay/src/index.js
index 3e7692f..978f83d 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -67,8 +67,9 @@ function writeCdpUrlFile(cdpUrl) {
 }
 
 function getClientMode() {
-  const mode = (process.env.BF_CLIENT_MODE || CLIENT_MODE_SINGLE).trim();
-  return mode === CLIENT_MODE_MULTI ? CLIENT_MODE_MULTI : CLIENT_MODE_SINGLE;
+  // Default to multi-client for zero-config MCP onboarding.
+  const mode = (process.env.BF_CLIENT_MODE || CLIENT_MODE_MULTI).trim();
+  return mode === CLIENT_MODE_SINGLE ? CLIENT_MODE_SINGLE : CLIENT_MODE_MULTI;
 }
 
 // ─── RelayServer ─────────────────────────────────────────────────────────────
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index fb1cebf..860b4ca 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -127,10 +127,10 @@ describe('Token Persistence', () => {
   const tmpDir = path.join(os.tmpdir(), `bf-test-${crypto.randomBytes(4).toString('hex')}`);
   const origBfDir = BF_DIR;
 
-  it('defaults to single-active client mode', () => {
+  it('defaults to multi-client mode', () => {
     delete process.env.BF_CLIENT_MODE;
     const relay = new RelayServer(getRandomPort());
-    assert.equal(relay.clientMode, 'single-active');
+    assert.equal(relay.clientMode, 'multi-client');
   });
 
   it('creates auth token file on first run', () => {

From 3351ab332b6c71abb7a5758cbe5273e6da7fbf41 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 21:40:38 +0530
Subject: [PATCH 073/192] docs: make single-active an opt-in mode with guidance

---
 AGENTS.md |  4 +--
 README.md | 81 ++++++++++++++++++++++++++++++++++++++++++++++++-------
 2 files changed, 73 insertions(+), 12 deletions(-)

diff --git a/AGENTS.md b/AGENTS.md
index 326958e..9ba081d 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -160,8 +160,8 @@ When a user clicks "Cancel" on Chrome's automation infobar, Chrome detaches the
 ### Client Arbitration: BF_CLIENT_MODE
 
 `BF_CLIENT_MODE` controls agent-side CDP arbitration:
-- `single-active` (default): only one active `/cdp` client connection at a time.
-- `multi-client`: fallback mode that allows concurrent `/cdp` clients.
+- `multi-client` (default): allows concurrent `/cdp` clients.
+- `single-active`: opt-in mode that allows only one active `/cdp` client connection at a time.
 
 In `single-active`, contention returns HTTP `409 Conflict` for additional `/cdp` connects while the slot is busy. Slot state is exposed at `GET /client-slot` (`mode`, `busy`, `activeClientId`, `connectedAt`).
 
diff --git a/README.md b/README.md
index 18ff6bf..d57cdfd 100644
--- a/README.md
+++ b/README.md
@@ -230,6 +230,67 @@ Add the same `mcpServers` entry:
 
 </details>
 
+Need deterministic single-owner handoff for sensitive workflows?
+Set `BF_CLIENT_MODE=single-active` in the MCP server command.
+
+Why use `single-active`:
+- Prevents two MCP clients from driving the browser at the same time.
+- Makes contention explicit (`409` + `/client-slot`), which is easier to debug.
+- Better for write-heavy flows where accidental concurrent actions are risky.
+
+<details>
+<summary><b>Set BF_CLIENT_MODE=single-active (all MCP clients)</b></summary>
+
+These examples use the POSIX `env` wrapper. If your MCP client supports an `env` object/map, set `BF_CLIENT_MODE=single-active` there instead.
+
+**OpenClaw (MCP adapter):**
+
+```json
+{
+  "plugins": {
+    "entries": {
+      "mcp-adapter": {
+        "enabled": true,
+        "config": {
+          "servers": [
+            {
+              "name": "browserforce",
+              "transport": "stdio",
+              "command": "env",
+              "args": ["BF_CLIENT_MODE=single-active", "npx", "-y", "browserforce@latest", "mcp"]
+            }
+          ]
+        }
+      }
+    }
+  }
+}
+```
+
+**Claude Desktop / Claude Code / Cursor / Antigravity:**
+
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "env",
+      "args": ["BF_CLIENT_MODE=single-active", "npx", "-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
+
+**Codex (`~/.codex/config.toml`):**
+
+```toml
+[mcp_servers.browserforce]
+command = "env"
+args = ["BF_CLIENT_MODE=single-active", "npx", "-y", "browserforce@latest", "mcp"]
+startup_timeout_sec = 45
+```
+
+</details>
+
 
 
 If MCP startup fails with `connection closed: initialize response`:
@@ -712,14 +773,14 @@ RELAY_PORT=19333 browserforce serve
 **Client arbitration mode (`BF_CLIENT_MODE`):**
 
 ```bash
-# default: one active /cdp client at a time
-BF_CLIENT_MODE=single-active browserforce serve
-
-# fallback: allow concurrent /cdp clients
+# default: allow concurrent /cdp clients
 BF_CLIENT_MODE=multi-client browserforce serve
+
+# opt-in: enforce one active /cdp client slot
+BF_CLIENT_MODE=single-active browserforce serve
 ```
 
-In `single-active` mode, the relay enforces one active client slot. A second `/cdp` connection receives HTTP `409 Conflict` (busy). In `multi-client` mode, slot arbitration is disabled.
+In `multi-client` mode (default), slot arbitration is disabled. In `single-active` mode, the relay enforces one active client slot and a second `/cdp` connection receives HTTP `409 Conflict` (busy).
 
 **MCP standby polling (single-active mode):** if MCP sees a busy/`409` connect error, it enters standby and polls `GET /client-slot` until `busy: false` (about every 200-400ms, up to 30s), then retries connect.
 
@@ -797,7 +858,7 @@ Expected: `cdp-url` points to `ws://127.0.0.1:19222/...` and `/client-slot` retu
 
 ### MCP Error: `Unexpected server response: 409`
 
-This means single-active arbitration is working and another CDP client is currently holding the slot.
+This appears when `BF_CLIENT_MODE=single-active` and another CDP client currently holds the slot.
 
 Check:
 
@@ -805,11 +866,11 @@ Check:
 curl -s http://127.0.0.1:19222/client-slot | jq
 ```
 
-If `busy: true`, close the other MCP/CDP session or set `BF_CLIENT_MODE=multi-client` for explicit concurrent-client fallback.
+If `busy: true`, either close the current active MCP/CDP session, or remove single-active mode (default is `multi-client`).
 
 ### MCP Error: `MCP client for "browserforce" timed out after 10 seconds`
 
-This can happen when a second MCP session starts while BrowserForce is still connecting/retrying in the background.
+This is most common when you intentionally run `single-active` mode and a second MCP session starts while the slot is busy.
 
 Why: the MCP process currently attempts browser connection during startup, and Codex's default MCP startup timeout can be shorter than BrowserForce's connect retry window.
 
@@ -817,8 +878,8 @@ Fix in Codex config (`~/.codex/config.toml`):
 
 ```toml
 [mcp_servers.browserforce]
-command = "npx"
-args = ["-y", "browserforce@latest", "mcp"]
+command = "env"
+args = ["BF_CLIENT_MODE=single-active", "npx", "-y", "browserforce@latest", "mcp"]
 startup_timeout_sec = 45
 ```
 

From e65f15e6fae0a1cae0f5c541385b08280f43b657 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 21:42:00 +0530
Subject: [PATCH 074/192] chore: bump version to 1.0.14 in package.json

---
 package.json | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/package.json b/package.json
index 3df4b5e..43eb39f 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce",
-  "version": "1.0.13",
+  "version": "1.0.14",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",

From 6bea311cf3ff87342704ed3afeabeba035c191e7 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 22:49:01 +0530
Subject: [PATCH 075/192] feat(extension): add execution and parallel
 visibility settings in popup

---
 extension/popup.html | 16 ++++++++++++++++
 extension/popup.js   | 13 +++++++++++++
 2 files changed, 29 insertions(+)

diff --git a/extension/popup.html b/extension/popup.html
index 7f23d5c..e6e5c64 100644
--- a/extension/popup.html
+++ b/extension/popup.html
@@ -52,6 +52,22 @@ <h1>BrowserForce</h1>
         </select>
       </section>
 
+      <section class="field">
+        <label for="bf-execution-mode">Execution Strategy</label>
+        <select id="bf-execution-mode" class="full-width">
+          <option value="parallel">Parallel</option>
+          <option value="sequential">Sequential</option>
+        </select>
+      </section>
+
+      <section class="field">
+        <label for="bf-parallel-visibility">Parallel Tab Visibility</label>
+        <select id="bf-parallel-visibility" class="full-width">
+          <option value="foreground-tab">Visible new tabs (current window)</option>
+          <option value="rotate-visible">Rotate visible tabs (demo)</option>
+        </select>
+      </section>
+
       <section class="field">
         <label>Restrictions</label>
         <div class="settings-group">
diff --git a/extension/popup.js b/extension/popup.js
index f6bc646..be61438 100644
--- a/extension/popup.js
+++ b/extension/popup.js
@@ -21,6 +21,8 @@ const autoTimerEl = document.getElementById('bf-auto-timer');
 const attachBtn = document.getElementById('bf-attach-tab');
 const openLogsBtn = document.getElementById('bf-open-logs');
 const modeSelect = document.getElementById('bf-mode');
+const executionModeSelect = document.getElementById('bf-execution-mode');
+const parallelVisibilitySelect = document.getElementById('bf-parallel-visibility');
 const lockUrlCb = document.getElementById('bf-lock-url');
 const noNewTabsCb = document.getElementById('bf-no-new-tabs');
 const readOnlyCb = document.getElementById('bf-read-only');
@@ -44,6 +46,7 @@ document.querySelectorAll('.tab-btn').forEach((btn) => {
 const SETTINGS_KEYS = [
   'relayUrl', 'autoDetachMinutes', 'autoCloseMinutes',
   'mode', 'lockUrl', 'noNewTabs', 'readOnly', 'userInstructions',
+  'executionMode', 'parallelVisibilityMode',
 ];
 
 chrome.storage.local.get(SETTINGS_KEYS, (s) => {
@@ -51,6 +54,8 @@ chrome.storage.local.get(SETTINGS_KEYS, (s) => {
   autoDetachSelect.value = String(s.autoDetachMinutes || 0);
   autoCloseSelect.value = String(s.autoCloseMinutes || 0);
   modeSelect.value = s.mode || 'auto';
+  executionModeSelect.value = s.executionMode || 'parallel';
+  parallelVisibilitySelect.value = s.parallelVisibilityMode || 'foreground-tab';
   lockUrlCb.checked = !!s.lockUrl;
   noNewTabsCb.checked = !!s.noNewTabs;
   readOnlyCb.checked = !!s.readOnly;
@@ -72,6 +77,14 @@ modeSelect.addEventListener('change', () => {
   chrome.storage.local.set({ mode: modeSelect.value });
 });
 
+executionModeSelect.addEventListener('change', () => {
+  chrome.storage.local.set({ executionMode: executionModeSelect.value });
+});
+
+parallelVisibilitySelect.addEventListener('change', () => {
+  chrome.storage.local.set({ parallelVisibilityMode: parallelVisibilitySelect.value });
+});
+
 autoDetachSelect.addEventListener('change', () => {
   chrome.storage.local.set({ autoDetachMinutes: Number(autoDetachSelect.value) });
 });

From f5a27ff326f7db85a10f24c5c8d97b3f0134e127 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 22:50:00 +0530
Subject: [PATCH 076/192] feat(extension): enforce visible parallel modes for
 agent-created tabs

---
 extension/background.js | 46 ++++++++++++++++++++++++++++++++++++++---
 1 file changed, 43 insertions(+), 3 deletions(-)

diff --git a/extension/background.js b/extension/background.js
index a3f0cbc..721236d 100644
--- a/extension/background.js
+++ b/extension/background.js
@@ -189,6 +189,8 @@ async function executeCommand(msg) {
           });
         });
       });
+    case 'getAgentPreferences':
+      return getAgentExecutionSettings();
     default:
       throw new Error(`Unknown command: ${msg.method}`);
   }
@@ -196,6 +198,31 @@ async function executeCommand(msg) {
 
 // ─── Tab Operations ──────────────────────────────────────────────────────────
 
+async function getAgentExecutionSettings() {
+  const s = await chrome.storage.local.get(['executionMode', 'parallelVisibilityMode']);
+  const executionMode = s.executionMode === 'sequential' ? 'sequential' : 'parallel';
+  const parallelVisibilityMode =
+    s.parallelVisibilityMode === 'rotate-visible'
+      ? 'rotate-visible'
+      : 'foreground-tab';
+
+  return { executionMode, parallelVisibilityMode };
+}
+
+async function getCurrentWindowId() {
+  const tabs = await chrome.tabs.query({ active: true, currentWindow: true });
+  if (tabs[0] && typeof tabs[0].windowId === 'number') {
+    return tabs[0].windowId;
+  }
+
+  const win = await chrome.windows.getLastFocused();
+  if (win && typeof win.id === 'number') {
+    return win.id;
+  }
+
+  return undefined;
+}
+
 async function listTabs() {
   const tabs = await chrome.tabs.query({});
   return {
@@ -294,10 +321,23 @@ async function createTab(params) {
     throw new Error(`BLOCKED: ${msg}`);
   }
 
-  const tab = await chrome.tabs.create({
+  const agentSettings = await getAgentExecutionSettings();
+  const windowId = await getCurrentWindowId();
+  const createOptions = {
     url: params.url || 'about:blank',
-    active: false,
-  });
+    // Keep agent-created tabs visible; do not spawn separate windows.
+    active: true,
+  };
+  if (typeof windowId === 'number') {
+    createOptions.windowId = windowId;
+  }
+
+  // rotate-visible remains normalized to visible tab creation in current window.
+  if (agentSettings.parallelVisibilityMode === 'rotate-visible') {
+    createOptions.active = true;
+  }
+
+  const tab = await chrome.tabs.create(createOptions);
 
   // Brief delay for Chrome to finalize tab creation
   await sleep(200);

From 47d83758055e886fe74761f5f90153f171aebf5f Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 22:52:35 +0530
Subject: [PATCH 077/192] feat(relay): add agent-preferences endpoint backed by
 extension settings

---
 relay/src/index.js              | 25 ++++++++++
 relay/test/relay-server.test.js | 87 +++++++++++++++++++++++++++++++++
 2 files changed, 112 insertions(+)

diff --git a/relay/src/index.js b/relay/src/index.js
index 978f83d..b198eb9 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -75,6 +75,10 @@ function getClientMode() {
 // ─── RelayServer ─────────────────────────────────────────────────────────────
 
 const DEFAULT_BROWSER_CONTEXT_ID = 'bf-default-context';
+const DEFAULT_AGENT_PREFERENCES = Object.freeze({
+  executionMode: 'parallel',
+  parallelVisibilityMode: 'foreground-tab',
+});
 
 // Commands Playwright sends automatically to every page during initialization.
 // We intercept these on unattached tabs and return synthetic responses so
@@ -149,6 +153,13 @@ function syntheticInitResponse(method, target) {
   }
 }
 
+function normalizeAgentPreferences(raw) {
+  const executionMode = raw?.executionMode === 'sequential' ? 'sequential' : 'parallel';
+  // Keep relay behavior locked to visible tabs in the current window.
+  const parallelVisibilityMode = 'foreground-tab';
+  return { executionMode, parallelVisibilityMode };
+}
+
 class RelayServer {
   constructor(port = DEFAULT_PORT, pluginsDir = BF_PLUGINS_DIR) {
     this.port = port;
@@ -330,6 +341,20 @@ class RelayServer {
       return;
     }
 
+    if (url.pathname === '/agent-preferences') {
+      if (!this.ext) {
+        res.end(JSON.stringify(DEFAULT_AGENT_PREFERENCES));
+        return;
+      }
+      try {
+        const preferences = await this._sendToExt('getAgentPreferences');
+        res.end(JSON.stringify(normalizeAgentPreferences(preferences)));
+      } catch {
+        res.end(JSON.stringify(DEFAULT_AGENT_PREFERENCES));
+      }
+      return;
+    }
+
     if (url.pathname === '/logs/status' && req.method === 'GET') {
       if (!this._requireExtensionOrigin(req, res)) return;
       res.end(JSON.stringify(this._logsStatus()));
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index 860b4ca..18c0e55 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -1673,6 +1673,93 @@ describe('GET /restrictions endpoint', () => {
   });
 });
 
+// ─── GET /agent-preferences Endpoint ────────────────────────────────────────
+
+describe('GET /agent-preferences endpoint', () => {
+  let relay;
+  let port;
+
+  before(async () => {
+    port = getRandomPort();
+    relay = new RelayServer(port);
+    relay.start({ writeCdpUrl: false });
+    await sleep(200);
+  });
+
+  after(() => {
+    relay.stop();
+  });
+
+  it('returns defaults when no extension is connected', async () => {
+    const { status, body } = await httpGet(`http://127.0.0.1:${port}/agent-preferences`);
+    assert.equal(status, 200);
+    assert.deepEqual(body, {
+      executionMode: 'parallel',
+      parallelVisibilityMode: 'foreground-tab',
+    });
+  });
+
+  it('forwards getAgentPreferences to extension and returns its response', async () => {
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+
+    const extPreferences = {
+      executionMode: 'sequential',
+      parallelVisibilityMode: 'foreground-tab',
+    };
+
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') { ext.send(JSON.stringify({ method: 'pong' })); return; }
+      if (msg.id !== undefined && msg.method === 'getAgentPreferences') {
+        ext.send(JSON.stringify({ id: msg.id, result: extPreferences }));
+      }
+    });
+
+    await sleep(50);
+
+    const { status, body } = await httpGet(`http://127.0.0.1:${port}/agent-preferences`);
+    assert.equal(status, 200);
+    assert.deepEqual(body, extPreferences);
+
+    ext.close();
+    await sleep(100);
+  });
+
+  it('normalizes rotate-visible to foreground-tab', async () => {
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') { ext.send(JSON.stringify({ method: 'pong' })); return; }
+      if (msg.id !== undefined && msg.method === 'getAgentPreferences') {
+        ext.send(JSON.stringify({
+          id: msg.id,
+          result: {
+            executionMode: 'parallel',
+            parallelVisibilityMode: 'rotate-visible',
+          },
+        }));
+      }
+    });
+
+    await sleep(50);
+
+    const { status, body } = await httpGet(`http://127.0.0.1:${port}/agent-preferences`);
+    assert.equal(status, 200);
+    assert.deepEqual(body, {
+      executionMode: 'parallel',
+      parallelVisibilityMode: 'foreground-tab',
+    });
+
+    ext.close();
+    await sleep(100);
+  });
+});
+
 // ─── manualTabAttached Handler ───────────────────────────────────────────────
 
 describe('manualTabAttached handler', () => {

From ba1a4b07033f7584ae4b3d83f6d491398b1bbff8 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 22:54:27 +0530
Subject: [PATCH 078/192] feat(mcp): cache agent execution preferences per
 session and expose in execute context

---
 mcp/src/exec-engine.js     | 15 +++++++++++-
 mcp/src/index.js           | 40 ++++++++++++++++++++++++++++++-
 mcp/test/mcp-tools.test.js | 48 ++++++++++++++++++++++++++++++++++++++
 3 files changed, 101 insertions(+), 2 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index b593861..a18c3c7 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -470,7 +470,14 @@ export class CodeExecutionTimeoutError extends Error {
 
 // buildExecContext takes userState and optional console helpers as params
 // instead of referencing module-level singletons.
-export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {}, pluginHelpers = {}) {
+export function buildExecContext(
+  defaultPage,
+  ctx,
+  userState,
+  consoleHelpers = {},
+  pluginHelpers = {},
+  agentPreferences = {},
+) {
   const { consoleLogs, setupConsoleCapture } = consoleHelpers;
   const lastSnapshots = userState.__lastSnapshots || (userState.__lastSnapshots = new WeakMap());
   const lastRefToLocator = userState.__lastRefToLocator || (userState.__lastRefToLocator = new WeakMap());
@@ -573,6 +580,11 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
 
   const pageMarkdown = (opts) => getPageMarkdown(activePage(), opts);
 
+  const browserforceSettings = {
+    executionMode: agentPreferences?.executionMode === 'sequential' ? 'sequential' : 'parallel',
+    parallelVisibilityMode: 'foreground-tab',
+  };
+
   // Wrap plugin helpers to auto-inject (page, ctx, state) as first three args
   const wrappedPluginHelpers = {};
   for (const [name, fn] of Object.entries(pluginHelpers)) {
@@ -585,6 +597,7 @@ export function buildExecContext(defaultPage, ctx, userState, consoleHelpers = {
 
   return {
     ...wrappedPluginHelpers,           // plugin helpers spread first — built-ins always win
+    browserforceSettings,
     page: defaultPage, context: ctx, state: userState,
     snapshot, refToLocator, waitForPageLoad, getLogs, clearLogs, getCDPSession,
     screenshotWithAccessibilityLabels, cleanHTML, pageMarkdown,
diff --git a/mcp/src/index.js b/mcp/src/index.js
index 89cc9d6..fe5ad80 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -181,6 +181,39 @@ function getPages() {
 // ─── Persistent State ────────────────────────────────────────────────────────
 
 let userState = {};
+const DEFAULT_AGENT_PREFERENCES = Object.freeze({
+  executionMode: 'parallel',
+  parallelVisibilityMode: 'foreground-tab',
+});
+let cachedAgentPreferences = null;
+
+function normalizeAgentPreferences(raw) {
+  const executionMode = raw?.executionMode === 'sequential' ? 'sequential' : 'parallel';
+  // Keep behavior locked to visible tabs in the current window.
+  const parallelVisibilityMode = 'foreground-tab';
+  return { executionMode, parallelVisibilityMode };
+}
+
+async function getAgentPreferencesForSession() {
+  if (cachedAgentPreferences) {
+    return cachedAgentPreferences;
+  }
+
+  try {
+    const response = await fetch(`${getRelayHttpUrl()}/agent-preferences`, {
+      signal: AbortSignal.timeout(2000),
+    });
+    if (!response.ok) {
+      throw new Error(`HTTP ${response.status}`);
+    }
+    const raw = await response.json();
+    cachedAgentPreferences = normalizeAgentPreferences(raw);
+    return cachedAgentPreferences;
+  } catch {
+    cachedAgentPreferences = { ...DEFAULT_AGENT_PREFERENCES };
+    return cachedAgentPreferences;
+  }
+}
 
 // ─── Plugin State ────────────────────────────────────────────────────────────
 
@@ -211,6 +244,8 @@ Variables:
   page        Default page (first tab in context — shared, avoid navigating it)
   context     Browser context — access all pages via context.pages()
   state       Persistent object across calls (cleared on reset). Store your working page here.
+  browserforceSettings Session defaults loaded once per MCP session (refresh on reset).
+                      Keys: executionMode, parallelVisibilityMode.
 
 Helpers:
   snapshot({ selector?, search?, showDiffSinceLastCall? })   Accessibility tree as text. 10-100x cheaper than screenshots.
@@ -378,6 +413,7 @@ snapshot vs cleanHTML vs pageMarkdown:
 ═══ BROWSERFORCE TAB SWARMS // PARALLEL TABS PROCESSING ═══
 
 Parallel-first policy for independent extraction:
+  Read browserforceSettings.executionMode before choosing swarm strategy. Settings are session defaults.
   1) For count/list/extraction across independent pages, dates, or items, start with parallel tabs first.
   2) Use Promise.all with a concurrency cap (typically 3-8; start at 5 unless site limits are known).
   3) Keep swarm runs read-only and isolated to agent-created tabs (no checkout/purchase/send/delete/profile changes).
@@ -521,6 +557,7 @@ function registerExecuteTool(skillAppendix = '') {
     async ({ code, timeout = 30000 }) => {
       await ensureBrowser();
       ensureAllPagesCapture();
+      const agentPreferences = await getAgentPreferencesForSession();
       const ctx = getContext();
       const pages = ctx.pages();
       const page = pages[0] || null;
@@ -528,7 +565,7 @@ function registerExecuteTool(skillAppendix = '') {
       if (page) setupConsoleCapture(page);
       const execCtx = buildExecContext(page, ctx, userState, {
         consoleLogs, setupConsoleCapture,
-      }, pluginHelpers);
+      }, pluginHelpers, agentPreferences);
       try {
         const result = await runCode(code, execCtx, timeout);
         const formatted = formatResult(result);
@@ -561,6 +598,7 @@ server.tool(
     }
     browser = null;
     userState = {};
+    cachedAgentPreferences = null;
     contextListenerAttached = false;
     consoleLogs.clear();
     try {
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 4c30a0a..9408a87 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -232,6 +232,54 @@ describe('Tool Definitions', () => {
       'snapshot diff mode should only run for full-page snapshots with no search'
     );
   });
+
+  it('execute context includes browserforceSettings', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/exec-engine.js'),
+      'utf8'
+    );
+
+    assert.ok(
+      source.includes('browserforceSettings'),
+      'exec context should expose browserforceSettings in the sandbox scope'
+    );
+    assert.ok(
+      source.includes('executionMode') && source.includes('parallelVisibilityMode'),
+      'browserforceSettings should include executionMode and parallelVisibilityMode'
+    );
+  });
+
+  it('MCP preferences fetch is cached once per session', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
+      'utf8'
+    );
+
+    assert.ok(source.includes('cachedAgentPreferences'), 'should track cached agent preferences');
+    assert.ok(
+      source.includes('if (cachedAgentPreferences)'),
+      'should return cached preferences without refetching'
+    );
+    assert.ok(
+      source.includes('/agent-preferences'),
+      'should fetch preferences from relay /agent-preferences endpoint'
+    );
+  });
+
+  it('reset clears cached preferences', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
+      'utf8'
+    );
+
+    const resetIdx = source.indexOf("'reset'");
+    assert.ok(resetIdx !== -1, 'reset tool should exist');
+    const resetBlock = source.slice(resetIdx, resetIdx + 2500);
+    assert.ok(
+      resetBlock.includes('cachedAgentPreferences = null'),
+      'reset should clear cached agent preferences'
+    );
+  });
 });
 
 // ─── MCP Response Format ─────────────────────────────────────────────────────

From 2c73052a455922b45ea19a34fea61b8062783286 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 25 Feb 2026 22:55:10 +0530
Subject: [PATCH 079/192] docs: document execution mode and visible parallel
 tab settings

---
 GUIDE.md  | 21 +++++++++++++++++++++
 README.md | 12 ++++++++++++
 2 files changed, 33 insertions(+)

diff --git a/GUIDE.md b/GUIDE.md
index f87bf45..1e3c59b 100644
--- a/GUIDE.md
+++ b/GUIDE.md
@@ -59,6 +59,27 @@ Result: the agent can operate only in your approved set.
 
 Use both in long-running sessions to limit drift and memory growth.
 
+## Execution Strategy Settings
+
+Popup settings include:
+
+- `executionMode`: `parallel` or `sequential`
+- `parallelVisibilityMode`: `foreground-tab` or `rotate-visible`
+
+Current behavior lock:
+
+- Agent-created tabs stay visible in the current window (`foreground-tab` behavior).
+- No new windows are created for parallel workers.
+- `rotate-visible` is treated as `foreground-tab` in this release.
+
+MCP reads these preferences once per session and caches them. If you change popup settings mid-session, call `reset` so new execute calls pick up updated values.
+
+### Operational examples
+
+- Visible parallel in current window: `executionMode=parallel`, `parallelVisibilityMode=foreground-tab`
+- Sequential low-detection run: `executionMode=sequential`
+- Rotate-visible demo toggle: `parallelVisibilityMode=rotate-visible` (currently normalized to `foreground-tab`)
+
 ## BrowserForce Tab Swarms // Parallel Tabs Processing
 
 This is the operating policy for independent read-only extraction at scale.
diff --git a/README.md b/README.md
index d57cdfd..2b2ef55 100644
--- a/README.md
+++ b/README.md
@@ -717,6 +717,8 @@ Click the extension icon to configure restrictions. Your browser, your rules:
 | Setting                 | What it does                                                             |
 | ----------------------- | ------------------------------------------------------------------------ |
 | **Auto / Manual mode**  | Let the agent create tabs freely, or hand-pick which tabs it can access  |
+| **Execution mode**      | `parallel` for independent work, `sequential` for one-at-a-time workflows |
+| **Parallel visibility** | `foreground-tab` keeps new tabs visible in the current window             |
 | **Lock URL**            | Prevent the agent from navigating away from the current page             |
 | **No new tabs**         | Block the agent from opening new tabs                                    |
 | **Read-only**           | Observe only — no clicks, no typing, no interactions                     |
@@ -724,6 +726,16 @@ Click the extension icon to configure restrictions. Your browser, your rules:
 | **Auto-close**          | Automatically close agent-created tabs after 5-60 minutes                |
 | **Custom instructions** | Pass text instructions to the agent (e.g. "don't click any buy buttons") |
 
+`parallelVisibilityMode` is currently enforced as `foreground-tab` (visible tabs in the active window, no new windows). If `rotate-visible` is selected, BrowserForce normalizes to `foreground-tab` in this release.
+
+### Execution Strategy Preferences
+
+- **Visible parallel with current-window tabs (`foreground-tab`)**: New agent tabs open visibly in your current Chrome window and stay there.
+- **Sequential mode (`executionMode = sequential`)**: Useful for lower-noise, step-by-step workflows on sensitive sites.
+- **Rotate-visible demo mode (`rotate-visible`)**: Temporarily normalized to `foreground-tab` while the visibility lock is enforced.
+
+MCP reads `executionMode` and `parallelVisibilityMode` once per MCP session and caches them. If you change popup settings mid-session, call `reset` to refresh settings for new execute calls.
+
 
 ### Controlled Tab Workflows
 

From 9ed95471ec190e35314469459b786f1350744287 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 00:18:15 +0530
Subject: [PATCH 080/192] chore: bump version to 1.0.15 in package.json

---
 package.json | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/package.json b/package.json
index 43eb39f..9d46e82 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce",
-  "version": "1.0.14",
+  "version": "1.0.15",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",

From abe8abfcd63d44dcc12a97c85e5b4582709e38f7 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:05:31 +0530
Subject: [PATCH 081/192] extension: show MCP client count and auto-mode border

---
 extension/background.js | 41 +++++++++++++++++++++++++++++++++++++++--
 extension/popup.css     | 20 ++++++++++++++++++++
 extension/popup.html    |  9 ++++++---
 extension/popup.js      | 15 +++++++++++++++
 4 files changed, 80 insertions(+), 5 deletions(-)

diff --git a/extension/background.js b/extension/background.js
index 721236d..9f1e1c7 100644
--- a/extension/background.js
+++ b/extension/background.js
@@ -4,12 +4,14 @@
 const RELAY_URL_DEFAULT = 'ws://127.0.0.1:19222/extension';
 const RECONNECT_DELAY_MS = 3000;
 const CDP_VERSION = '1.3';
+const RELAY_HTTP_DEFAULT = 'http://127.0.0.1:19222';
 
 // ─── State ───────────────────────────────────────────────────────────────────
 
 let ws = null;
 let connectionState = 'disconnected'; // disconnected | connecting | connected
 let maintainLoopActive = false;
+let currentRelayUrl = RELAY_URL_DEFAULT;
 
 /** @type {Map<number, { sessionId: string, targetId: string, targetInfo: object }>} */
 const attachedTabs = new Map();
@@ -35,6 +37,7 @@ let restrictionExplained = false;
 (async function init() {
   const stored = await chrome.storage.local.get(['relayUrl']);
   const relayUrl = stored.relayUrl || RELAY_URL_DEFAULT;
+  currentRelayUrl = relayUrl;
 
   // Register debugger listeners once (persists across reconnections)
   chrome.debugger.onEvent.addListener(onDebuggerEvent);
@@ -610,6 +613,10 @@ async function checkInactiveTabs() {
 }
 
 chrome.storage.onChanged.addListener(async (changes) => {
+  if (changes.relayUrl) {
+    currentRelayUrl = changes.relayUrl.newValue || RELAY_URL_DEFAULT;
+  }
+
   if (changes.autoDetachMinutes || changes.autoCloseMinutes) {
     const settings = await chrome.storage.local.get(['autoDetachMinutes', 'autoCloseMinutes']);
     const anyEnabled = (settings.autoDetachMinutes || 0) > 0 || (settings.autoCloseMinutes || 0) > 0;
@@ -725,6 +732,29 @@ function sleep(ms) {
   return new Promise((resolve) => setTimeout(resolve, ms));
 }
 
+function relayWsToHttpBase(wsUrl) {
+  try {
+    const parsed = new URL(wsUrl || RELAY_URL_DEFAULT);
+    const protocol = parsed.protocol === 'wss:' ? 'https:' : 'http:';
+    return `${protocol}//${parsed.host}`;
+  } catch {
+    return RELAY_HTTP_DEFAULT;
+  }
+}
+
+async function getMcpClientCount() {
+  if (connectionState !== 'connected') return 0;
+  const base = relayWsToHttpBase(currentRelayUrl);
+  try {
+    const response = await fetch(`${base}/client-slot`, { method: 'GET', cache: 'no-store' });
+    if (!response.ok) return 0;
+    const data = await response.json();
+    return Number.isFinite(data?.clients) ? data.clients : 0;
+  } catch {
+    return 0;
+  }
+}
+
 // ─── Popup Message Handler ───────────────────────────────────────────────────
 
 chrome.runtime.onMessage.addListener((msg, _sender, sendResponse) => {
@@ -741,7 +771,7 @@ chrome.runtime.onMessage.addListener((msg, _sender, sendResponse) => {
 
     // Compute seconds until next auto-action (detach or close)
     let nextAutoActionSecs = null;
-    chrome.storage.local.get(['autoDetachMinutes', 'autoCloseMinutes'], (settings) => {
+    chrome.storage.local.get(['autoDetachMinutes', 'autoCloseMinutes', 'mode'], async (settings) => {
       const detachMs = (settings.autoDetachMinutes || 0) * 60_000;
       const closeMs = (settings.autoCloseMinutes || 0) * 60_000;
       if ((detachMs || closeMs) && tabLastActivity.size > 0) {
@@ -757,7 +787,14 @@ chrome.runtime.onMessage.addListener((msg, _sender, sendResponse) => {
           nextAutoActionSecs = Math.max(0, Math.ceil(earliest / 1000));
         }
       }
-      sendResponse({ connectionState, tabs, nextAutoActionSecs });
+      const mcpClientCount = await getMcpClientCount();
+      sendResponse({
+        connectionState,
+        tabs,
+        nextAutoActionSecs,
+        mode: settings.mode || 'auto',
+        mcpClientCount,
+      });
     });
     return true; // async sendResponse
   }
diff --git a/extension/popup.css b/extension/popup.css
index 512f6f3..d2e3b28 100644
--- a/extension/popup.css
+++ b/extension/popup.css
@@ -16,6 +16,11 @@ body {
   padding: 16px;
 }
 
+.bf-popup.auto-mode {
+  border: 2px dotted #d32f2f;
+  border-radius: 10px;
+}
+
 header {
   display: flex;
   align-items: center;
@@ -23,6 +28,12 @@ header {
   margin-bottom: 12px;
 }
 
+.header-right {
+  display: flex;
+  align-items: center;
+  gap: 8px;
+}
+
 h1 {
   font-size: 14px;
   font-weight: 600;
@@ -47,6 +58,15 @@ h1 {
 .status.connecting .dot { background: #ff9800; animation: pulse 1s infinite; }
 .status.disconnected .dot { background: #9e9e9e; }
 
+.mcp-count {
+  font-size: 11px;
+  font-weight: 600;
+  color: #4a4a4a;
+  background: #f1f1f1;
+  border-radius: 10px;
+  padding: 2px 8px;
+}
+
 @keyframes pulse {
   0%, 100% { opacity: 1; }
   50% { opacity: 0.4; }
diff --git a/extension/popup.html b/extension/popup.html
index e6e5c64..3deedb5 100644
--- a/extension/popup.html
+++ b/extension/popup.html
@@ -9,9 +9,12 @@
   <div class="bf-popup">
     <header>
       <h1>BrowserForce</h1>
-      <div id="bf-status" class="status disconnected">
-        <span class="dot"></span>
-        <span id="bf-status-text">Disconnected</span>
+      <div class="header-right">
+        <div id="bf-status" class="status disconnected">
+          <span class="dot"></span>
+          <span id="bf-status-text">Disconnected</span>
+        </div>
+        <span id="bf-mcp-clients" class="mcp-count">MCP 0</span>
       </div>
     </header>
 
diff --git a/extension/popup.js b/extension/popup.js
index be61438..8ba9ff1 100644
--- a/extension/popup.js
+++ b/extension/popup.js
@@ -13,6 +13,8 @@ const RESTRICTION_LINES = {
 
 const statusEl = document.getElementById('bf-status');
 const statusTextEl = document.getElementById('bf-status-text');
+const mcpClientsEl = document.getElementById('bf-mcp-clients');
+const popupEl = document.querySelector('.bf-popup');
 const relayUrlInput = document.getElementById('bf-relay-url');
 const saveUrlBtn = document.getElementById('bf-save-url');
 const tabCountEl = document.getElementById('bf-tab-count');
@@ -60,6 +62,7 @@ chrome.storage.local.get(SETTINGS_KEYS, (s) => {
   noNewTabsCb.checked = !!s.noNewTabs;
   readOnlyCb.checked = !!s.readOnly;
   instructionsEl.value = s.userInstructions || '';
+  setAutoModeBorder(s.mode || 'auto');
 });
 
 // --- Save Handlers ---
@@ -75,6 +78,7 @@ saveUrlBtn.addEventListener('click', () => {
 
 modeSelect.addEventListener('change', () => {
   chrome.storage.local.set({ mode: modeSelect.value });
+  setAutoModeBorder(modeSelect.value);
 });
 
 executionModeSelect.addEventListener('change', () => {
@@ -178,6 +182,8 @@ function refreshStatus() {
     setStatus(response.connectionState, response.connectionState);
     setTabs(response.tabs || []);
     setAutoTimer(response.nextAutoActionSecs);
+    setMcpClientCount(response.mcpClientCount);
+    setAutoModeBorder(response.mode || modeSelect.value || 'auto');
   });
 }
 
@@ -228,6 +234,15 @@ function setAutoTimer(secs) {
   autoTimerEl.textContent = `${m}:${String(s).padStart(2, '0')}`;
 }
 
+function setMcpClientCount(count) {
+  const safeCount = Number.isFinite(count) ? count : 0;
+  mcpClientsEl.textContent = `MCP ${safeCount}`;
+}
+
+function setAutoModeBorder(mode) {
+  popupEl.classList.toggle('auto-mode', mode === 'auto');
+}
+
 function escapeHtml(str) {
   const div = document.createElement('div');
   div.textContent = str;

From e137a9091d1e11048ea6bdb892998764642ea652 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:05:39 +0530
Subject: [PATCH 082/192] relay: include client count in client-slot status

---
 relay/src/index.js              | 1 +
 relay/test/relay-server.test.js | 3 +++
 2 files changed, 4 insertions(+)

diff --git a/relay/src/index.js b/relay/src/index.js
index b198eb9..33179d4 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -301,6 +301,7 @@ class RelayServer {
         busy,
         activeClientId: busy ? this.activeClient.id : null,
         connectedAt: busy ? this.activeClient.connectedAt : null,
+        clients: this.clients.size,
       }));
       return;
     }
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index 18c0e55..c225871 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -708,6 +708,7 @@ describe('WebSocket Security', () => {
         busy: false,
         activeClientId: null,
         connectedAt: null,
+        clients: 0,
       });
 
       activeClient = await connectWs(`ws://127.0.0.1:${singleRelay.port}/cdp?token=${singleRelay.authToken}`);
@@ -718,6 +719,7 @@ describe('WebSocket Security', () => {
       assert.equal(during.body.busy, true);
       assert.equal(typeof during.body.activeClientId, 'string');
       assert.equal(typeof during.body.connectedAt, 'number');
+      assert.equal(during.body.clients, 1);
 
       const activeClosed = new Promise((resolve) => activeClient.once('close', resolve));
       activeClient.close();
@@ -734,6 +736,7 @@ describe('WebSocket Security', () => {
         busy: false,
         activeClientId: null,
         connectedAt: null,
+        clients: 0,
       });
     } finally {
       if (activeClient && activeClient.readyState === WebSocket.OPEN) activeClient.close();

From c4eb34786188057deca7105e4a0ef13b6866b753 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:12:54 +0530
Subject: [PATCH 083/192] feat: add openclaw setup primitives and autostart
 installers

---
 mcp/src/openclaw-setup.js       | 334 ++++++++++++++++++++
 mcp/test/openclaw-setup.test.js | 530 ++++++++++++++++++++++++++++++++
 2 files changed, 864 insertions(+)
 create mode 100644 mcp/src/openclaw-setup.js
 create mode 100644 mcp/test/openclaw-setup.test.js

diff --git a/mcp/src/openclaw-setup.js b/mcp/src/openclaw-setup.js
new file mode 100644
index 0000000..156e921
--- /dev/null
+++ b/mcp/src/openclaw-setup.js
@@ -0,0 +1,334 @@
+import { spawnSync } from 'node:child_process';
+import fs from 'node:fs/promises';
+import path from 'node:path';
+
+const RELAY_PORT = 19222;
+const DARWIN_LAUNCH_AGENT_LABEL = 'ai.browserforce.relay';
+const LINUX_SYSTEMD_USER_SERVICE = 'browserforce-relay.service';
+const WIN32_TASK_NAME = 'BrowserForceRelay';
+
+function shellQuote(value) {
+  return `'${String(value).replace(/'/g, `'\\''`)}'`;
+}
+
+function posixPathJoin(left, right) {
+  return `${String(left).replace(/\/+$/, '')}/${String(right).replace(/^\/+/, '')}`;
+}
+
+function windowsCommandArg(value) {
+  return String(value)
+    .replace(/"/g, '""')
+    .replace(/%/g, '%%')
+    .replace(/[&<>|^]/g, '^$&');
+}
+
+function windowsTaskQuotedArg(value) {
+  return `"${windowsCommandArg(value)}"`;
+}
+
+function windowsTaskEscapeForTr(value) {
+  return String(value).replace(/"/g, '""');
+}
+
+function xmlEscape(value) {
+  return String(value).replace(/[&<>"']/g, (ch) => {
+    if (ch === '&') return '&amp;';
+    if (ch === '<') return '&lt;';
+    if (ch === '>') return '&gt;';
+    if (ch === '"') return '&quot;';
+    return '&apos;';
+  });
+}
+
+export function renderLaunchAgentPlist({ label, nodePath, binScriptPath }) {
+  const escapedLabel = xmlEscape(label);
+  const escapedNodePath = xmlEscape(nodePath);
+  const escapedBinScriptPath = xmlEscape(binScriptPath);
+
+  return [
+    '<?xml version="1.0" encoding="UTF-8"?>',
+    '<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">',
+    '<plist version="1.0">',
+    '<dict>',
+    '  <key>Label</key>',
+    `  <string>${escapedLabel}</string>`,
+    '  <key>ProgramArguments</key>',
+    '  <array>',
+    `    <string>${escapedNodePath}</string>`,
+    `    <string>${escapedBinScriptPath}</string>`,
+    '    <string>serve</string>',
+    '  </array>',
+    '  <key>RunAtLoad</key>',
+    '  <true/>',
+    '  <key>KeepAlive</key>',
+    '  <true/>',
+    '</dict>',
+    '</plist>',
+    '',
+  ].join('\n');
+}
+
+export function renderSystemdUserService({ nodePath, binScriptPath }) {
+  return [
+    '[Unit]',
+    'Description=BrowserForce Relay',
+    'After=network.target',
+    '',
+    '[Service]',
+    'Type=simple',
+    `ExecStart="${nodePath}" "${binScriptPath}" serve`,
+    'Restart=always',
+    'RestartSec=2',
+    '',
+    '[Install]',
+    'WantedBy=default.target',
+    '',
+  ].join('\n');
+}
+
+export function buildAutostartSpec({ platform, homeDir, nodePath, binScriptPath }) {
+  const activePlatform = platform || process.platform;
+
+  if (activePlatform === 'darwin') {
+    const plistPath = posixPathJoin(
+      posixPathJoin(homeDir, 'Library/LaunchAgents'),
+      `${DARWIN_LAUNCH_AGENT_LABEL}.plist`,
+    );
+    const programArguments = [nodePath, binScriptPath, 'serve'];
+    const plist = renderLaunchAgentPlist({
+      label: DARWIN_LAUNCH_AGENT_LABEL,
+      nodePath,
+      binScriptPath,
+    });
+
+    return {
+      platform: activePlatform,
+      filesToWrite: [
+        {
+          path: plistPath,
+          content: plist,
+        },
+      ],
+      commands: [
+        `launchctl unload ${shellQuote(plistPath)} >/dev/null 2>&1 || true`,
+        `launchctl load -w ${shellQuote(plistPath)}`,
+      ],
+      summary: `Install launchd agent ${DARWIN_LAUNCH_AGENT_LABEL}`,
+      launchAgent: {
+        label: DARWIN_LAUNCH_AGENT_LABEL,
+        plistPath,
+        programArguments,
+      },
+    };
+  }
+
+  if (activePlatform === 'linux') {
+    const servicePath = posixPathJoin(
+      posixPathJoin(homeDir, '.config/systemd/user'),
+      LINUX_SYSTEMD_USER_SERVICE,
+    );
+    const serviceContents = renderSystemdUserService({ nodePath, binScriptPath });
+
+    return {
+      platform: activePlatform,
+      filesToWrite: [
+        {
+          path: servicePath,
+          content: serviceContents,
+        },
+      ],
+      commands: [
+        'systemctl --user daemon-reload',
+        `systemctl --user enable --now ${LINUX_SYSTEMD_USER_SERVICE}`,
+      ],
+      summary: `Install systemd user service ${LINUX_SYSTEMD_USER_SERVICE}`,
+      systemd: {
+        serviceName: LINUX_SYSTEMD_USER_SERVICE,
+        servicePath,
+      },
+    };
+  }
+
+  if (activePlatform === 'win32') {
+    const commandToRun = `${windowsTaskQuotedArg(nodePath)} ${windowsTaskQuotedArg(binScriptPath)} serve`;
+    const createCommand = `schtasks /Create /F /TN "${WIN32_TASK_NAME}" /SC ONLOGON /TR "${windowsTaskEscapeForTr(commandToRun)}"`;
+
+    return {
+      platform: activePlatform,
+      filesToWrite: [],
+      commands: [createCommand],
+      summary: `Install scheduled task ${WIN32_TASK_NAME}`,
+      scheduledTask: {
+        taskName: WIN32_TASK_NAME,
+        createCommand,
+        commandToRun,
+      },
+    };
+  }
+
+  throw new Error(`Unsupported platform: ${activePlatform}`);
+}
+
+export function buildBrowserforceMcpServerEntry({ platform = process.platform } = {}) {
+  if (platform === 'win32') {
+    const command = [
+      `if (-not (netstat -ano | Select-String ':${RELAY_PORT}\\s+.*LISTENING')) {`,
+      "Start-Process -WindowStyle Hidden -FilePath 'npx' -ArgumentList '-y','browserforce@latest','serve'",
+      '}',
+      '& npx -y browserforce@latest mcp',
+    ].join(' ');
+
+    return {
+      name: 'browserforce',
+      transport: 'stdio',
+      command: 'powershell',
+      args: ['-NoProfile', '-NonInteractive', '-Command', command],
+    };
+  }
+
+  const command = [
+    `if ! lsof -tiTCP:${RELAY_PORT} -sTCP:LISTEN >/dev/null 2>&1; then`,
+    'npx -y browserforce@latest serve >/dev/null 2>&1 &',
+    'fi;',
+    'exec npx -y browserforce@latest mcp',
+  ].join(' ');
+
+  return {
+    name: 'browserforce',
+    transport: 'stdio',
+    command: 'sh',
+    args: ['-lc', command],
+  };
+}
+
+function asObject(value) {
+  return value && typeof value === 'object' && !Array.isArray(value) ? value : {};
+}
+
+function ensureMcpAdapterOnce(allowList) {
+  const values = Array.isArray(allowList) ? allowList : [];
+  const filtered = values.filter((value) => value !== 'mcp-adapter');
+  return [...filtered, 'mcp-adapter'];
+}
+
+function mergeServers(existingServers, { platform = process.platform } = {}) {
+  const values = Array.isArray(existingServers) ? existingServers : [];
+  const browserforceEntry = buildBrowserforceMcpServerEntry({ platform });
+  const merged = [];
+  let inserted = false;
+
+  for (const value of values) {
+    const isBrowserforce =
+      value &&
+      typeof value === 'object' &&
+      !Array.isArray(value) &&
+      value.name === 'browserforce';
+
+    if (!isBrowserforce) {
+      merged.push(value);
+      continue;
+    }
+
+    if (!inserted) {
+      merged.push(browserforceEntry);
+      inserted = true;
+    }
+  }
+
+  if (!inserted) {
+    merged.push(browserforceEntry);
+  }
+
+  return merged;
+}
+
+export function mergeOpenClawConfig(existingConfig, { platform = process.platform } = {}) {
+  const root = asObject(existingConfig);
+
+  const plugins = asObject(root.plugins);
+  const entries = asObject(plugins.entries);
+  const mcpAdapter = asObject(entries['mcp-adapter']);
+  const mcpAdapterConfig = asObject(mcpAdapter.config);
+
+  const tools = asObject(root.tools);
+  const sandbox = asObject(tools.sandbox);
+  const sandboxTools = asObject(sandbox.tools);
+
+  return {
+    ...root,
+    plugins: {
+      ...plugins,
+      entries: {
+        ...entries,
+        'mcp-adapter': {
+          ...mcpAdapter,
+          enabled: true,
+          config: {
+            ...mcpAdapterConfig,
+            servers: mergeServers(mcpAdapterConfig.servers, { platform }),
+          },
+        },
+      },
+    },
+    tools: {
+      ...tools,
+      sandbox: {
+        ...sandbox,
+        tools: {
+          ...sandboxTools,
+          allow: ensureMcpAdapterOnce(sandboxTools.allow),
+        },
+      },
+    },
+  };
+}
+
+export function formatJsonStable(obj) {
+  return `${JSON.stringify(obj, null, 2)}\n`;
+}
+
+function defaultExecFn(command) {
+  const result = spawnSync(command, {
+    shell: true,
+    stdio: 'inherit',
+  });
+
+  if (result.error) {
+    throw result.error;
+  }
+
+  if (typeof result.status === 'number' && result.status !== 0) {
+    throw new Error(`Command failed with exit code ${result.status}: ${command}`);
+  }
+
+  if (result.status === null) {
+    throw new Error(`Command terminated unexpectedly: ${command}`);
+  }
+}
+
+export async function applyAutostart(spec, { dryRun = false, execFn = defaultExecFn, fsApi = fs } = {}) {
+  const filesToWrite = Array.isArray(spec?.filesToWrite) ? spec.filesToWrite : [];
+  const commands = Array.isArray(spec?.commands) ? spec.commands : [];
+  const report = {
+    wroteFiles: filesToWrite.map((file) => file.path),
+    ranCommands: [],
+    skippedCommands: dryRun ? [...commands] : [],
+  };
+
+  if (dryRun) {
+    return report;
+  }
+
+  for (const file of filesToWrite) {
+    const parentDir = path.dirname(file.path);
+    await fsApi.mkdir(parentDir, { recursive: true });
+    await fsApi.writeFile(file.path, file.content, 'utf8');
+  }
+
+  for (const command of commands) {
+    await execFn(command);
+    report.ranCommands.push(command);
+  }
+
+  return report;
+}
diff --git a/mcp/test/openclaw-setup.test.js b/mcp/test/openclaw-setup.test.js
new file mode 100644
index 0000000..a7801af
--- /dev/null
+++ b/mcp/test/openclaw-setup.test.js
@@ -0,0 +1,530 @@
+import { test } from 'node:test';
+import assert from 'node:assert/strict';
+
+import {
+  applyAutostart,
+  buildBrowserforceMcpServerEntry,
+  buildAutostartSpec,
+  formatJsonStable,
+  mergeOpenClawConfig,
+  renderLaunchAgentPlist,
+  renderSystemdUserService,
+} from '../src/openclaw-setup.js';
+
+function quoteShellArg(value) {
+  const stringValue = String(value);
+  if (process.platform === 'win32') {
+    return `"${stringValue.replace(/"/g, '""')}"`;
+  }
+  return `'${stringValue.replace(/'/g, `'\\''`)}'`;
+}
+
+function buildNodeEvalCommand(source) {
+  return `${quoteShellArg(process.execPath)} -e ${quoteShellArg(source)}`;
+}
+
+test('buildBrowserforceMcpServerEntry returns stdio sh wrapper with relay autostart on POSIX', () => {
+  const entry = buildBrowserforceMcpServerEntry({ platform: 'linux' });
+
+  assert.equal(entry.transport, 'stdio');
+  assert.equal(entry.command, 'sh');
+  assert.equal(entry.args[0], '-lc');
+  assert.match(entry.args[1], /if ! lsof -tiTCP:19222 -sTCP:LISTEN/);
+  assert.match(entry.args[1], /npx -y browserforce@latest serve/);
+  assert.match(entry.args[1], /exec npx -y browserforce@latest mcp/);
+});
+
+test('buildBrowserforceMcpServerEntry returns win32-safe powershell wrapper', () => {
+  const entry = buildBrowserforceMcpServerEntry({ platform: 'win32' });
+
+  assert.equal(entry.transport, 'stdio');
+  assert.equal(entry.command, 'powershell');
+  assert.deepEqual(entry.args.slice(0, 3), ['-NoProfile', '-NonInteractive', '-Command']);
+  assert.match(entry.args[3], /netstat -ano/);
+  assert.match(entry.args[3], /browserforce@latest','serve/);
+  assert.match(entry.args[3], /& npx -y browserforce@latest mcp/);
+});
+
+test('mergeOpenClawConfig adds and enables plugins.entries["mcp-adapter"]', () => {
+  const merged = mergeOpenClawConfig({
+    plugins: {
+      entries: {
+        'mcp-adapter': {
+          enabled: false,
+        },
+      },
+    },
+  });
+
+  assert.equal(merged.plugins.entries['mcp-adapter'].enabled, true);
+});
+
+test('mergeOpenClawConfig preserves unrelated keys', () => {
+  const existing = {
+    ui: {
+      theme: 'light',
+    },
+    plugins: {
+      entries: {
+        other: {
+          enabled: false,
+          config: { foo: 1 },
+        },
+      },
+    },
+    tools: {
+      sandbox: {
+        tools: {
+          allow: ['shell'],
+          deny: ['network'],
+        },
+      },
+    },
+  };
+
+  const merged = mergeOpenClawConfig(existing);
+
+  assert.equal(merged.ui.theme, 'light');
+  assert.deepEqual(merged.plugins.entries.other, existing.plugins.entries.other);
+  assert.deepEqual(merged.tools.sandbox.tools.deny, ['network']);
+});
+
+test('mergeOpenClawConfig preserves existing non-browserforce servers', () => {
+  const existing = {
+    plugins: {
+      entries: {
+        'mcp-adapter': {
+          enabled: false,
+          config: {
+            timeoutMs: 1000,
+            servers: [
+              {
+                name: 'custom',
+                transport: 'stdio',
+                command: 'node',
+                args: ['custom-mcp.js'],
+              },
+            ],
+          },
+        },
+      },
+    },
+  };
+
+  const merged = mergeOpenClawConfig(existing);
+  const servers = merged.plugins.entries['mcp-adapter'].config.servers;
+
+  assert.equal(merged.plugins.entries['mcp-adapter'].config.timeoutMs, 1000);
+  assert.deepEqual(servers.find((server) => server.name === 'custom'), existing.plugins.entries['mcp-adapter'].config.servers[0]);
+  assert.equal(servers.filter((server) => server.name === 'browserforce').length, 1);
+});
+
+test('mergeOpenClawConfig updates browserforce server entry once without duplicates', () => {
+  const existing = {
+    plugins: {
+      entries: {
+        'mcp-adapter': {
+          config: {
+            servers: [
+              { name: 'custom', transport: 'stdio', command: 'node', args: ['custom.js'] },
+              { name: 'browserforce', transport: 'stdio', command: 'node', args: ['old-browserforce.js'] },
+              { name: 'browserforce', transport: 'stdio', command: 'node', args: ['stale-browserforce.js'] },
+            ],
+          },
+        },
+      },
+    },
+  };
+
+  const merged = mergeOpenClawConfig(existing);
+  const servers = merged.plugins.entries['mcp-adapter'].config.servers;
+  const browserforceServers = servers.filter((server) => server.name === 'browserforce');
+
+  assert.equal(browserforceServers.length, 1);
+  assert.deepEqual(browserforceServers[0], buildBrowserforceMcpServerEntry());
+  assert.deepEqual(servers.find((server) => server.name === 'custom'), existing.plugins.entries['mcp-adapter'].config.servers[0]);
+});
+
+test('mergeOpenClawConfig is idempotent', () => {
+  const first = mergeOpenClawConfig({
+    plugins: { entries: {} },
+    tools: { sandbox: { tools: { allow: ['shell'] } } },
+  });
+  const second = mergeOpenClawConfig(first);
+
+  assert.deepEqual(second, first);
+});
+
+test('mergeOpenClawConfig writes win32 browserforce server entry without sh', () => {
+  const merged = mergeOpenClawConfig(
+    {
+      plugins: {
+        entries: {
+          'mcp-adapter': {
+            config: {
+              servers: [],
+            },
+          },
+        },
+      },
+    },
+    { platform: 'win32' },
+  );
+  const server = merged.plugins.entries['mcp-adapter'].config.servers.find((value) => value.name === 'browserforce');
+
+  assert.equal(server.command, 'powershell');
+  assert.deepEqual(server.args.slice(0, 3), ['-NoProfile', '-NonInteractive', '-Command']);
+});
+
+test('formatJsonStable uses 2-space indentation and trailing newline', () => {
+  const out = formatJsonStable({ a: 1, nested: { b: true } });
+  assert.equal(out, '{\n  "a": 1,\n  "nested": {\n    "b": true\n  }\n}\n');
+});
+
+test('mergeOpenClawConfig ensures tools.sandbox.tools.allow includes mcp-adapter once', () => {
+  const merged = mergeOpenClawConfig({
+    tools: {
+      sandbox: {
+        tools: {
+          allow: ['shell', 'mcp-adapter', 'mcp-adapter'],
+        },
+      },
+    },
+  });
+
+  const allow = merged.tools.sandbox.tools.allow;
+  assert.equal(allow.includes('mcp-adapter'), true);
+  assert.equal(allow.filter((item) => item === 'mcp-adapter').length, 1);
+  assert.equal(allow.includes('shell'), true);
+});
+
+test('buildAutostartSpec returns darwin launch agent spec', () => {
+  const spec = buildAutostartSpec({
+    platform: 'darwin',
+    homeDir: '/Users/alex',
+    nodePath: '/usr/local/bin/node',
+    binScriptPath: '/Users/alex/.npm/_npx/browserforce/bin.js',
+  });
+
+  assert.equal(spec.platform, 'darwin');
+  assert.equal(Array.isArray(spec.filesToWrite), true);
+  assert.equal(spec.filesToWrite.length, 1);
+  assert.equal(spec.filesToWrite[0].path, '/Users/alex/Library/LaunchAgents/ai.browserforce.relay.plist');
+  assert.match(spec.filesToWrite[0].content, /<key>Label<\/key>\n\s*<string>ai\.browserforce\.relay<\/string>/);
+  assert.equal(Array.isArray(spec.commands), true);
+  assert.deepEqual(spec.commands, [
+    "launchctl unload '/Users/alex/Library/LaunchAgents/ai.browserforce.relay.plist' >/dev/null 2>&1 || true",
+    "launchctl load -w '/Users/alex/Library/LaunchAgents/ai.browserforce.relay.plist'",
+  ]);
+  assert.equal(typeof spec.summary, 'string');
+  assert.notEqual(spec.summary.trim(), '');
+  assert.equal(spec.launchAgent.label, 'ai.browserforce.relay');
+  assert.equal(spec.launchAgent.plistPath, '/Users/alex/Library/LaunchAgents/ai.browserforce.relay.plist');
+  assert.match(spec.launchAgent.programArguments.join(' '), /\/usr\/local\/bin\/node .*\/bin\.js serve/);
+});
+
+test('buildAutostartSpec returns linux systemd user service spec', () => {
+  const spec = buildAutostartSpec({
+    platform: 'linux',
+    homeDir: '/home/alex',
+    nodePath: '/usr/bin/node',
+    binScriptPath: '/home/alex/.npm/_npx/browserforce/bin.js',
+  });
+
+  assert.equal(spec.platform, 'linux');
+  assert.equal(Array.isArray(spec.filesToWrite), true);
+  assert.equal(spec.filesToWrite.length, 1);
+  assert.equal(spec.filesToWrite[0].path, '/home/alex/.config/systemd/user/browserforce-relay.service');
+  assert.match(spec.filesToWrite[0].content, /ExecStart="\/usr\/bin\/node" "\/home\/alex\/\.npm\/_npx\/browserforce\/bin\.js" serve/);
+  assert.equal(Array.isArray(spec.commands), true);
+  assert.deepEqual(spec.commands, [
+    'systemctl --user daemon-reload',
+    'systemctl --user enable --now browserforce-relay.service',
+  ]);
+  assert.equal(typeof spec.summary, 'string');
+  assert.notEqual(spec.summary.trim(), '');
+  assert.equal(spec.systemd.servicePath, '/home/alex/.config/systemd/user/browserforce-relay.service');
+  assert.equal(spec.commands.some((command) => command === 'systemctl --user enable --now browserforce-relay.service'), true);
+});
+
+test('buildAutostartSpec returns win32 scheduled task spec', () => {
+  const spec = buildAutostartSpec({
+    platform: 'win32',
+    homeDir: 'C:\\Users\\alex',
+    nodePath: 'C:\\Program Files\\nodejs\\node.exe',
+    binScriptPath: 'C:\\Users\\alex\\AppData\\Roaming\\npm\\node_modules\\browserforce\\bin.js',
+  });
+
+  assert.equal(spec.platform, 'win32');
+  assert.equal(Array.isArray(spec.filesToWrite), true);
+  assert.deepEqual(spec.filesToWrite, []);
+  assert.equal(Array.isArray(spec.commands), true);
+  assert.equal(spec.commands.length, 1);
+  assert.match(spec.commands[0], /schtasks\s+\/Create/);
+  assert.equal(typeof spec.summary, 'string');
+  assert.notEqual(spec.summary.trim(), '');
+  assert.equal(spec.scheduledTask.taskName, 'BrowserForceRelay');
+  assert.match(spec.scheduledTask.createCommand, /schtasks\s+\/Create/);
+  assert.match(spec.scheduledTask.createCommand, /\/SC\s+ONLOGON/);
+});
+
+test('buildAutostartSpec win32 escapes cmd metacharacters and quotes in /TR payload', () => {
+  const spec = buildAutostartSpec({
+    platform: 'win32',
+    homeDir: 'C:\\Users\\alex',
+    nodePath: 'C:\\Program Files\\Tools & Stuff\\100%\\node.exe',
+    binScriptPath: 'C:\\Users\\alex\\AppData\\Roaming\\npm\\b&f%\\bin"odd".js',
+  });
+
+  assert.equal(spec.platform, 'win32');
+  assert.match(spec.scheduledTask.commandToRun, /"\S[\s\S]*"\s+"\S[\s\S]*"\s+serve$/);
+  assert.match(spec.scheduledTask.commandToRun, /\^&/);
+  assert.match(spec.scheduledTask.commandToRun, /%%/);
+  assert.match(spec.scheduledTask.commandToRun, /""odd""/);
+  assert.match(spec.scheduledTask.createCommand, /\/TR\s+"/);
+  assert.match(spec.scheduledTask.createCommand, /\^&/);
+  assert.match(spec.scheduledTask.createCommand, /%%/);
+});
+
+test('buildAutostartSpec throws for unsupported platform', () => {
+  assert.throws(
+    () =>
+      buildAutostartSpec({
+        platform: 'freebsd',
+        homeDir: '/home/alex',
+        nodePath: '/usr/bin/node',
+        binScriptPath: '/home/alex/bin/browserforce',
+      }),
+    /Unsupported platform: freebsd/,
+  );
+});
+
+test('renderLaunchAgentPlist includes expected label, args, and run-at-load keys', () => {
+  const output = renderLaunchAgentPlist({
+    label: 'ai.browserforce.relay',
+    nodePath: '/usr/local/bin/node',
+    binScriptPath: '/Users/alex/.npm/_npx/browserforce/bin.js',
+  });
+
+  assert.match(output, /<key>Label<\/key>\n\s*<string>ai\.browserforce\.relay<\/string>/);
+  assert.match(output, /<key>ProgramArguments<\/key>\n\s*<array>\n\s*<string>\/usr\/local\/bin\/node<\/string>\n\s*<string>\/Users\/alex\/\.npm\/_npx\/browserforce\/bin\.js<\/string>\n\s*<string>serve<\/string>\n\s*<\/array>/);
+  assert.match(output, /<key>RunAtLoad<\/key>\n\s*<true\/>/);
+});
+
+test('renderLaunchAgentPlist escapes XML entities in interpolated values', () => {
+  const output = renderLaunchAgentPlist({
+    label: 'relay & <label> "x" \'y\'',
+    nodePath: '/tmp/&<node>"\'',
+    binScriptPath: '/tmp/&<script>"\'',
+  });
+
+  assert.match(output, /<string>relay &amp; &lt;label&gt; &quot;x&quot; &apos;y&apos;<\/string>/);
+  assert.match(output, /<string>\/tmp\/&amp;&lt;node&gt;&quot;&apos;<\/string>/);
+  assert.match(output, /<string>\/tmp\/&amp;&lt;script&gt;&quot;&apos;<\/string>/);
+});
+
+test('renderSystemdUserService includes unit, service, install sections and expected ExecStart', () => {
+  const output = renderSystemdUserService({
+    nodePath: '/usr/bin/node',
+    binScriptPath: '/home/alex/.npm/_npx/browserforce/bin.js',
+  });
+
+  assert.match(output, /^\[Unit\]/m);
+  assert.match(output, /^\[Service\]/m);
+  assert.match(output, /^\[Install\]/m);
+  assert.match(output, /^ExecStart="\/usr\/bin\/node" "\/home\/alex\/\.npm\/_npx\/browserforce\/bin\.js" serve$/m);
+});
+
+test('applyAutostart dryRun=true runs no commands and returns planned actions', async () => {
+  const spec = {
+    filesToWrite: [
+      {
+        path: '/tmp/browserforce/autostart/file.txt',
+        content: 'hello',
+      },
+    ],
+    commands: ['echo one', 'echo two'],
+  };
+  const execCalls = [];
+  const fsCalls = [];
+
+  const report = await applyAutostart(spec, {
+    dryRun: true,
+    execFn: async (command) => {
+      execCalls.push(command);
+    },
+    fsApi: {
+      mkdir: async (...args) => {
+        fsCalls.push(['mkdir', ...args]);
+      },
+      writeFile: async (...args) => {
+        fsCalls.push(['writeFile', ...args]);
+      },
+    },
+  });
+
+  assert.deepEqual(execCalls, []);
+  assert.deepEqual(fsCalls, []);
+  assert.deepEqual(report, {
+    wroteFiles: ['/tmp/browserforce/autostart/file.txt'],
+    ranCommands: [],
+    skippedCommands: ['echo one', 'echo two'],
+  });
+});
+
+test('applyAutostart dryRun=false executes commands in order', async () => {
+  const spec = {
+    filesToWrite: [],
+    commands: ['echo first', 'echo second', 'echo third'],
+  };
+  const execCalls = [];
+
+  const report = await applyAutostart(spec, {
+    dryRun: false,
+    execFn: async (command) => {
+      execCalls.push(command);
+    },
+    fsApi: {
+      mkdir: async () => {},
+      writeFile: async () => {},
+    },
+  });
+
+  assert.deepEqual(execCalls, ['echo first', 'echo second', 'echo third']);
+  assert.deepEqual(report, {
+    wroteFiles: [],
+    ranCommands: ['echo first', 'echo second', 'echo third'],
+    skippedCommands: [],
+  });
+});
+
+test('applyAutostart creates parent directories before writing files', async () => {
+  const spec = {
+    filesToWrite: [
+      {
+        path: '/tmp/browserforce/nested/path/file.txt',
+        content: 'created',
+      },
+    ],
+    commands: [],
+  };
+  const fsCalls = [];
+
+  const report = await applyAutostart(spec, {
+    dryRun: false,
+    execFn: async () => {},
+    fsApi: {
+      mkdir: async (...args) => {
+        fsCalls.push(['mkdir', ...args]);
+      },
+      writeFile: async (...args) => {
+        fsCalls.push(['writeFile', ...args]);
+      },
+    },
+  });
+
+  assert.deepEqual(fsCalls, [
+    ['mkdir', '/tmp/browserforce/nested/path', { recursive: true }],
+    ['writeFile', '/tmp/browserforce/nested/path/file.txt', 'created', 'utf8'],
+  ]);
+  assert.deepEqual(report, {
+    wroteFiles: ['/tmp/browserforce/nested/path/file.txt'],
+    ranCommands: [],
+    skippedCommands: [],
+  });
+});
+
+test('applyAutostart writes all files before executing any commands', async () => {
+  const spec = {
+    filesToWrite: [
+      {
+        path: '/tmp/browserforce/first/file.txt',
+        content: 'one',
+      },
+      {
+        path: '/tmp/browserforce/second/file.txt',
+        content: 'two',
+      },
+    ],
+    commands: ['echo after-files', 'echo still-after-files'],
+  };
+  const callOrder = [];
+
+  const report = await applyAutostart(spec, {
+    dryRun: false,
+    execFn: async (command) => {
+      callOrder.push(`exec:${command}`);
+    },
+    fsApi: {
+      mkdir: async (dirPath) => {
+        callOrder.push(`mkdir:${dirPath}`);
+      },
+      writeFile: async (filePath) => {
+        callOrder.push(`writeFile:${filePath}`);
+      },
+    },
+  });
+
+  assert.deepEqual(callOrder, [
+    'mkdir:/tmp/browserforce/first',
+    'writeFile:/tmp/browserforce/first/file.txt',
+    'mkdir:/tmp/browserforce/second',
+    'writeFile:/tmp/browserforce/second/file.txt',
+    'exec:echo after-files',
+    'exec:echo still-after-files',
+  ]);
+  assert.deepEqual(report, {
+    wroteFiles: ['/tmp/browserforce/first/file.txt', '/tmp/browserforce/second/file.txt'],
+    ranCommands: ['echo after-files', 'echo still-after-files'],
+    skippedCommands: [],
+  });
+});
+
+test('applyAutostart default execFn runs process.execPath command successfully', async () => {
+  const command = buildNodeEvalCommand('process.exit(0)');
+
+  const report = await applyAutostart(
+    {
+      filesToWrite: [],
+      commands: [command],
+    },
+    {
+      dryRun: false,
+      fsApi: {
+        mkdir: async () => {},
+        writeFile: async () => {},
+      },
+    },
+  );
+
+  assert.deepEqual(report, {
+    wroteFiles: [],
+    ranCommands: [command],
+    skippedCommands: [],
+  });
+});
+
+test('applyAutostart default execFn includes exit code and command on non-zero status', async () => {
+  const command = buildNodeEvalCommand('process.exit(7)');
+
+  await assert.rejects(
+    applyAutostart(
+      {
+        filesToWrite: [],
+        commands: [command],
+      },
+      {
+        dryRun: false,
+        fsApi: {
+          mkdir: async () => {},
+          writeFile: async () => {},
+        },
+      },
+    ),
+    (error) => {
+      assert.match(error.message, /Command failed with exit code 7/);
+      assert.equal(error.message.includes(command), true);
+      return true;
+    },
+  );
+});

From 3aa0ccb3f22a69ff9c08c685162c3008627d4b9d Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:12:58 +0530
Subject: [PATCH 084/192] feat: add setup openclaw CLI command

---
 bin.js           | 136 ++++++++++++++++++++++++++++++++++++++++++++++-
 test/cli.test.js |  93 +++++++++++++++++++++++++++++++-
 2 files changed, 227 insertions(+), 2 deletions(-)

diff --git a/bin.js b/bin.js
index 3a32ced..9151dc2 100644
--- a/bin.js
+++ b/bin.js
@@ -10,6 +10,8 @@ const { values, positionals } = parseArgs({
   options: {
     eval: { type: 'string', short: 'e' },
     timeout: { type: 'string', default: '30000' },
+    'dry-run': { type: 'boolean', default: false },
+    'no-autostart': { type: 'boolean', default: false },
     json: { type: 'boolean', default: false },
     help: { type: 'boolean', short: 'h', default: false },
   },
@@ -432,6 +434,134 @@ async function cmdInstallExtension() {
   await doInstallExtension(false);
 }
 
+async function cmdSetup() {
+  const target = positionals[1];
+  if (!target) {
+    console.error('Usage: browserforce setup openclaw [--dry-run] [--json] [--no-autostart]');
+    process.exit(1);
+  }
+
+  if (target !== 'openclaw') {
+    console.error(`Unknown setup target: ${target}`);
+    process.exit(1);
+  }
+
+  const { homedir } = await import('node:os');
+  const { join, dirname } = await import('node:path');
+  const fs = await import('node:fs/promises');
+  const {
+    mergeOpenClawConfig,
+    formatJsonStable,
+    buildAutostartSpec,
+    applyAutostart,
+  } = await import('./mcp/src/openclaw-setup.js');
+
+  const dryRun = values['dry-run'] === true;
+  const noAutostart = values['no-autostart'] === true;
+  const homeDir = homedir();
+  const openclawConfigPath = join(homeDir, '.openclaw', 'openclaw.json');
+  const openclawDir = dirname(openclawConfigPath);
+
+  let existingConfig = {};
+  let configExisted = false;
+  try {
+    const raw = await fs.readFile(openclawConfigPath, 'utf8');
+    configExisted = true;
+    existingConfig = raw.trim() ? JSON.parse(raw) : {};
+  } catch (err) {
+    if (err.code !== 'ENOENT') {
+      throw new Error(`Failed to read OpenClaw config at ${openclawConfigPath}: ${err.message}`);
+    }
+  }
+
+  const mergedConfig = mergeOpenClawConfig(existingConfig);
+  const mergedJson = formatJsonStable(mergedConfig);
+  if (!dryRun) {
+    await fs.mkdir(openclawDir, { recursive: true });
+    await fs.writeFile(openclawConfigPath, mergedJson, 'utf8');
+  }
+
+  let autostart = null;
+  if (!noAutostart) {
+    let autostartExecFn;
+    if (values.json) {
+      const { spawnSync } = await import('node:child_process');
+      autostartExecFn = (command) => {
+        const result = spawnSync(command, {
+          shell: true,
+          encoding: 'utf8',
+          stdio: ['ignore', 'pipe', 'pipe'],
+        });
+
+        if (result.error) {
+          throw result.error;
+        }
+
+        if (typeof result.status === 'number' && result.status !== 0) {
+          const commandOutput = [result.stderr, result.stdout]
+            .filter((chunk) => typeof chunk === 'string' && chunk.trim().length > 0)
+            .join('\n')
+            .trim();
+          throw new Error(
+            commandOutput
+              ? `Command failed with exit code ${result.status}: ${command}\n${commandOutput}`
+              : `Command failed with exit code ${result.status}: ${command}`,
+          );
+        }
+
+        if (result.status === null) {
+          throw new Error(`Command terminated unexpectedly: ${command}`);
+        }
+      };
+    }
+
+    const autostartSpec = buildAutostartSpec({
+      platform: process.platform,
+      homeDir,
+      nodePath: process.execPath,
+      binScriptPath: fileURLToPath(import.meta.url),
+    });
+    const autostartReport = await applyAutostart(autostartSpec, {
+      dryRun,
+      ...(autostartExecFn ? { execFn: autostartExecFn } : {}),
+    });
+    autostart = {
+      platform: autostartSpec.platform,
+      summary: autostartSpec.summary,
+      wroteFiles: autostartReport.wroteFiles,
+      ranCommands: autostartReport.ranCommands,
+      skippedCommands: autostartReport.skippedCommands,
+    };
+  }
+
+  const result = {
+    target: 'openclaw',
+    dryRun,
+    openclawConfigPath,
+    mcpAdapterConfigured: mergedConfig?.plugins?.entries?.['mcp-adapter']?.enabled === true,
+    configExisted,
+    configWritten: !dryRun,
+    autostart,
+  };
+
+  if (values.json) {
+    process.stdout.write(formatJsonStable(result));
+    return;
+  }
+
+  console.log('OpenClaw setup complete');
+  console.log(`  target: ${result.target}`);
+  console.log(`  openclawConfigPath: ${result.openclawConfigPath}`);
+  console.log(`  mcpAdapterConfigured: ${result.mcpAdapterConfigured}`);
+  console.log(`  config: ${dryRun ? 'dry-run (not written)' : 'written'}`);
+  if (noAutostart) {
+    console.log('  autostart: skipped (--no-autostart)');
+  } else {
+    console.log(`  autostart.platform: ${autostart.platform}`);
+    console.log(`  autostart: ${dryRun ? 'dry-run (not applied)' : 'applied'}`);
+  }
+}
+
 function cmdHelp() {
   console.log(`
   BrowserForce — Give AI agents your real Chrome browser
@@ -447,12 +577,15 @@ function cmdHelp() {
     browserforce plugin list        List installed plugins
     browserforce plugin install <n> Install a plugin from the registry
     browserforce plugin remove <n>  Remove an installed plugin
+    browserforce setup openclaw     Configure OpenClaw + optional autostart
     browserforce update             Update to the latest version
     browserforce install-extension  Copy extension to ~/.browserforce/extension/
     browserforce -e "<code>"        Execute Playwright JavaScript (one-shot)
 
   Options:
     --timeout <ms>    Execution timeout (default: 30000)
+    --dry-run         Preview setup changes without writing files
+    --no-autostart    Skip autostart setup for setup openclaw
     --json            JSON output
     -h, --help        Show this help
 
@@ -461,6 +594,7 @@ function cmdHelp() {
     browserforce tabs
     browserforce plugin list
     browserforce plugin install highlight
+    browserforce setup openclaw --dry-run --json
     browserforce update
     browserforce -e "return await snapshot()"
     browserforce -e "await page.goto('https://github.com'); return await snapshot()"
@@ -478,7 +612,7 @@ const commands = {
   serve: cmdServe, mcp: cmdMcp, status: cmdStatus, tabs: cmdTabs,
   screenshot: cmdScreenshot, snapshot: cmdSnapshot, navigate: cmdNavigate,
   execute: cmdExecute, plugin: cmdPlugin, update: cmdUpdate,
-  'install-extension': cmdInstallExtension, help: cmdHelp,
+  'install-extension': cmdInstallExtension, setup: cmdSetup, help: cmdHelp,
 };
 
 const handler = commands[command];
diff --git a/test/cli.test.js b/test/cli.test.js
index e6c2ed9..2bdcaae 100644
--- a/test/cli.test.js
+++ b/test/cli.test.js
@@ -5,7 +5,7 @@ import { execFile, spawn } from 'node:child_process';
 import { promisify } from 'node:util';
 import http from 'node:http';
 import { createRequire } from 'node:module';
-import { mkdirSync, rmSync } from 'node:fs';
+import { chmodSync, existsSync, mkdirSync, rmSync, writeFileSync } from 'node:fs';
 import { join } from 'node:path';
 import { tmpdir } from 'node:os';
 
@@ -310,3 +310,94 @@ describe('CLI install-extension', () => {
     rmSync(freshDir, { recursive: true, force: true });
   });
 });
+
+describe('CLI setup', () => {
+  it('help includes setup openclaw', async () => {
+    const { stdout } = await exec('node', ['bin.js', 'help']);
+    assert.ok(stdout.includes('setup openclaw'));
+  });
+
+  it('setup openclaw --json stays parse-safe when autostart commands print output', async () => {
+    if (process.platform === 'win32') return;
+
+    const homeDir = join(tmpdir(), `bf-openclaw-home-${Math.random().toString(36).slice(2)}`);
+    const binDir = join(tmpdir(), `bf-openclaw-bin-${Math.random().toString(36).slice(2)}`);
+    mkdirSync(homeDir, { recursive: true });
+    mkdirSync(binDir, { recursive: true });
+
+    const commandName = process.platform === 'darwin' ? 'launchctl' : 'systemctl';
+    const commandPath = join(binDir, commandName);
+    writeFileSync(commandPath, '#!/bin/sh\necho "mock stdout from autostart"\necho "mock stderr from autostart" 1>&2\nexit 0\n', 'utf8');
+    chmodSync(commandPath, 0o755);
+
+    const { stdout } = await exec('node', ['bin.js', 'setup', 'openclaw', '--json'], {
+      env: {
+        ...process.env,
+        HOME: homeDir,
+        PATH: `${binDir}:${process.env.PATH || ''}`,
+      },
+    });
+
+    const result = JSON.parse(stdout);
+    assert.equal(result.target, 'openclaw');
+    assert.equal(result.dryRun, false);
+    assert.equal(typeof result.autostart.platform, 'string');
+
+    rmSync(homeDir, { recursive: true, force: true });
+    rmSync(binDir, { recursive: true, force: true });
+  });
+
+  it('setup openclaw --dry-run --json outputs expected keys', async () => {
+    const homeDir = join(tmpdir(), `bf-openclaw-home-${Math.random().toString(36).slice(2)}`);
+    mkdirSync(homeDir, { recursive: true });
+
+    const { stdout } = await exec('node', ['bin.js', 'setup', 'openclaw', '--dry-run', '--json'], {
+      env: { ...process.env, HOME: homeDir },
+    });
+
+    const result = JSON.parse(stdout);
+    assert.equal(typeof result.openclawConfigPath, 'string');
+    assert.equal(result.mcpAdapterConfigured, true);
+    assert.equal(typeof result.autostart.platform, 'string');
+    assert.equal(existsSync(join(homeDir, '.openclaw', 'openclaw.json')), false);
+
+    rmSync(homeDir, { recursive: true, force: true });
+  });
+
+  it('setup openclaw --dry-run --no-autostart --json skips autostart and returns base keys', async () => {
+    const homeDir = join(tmpdir(), `bf-openclaw-home-${Math.random().toString(36).slice(2)}`);
+    mkdirSync(homeDir, { recursive: true });
+
+    const { stdout } = await exec('node', ['bin.js', 'setup', 'openclaw', '--dry-run', '--no-autostart', '--json'], {
+      env: { ...process.env, HOME: homeDir },
+    });
+
+    const result = JSON.parse(stdout);
+    assert.deepEqual(Object.keys(result).sort(), [
+      'autostart',
+      'configExisted',
+      'configWritten',
+      'dryRun',
+      'mcpAdapterConfigured',
+      'openclawConfigPath',
+      'target',
+    ].sort());
+    assert.equal(result.target, 'openclaw');
+    assert.equal(result.dryRun, true);
+    assert.equal(result.mcpAdapterConfigured, true);
+    assert.equal(result.autostart, null);
+    assert.equal(existsSync(join(homeDir, '.openclaw', 'openclaw.json')), false);
+
+    rmSync(homeDir, { recursive: true, force: true });
+  });
+
+  it('setup unknown target exits non-zero with error', async () => {
+    try {
+      await exec('node', ['bin.js', 'setup', 'nope']);
+      assert.fail('should have exited with error');
+    } catch (err) {
+      assert.ok(err.code !== 0);
+      assert.ok(err.stderr.includes('Unknown setup target'));
+    }
+  });
+});

From 288b0bb719496ff99e795700d1ca19edea382c6e Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:13:02 +0530
Subject: [PATCH 085/192] feat: add opt-in postinstall openclaw setup hook

---
 package.json                     |   4 +-
 scripts/postinstall-openclaw.mjs |  47 +++++++++
 test/postinstall.test.js         | 160 +++++++++++++++++++++++++++++++
 3 files changed, 210 insertions(+), 1 deletion(-)
 create mode 100644 scripts/postinstall-openclaw.mjs
 create mode 100644 test/postinstall.test.js

diff --git a/package.json b/package.json
index 9d46e82..690904b 100644
--- a/package.json
+++ b/package.json
@@ -29,6 +29,7 @@
   "files": [
     "README.md",
     "bin.js",
+    "scripts/",
     "extension/",
     "relay/src/",
     "relay/package.json",
@@ -47,7 +48,8 @@
     "relay": "lsof -ti tcp:19222 | xargs kill -9 2>/dev/null; sleep 0.3; node relay/src/index.js",
     "relay:dev": "lsof -ti tcp:19222 | xargs kill -9 2>/dev/null; sleep 0.3; node --watch relay/src/index.js",
     "mcp": "node mcp/src/index.js",
-    "test": "node --test relay/test/relay-server.test.js && node --test mcp/test/mcp-tools.test.js && node --test mcp/test/plugin-loader.test.js && node --test mcp/test/plugin-installer.test.js && node --test mcp/test/exec-engine-plugins.test.js && node --test mcp/test/mcp-plugin-integration.test.js && node --test test/cli.test.js",
+    "postinstall": "node scripts/postinstall-openclaw.mjs",
+    "test": "node --test relay/test/relay-server.test.js && node --test mcp/test/mcp-tools.test.js && node --test mcp/test/plugin-loader.test.js && node --test mcp/test/plugin-installer.test.js && node --test mcp/test/exec-engine-plugins.test.js && node --test mcp/test/mcp-plugin-integration.test.js && node --test test/cli.test.js && node --test test/postinstall.test.js",
     "test:relay": "node --test relay/test/relay-server.test.js",
     "test:mcp": "node --test mcp/test/mcp-tools.test.js && node --test mcp/test/plugin-loader.test.js && node --test mcp/test/plugin-installer.test.js && node --test mcp/test/exec-engine-plugins.test.js && node --test mcp/test/mcp-plugin-integration.test.js"
   }
diff --git a/scripts/postinstall-openclaw.mjs b/scripts/postinstall-openclaw.mjs
new file mode 100644
index 0000000..f527a96
--- /dev/null
+++ b/scripts/postinstall-openclaw.mjs
@@ -0,0 +1,47 @@
+import { spawn } from 'node:child_process';
+import { dirname, resolve } from 'node:path';
+import { fileURLToPath } from 'node:url';
+
+const scriptDir = dirname(fileURLToPath(import.meta.url));
+const binScriptPath = resolve(scriptDir, '..', 'bin.js');
+
+function isCiEnv() {
+  const raw = process.env.CI;
+  if (!raw) return false;
+  const normalized = String(raw).toLowerCase();
+  return normalized !== '0' && normalized !== 'false';
+}
+
+function runSetup(args) {
+  return new Promise((resolve, reject) => {
+    const child = spawn(process.execPath, [binScriptPath, 'setup', 'openclaw', ...args], {
+      stdio: 'inherit',
+      env: process.env,
+    });
+
+    child.on('error', reject);
+    child.on('close', (code) => {
+      if (code === 0) {
+        resolve();
+        return;
+      }
+      reject(new Error(`setup openclaw exited with code ${code}`));
+    });
+  });
+}
+
+async function main() {
+  if (process.env.BROWSERFORCE_SETUP_OPENCLAW !== '1') return;
+  if (isCiEnv() && process.env.BROWSERFORCE_SETUP_OPENCLAW_FORCE !== '1') return;
+
+  await runSetup(['--dry-run', '--json']);
+
+  if (process.env.BROWSERFORCE_SETUP_OPENCLAW_APPLY === '1') {
+    await runSetup(['--json']);
+  }
+}
+
+main().catch((error) => {
+  console.error(`[postinstall-openclaw] ${error.message}`);
+  process.exitCode = 1;
+});
diff --git a/test/postinstall.test.js b/test/postinstall.test.js
new file mode 100644
index 0000000..5992cc4
--- /dev/null
+++ b/test/postinstall.test.js
@@ -0,0 +1,160 @@
+import { describe, it } from 'node:test';
+import assert from 'node:assert/strict';
+import { execFile } from 'node:child_process';
+import { promisify } from 'node:util';
+import { mkdtempSync, mkdirSync, rmSync, existsSync, readFileSync, writeFileSync } from 'node:fs';
+import { join, dirname } from 'node:path';
+import { tmpdir } from 'node:os';
+import { fileURLToPath } from 'node:url';
+
+const exec = promisify(execFile);
+const repoRoot = join(dirname(fileURLToPath(import.meta.url)), '..');
+const postinstallScript = join(repoRoot, 'scripts', 'postinstall-openclaw.mjs');
+
+function buildEnv(overrides = {}) {
+  const env = { ...process.env };
+  for (const [key, value] of Object.entries(overrides)) {
+    if (value === null) {
+      delete env[key];
+      continue;
+    }
+    env[key] = value;
+  }
+  return env;
+}
+
+function createSandbox() {
+  const root = mkdtempSync(join(tmpdir(), 'bf-postinstall-'));
+  const cwd = join(root, 'cwd');
+  const home = join(root, 'home');
+  mkdirSync(cwd, { recursive: true });
+  mkdirSync(home, { recursive: true });
+  return { root, cwd, home };
+}
+
+function writeSpawnHook({ hookPath, logPath }) {
+  const hookSource = `const cp = require('node:child_process');
+const { appendFileSync } = require('node:fs');
+const { EventEmitter } = require('node:events');
+
+cp.spawn = function spawnStub(command, args = []) {
+  appendFileSync(${JSON.stringify(logPath)}, [command, ...(Array.isArray(args) ? args : [])].join(' ') + '\\n');
+  const child = new EventEmitter();
+  process.nextTick(() => child.emit('close', 0));
+  return child;
+};
+`;
+
+  writeFileSync(hookPath, hookSource, 'utf8');
+}
+
+async function runPostinstall({ cwd = repoRoot, env = {}, nodeArgs = [] } = {}) {
+  return exec(process.execPath, [...nodeArgs, postinstallScript], {
+    cwd,
+    env: buildEnv(env),
+  });
+}
+
+describe('postinstall openclaw hook', () => {
+  it('exits 0 and does nothing when BROWSERFORCE_SETUP_OPENCLAW is unset', async () => {
+    const sandbox = createSandbox();
+    try {
+      const { stdout } = await runPostinstall({
+        cwd: sandbox.cwd,
+        env: {
+          BROWSERFORCE_SETUP_OPENCLAW: null,
+          BROWSERFORCE_SETUP_OPENCLAW_FORCE: null,
+          BROWSERFORCE_SETUP_OPENCLAW_APPLY: null,
+          HOME: sandbox.home,
+          PATH: '',
+        },
+      });
+
+      assert.equal(stdout.trim(), '');
+      assert.equal(existsSync(join(sandbox.home, '.openclaw')), false);
+    } finally {
+      rmSync(sandbox.root, { recursive: true, force: true });
+    }
+  });
+
+  it('runs dry-run setup command when opt-in is enabled in CI-safe mode', async () => {
+    const sandbox = createSandbox();
+    try {
+      const { stdout } = await runPostinstall({
+        cwd: sandbox.cwd,
+        env: {
+          BROWSERFORCE_SETUP_OPENCLAW: '1',
+          BROWSERFORCE_SETUP_OPENCLAW_FORCE: '1',
+          BROWSERFORCE_SETUP_OPENCLAW_APPLY: null,
+          CI: '1',
+          HOME: sandbox.home,
+          PATH: '',
+        },
+      });
+
+      assert.match(stdout, /"target"\s*:\s*"openclaw"/);
+      assert.match(stdout, /"dryRun"\s*:\s*true/);
+    } finally {
+      rmSync(sandbox.root, { recursive: true, force: true });
+    }
+  });
+
+  it('exits 0 and skips setup command in CI when force is not enabled', async () => {
+    const sandbox = createSandbox();
+    try {
+      const { stdout } = await runPostinstall({
+        cwd: sandbox.cwd,
+        env: {
+          BROWSERFORCE_SETUP_OPENCLAW: '1',
+          BROWSERFORCE_SETUP_OPENCLAW_FORCE: null,
+          BROWSERFORCE_SETUP_OPENCLAW_APPLY: null,
+          CI: '1',
+          HOME: sandbox.home,
+          PATH: '',
+        },
+      });
+
+      assert.equal(stdout.trim(), '');
+      assert.equal(existsSync(join(sandbox.home, '.openclaw')), false);
+    } finally {
+      rmSync(sandbox.root, { recursive: true, force: true });
+    }
+  });
+
+  it('runs dry-run setup command before apply command when apply mode is enabled', async () => {
+    const sandbox = createSandbox();
+    const hookPath = join(sandbox.root, 'spawn-hook.cjs');
+    const logPath = join(sandbox.root, 'spawn-invocations.log');
+    writeSpawnHook({ hookPath, logPath });
+
+    try {
+      const { stdout } = await runPostinstall({
+        cwd: sandbox.cwd,
+        nodeArgs: ['--require', hookPath],
+        env: {
+          BROWSERFORCE_SETUP_OPENCLAW: '1',
+          BROWSERFORCE_SETUP_OPENCLAW_FORCE: '1',
+          BROWSERFORCE_SETUP_OPENCLAW_APPLY: '1',
+          CI: null,
+          HOME: sandbox.home,
+          PATH: '',
+        },
+      });
+
+      assert.equal(stdout.trim(), '');
+
+      const invocations = readFileSync(logPath, 'utf8')
+        .split(/\r?\n/)
+        .map((line) => line.trim())
+        .filter(Boolean);
+
+      const expectedCommandPrefix = `${process.execPath} ${join(repoRoot, 'bin.js')} setup openclaw`;
+      assert.deepEqual(invocations, [
+        `${expectedCommandPrefix} --dry-run --json`,
+        `${expectedCommandPrefix} --json`,
+      ]);
+    } finally {
+      rmSync(sandbox.root, { recursive: true, force: true });
+    }
+  });
+});

From a694270e7ae19dd9078a0bdb5063571b5ff6a90d Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:13:07 +0530
Subject: [PATCH 086/192] docs: add openclaw setup and autostart guidance

---
 README.frontpage.md | 44 +++++++++++++++++++++++++++++++++++++++-----
 README.md           | 44 +++++++++++++++++++++++++++++++++++++++-----
 2 files changed, 78 insertions(+), 10 deletions(-)

diff --git a/README.frontpage.md b/README.frontpage.md
index 4aed4f7..f11e4f5 100644
--- a/README.frontpage.md
+++ b/README.frontpage.md
@@ -200,13 +200,42 @@ browserforce serve
 
 Most OpenClaw users chat with their agent from Telegram or WhatsApp. BrowserForce lets your agent browse the web as you — no login flows, no captchas — even from a messaging app.
 
-**Quick setup** (copy-paste into your terminal):
+#### OpenClaw One-Time Setup
 
 ```bash
-npm install -g browserforce && browserforce install-extension && npx -y skills add ivalsaraj/browserforce
+npm install -g browserforce
+browserforce install-extension
+browserforce setup openclaw
+```
+
+Optional: install the BrowserForce skill for your OpenClaw agent:
+
+```bash
+npx -y skills add ivalsaraj/browserforce
+```
+
+#### Autostart Modes
+
+- `Default wrapper mode`: `setup openclaw` writes an OpenClaw MCP server entry that starts `browserforce serve` on-demand before `browserforce mcp`.
+- `Always-on daemon mode`: `setup openclaw` also installs OS autostart by default (launchd on macOS, systemd user service on Linux, scheduled task on Windows) so relay is already running after login.
+- `No daemon registration mode`: run `browserforce setup openclaw --no-autostart` to skip OS login service/daemon registration only; MCP wrapper autostart-on-demand still runs `browserforce serve` before `browserforce mcp`.
+
+Setup flags:
+- `--dry-run`: preview OpenClaw/autostart changes without writing files.
+- `--json`: print machine-readable setup output.
+- `--no-autostart`: skip OS login service/daemon registration only; wrapper autostart-on-demand stays enabled.
+
+Opt-in install automation environment variables:
+
+```bash
+BROWSERFORCE_SETUP_OPENCLAW=1
+BROWSERFORCE_SETUP_OPENCLAW_FORCE=1
+BROWSERFORCE_SETUP_OPENCLAW_APPLY=1
 ```
 
-Then start the relay (keep this running):
+When enabled, package install runs `setup openclaw --dry-run --json`; adding `BROWSERFORCE_SETUP_OPENCLAW_APPLY=1` also runs `setup openclaw --json`. In CI, `BROWSERFORCE_SETUP_OPENCLAW_FORCE=1` is required for the setup hook to run.
+
+Then start the relay (only needed if you want to run it manually):
 
 ```bash
 browserforce serve
@@ -236,8 +265,8 @@ Add to `~/.openclaw/openclaw.json`:
             {
               "name": "browserforce",
               "transport": "stdio",
-              "command": "npx",
-              "args": ["-y", "browserforce@latest", "mcp"]
+              "command": "sh",
+              "args": ["-lc", "if ! lsof -tiTCP:19222 -sTCP:LISTEN >/dev/null 2>&1; then npx -y browserforce@latest serve >/dev/null 2>&1 & fi; exec npx -y browserforce@latest mcp"]
             }
           ]
         }
@@ -247,6 +276,8 @@ Add to `~/.openclaw/openclaw.json`:
 }
 ```
 
+This wrapper-style entry auto-starts the relay on demand. Manual/non-wrapper alternative: use `npx -y browserforce@latest mcp` and keep `browserforce serve` running yourself.
+
 </details>
 
 <details>
@@ -360,10 +391,13 @@ browserforce -e "<code>"        # Run Playwright JavaScript (one-shot)
 browserforce plugin list        # List installed plugins
 browserforce plugin install <n> # Install a plugin from the registry
 browserforce plugin remove <n>  # Remove an installed plugin
+browserforce setup openclaw [--dry-run] [--json] [--no-autostart] # Configure OpenClaw + optional autostart
 browserforce update             # Update to the latest version
 browserforce install-extension  # Copy extension to ~/.browserforce/extension/
 ```
 
+Setup flags: `--dry-run` (preview), `--no-autostart` (skip OS login daemon/service registration only), `--json` (machine-readable output).
+
 Each `-e` command is one-shot — state does not persist between calls. For persistent state, use the MCP server.
 
 ## Plugins
diff --git a/README.md b/README.md
index 2b2ef55..85c11d5 100644
--- a/README.md
+++ b/README.md
@@ -95,13 +95,42 @@ browserforce serve
 
 Most OpenClaw users chat with their agent from Telegram or WhatsApp. BrowserForce lets your agent browse the web as you — no login flows, no captchas — even from a messaging app.
 
-**Quick setup** (copy-paste into your terminal):
+#### OpenClaw One-Time Setup
 
 ```bash
-npm install -g browserforce && browserforce install-extension && npx -y skills add ivalsaraj/browserforce
+npm install -g browserforce
+browserforce install-extension
+browserforce setup openclaw
+```
+
+Optional: install the BrowserForce skill for your OpenClaw agent:
+
+```bash
+npx -y skills add ivalsaraj/browserforce
 ```
 
-Then start the relay (keep this running):
+#### Autostart Modes
+
+- `Default wrapper mode`: `setup openclaw` writes an OpenClaw MCP server entry that starts `browserforce serve` on-demand before `browserforce mcp`.
+- `Always-on daemon mode`: `setup openclaw` also installs OS autostart by default (launchd on macOS, systemd user service on Linux, scheduled task on Windows) so relay is already running after login.
+- `No daemon registration mode`: run `browserforce setup openclaw --no-autostart` to skip OS login service/daemon registration only; MCP wrapper autostart-on-demand still runs `browserforce serve` before `browserforce mcp`.
+
+Setup flags:
+- `--dry-run`: preview OpenClaw/autostart changes without writing files.
+- `--json`: print machine-readable setup output.
+- `--no-autostart`: skip OS login service/daemon registration only; wrapper autostart-on-demand stays enabled.
+
+Opt-in install automation environment variables:
+
+```bash
+BROWSERFORCE_SETUP_OPENCLAW=1
+BROWSERFORCE_SETUP_OPENCLAW_FORCE=1
+BROWSERFORCE_SETUP_OPENCLAW_APPLY=1
+```
+
+When enabled, package install runs `setup openclaw --dry-run --json`; adding `BROWSERFORCE_SETUP_OPENCLAW_APPLY=1` also runs `setup openclaw --json`. In CI, `BROWSERFORCE_SETUP_OPENCLAW_FORCE=1` is required for the setup hook to run.
+
+Then start the relay (only needed if you want to run it manually):
 
 ```bash
 browserforce serve
@@ -131,8 +160,8 @@ Add to `~/.openclaw/openclaw.json`:
             {
               "name": "browserforce",
               "transport": "stdio",
-              "command": "npx",
-              "args": ["-y", "browserforce@latest", "mcp"]
+              "command": "sh",
+              "args": ["-lc", "if ! lsof -tiTCP:19222 -sTCP:LISTEN >/dev/null 2>&1; then npx -y browserforce@latest serve >/dev/null 2>&1 & fi; exec npx -y browserforce@latest mcp"]
             }
           ]
         }
@@ -142,6 +171,8 @@ Add to `~/.openclaw/openclaw.json`:
 }
 ```
 
+This wrapper-style entry auto-starts the relay on demand. Manual/non-wrapper alternative: use `npx -y browserforce@latest mcp` and keep `browserforce serve` running yourself.
+
 </details>
 
 <details>
@@ -316,10 +347,13 @@ browserforce -e "<code>"        # Run Playwright JavaScript (one-shot)
 browserforce plugin list        # List installed plugins
 browserforce plugin install <n> # Install a plugin from the registry
 browserforce plugin remove <n>  # Remove an installed plugin
+browserforce setup openclaw [--dry-run] [--json] [--no-autostart] # Configure OpenClaw + optional autostart
 browserforce update             # Update to the latest version
 browserforce install-extension  # Copy extension to ~/.browserforce/extension/
 ```
 
+Setup flags: `--dry-run` (preview), `--no-autostart` (skip OS login daemon/service registration only), `--json` (machine-readable output).
+
 Each `-e` command is one-shot — state does not persist between calls. For persistent state, use the MCP server.
 
 

From 985f2f6baf2da13b5a860b1ab49c92aa002f863e Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:16:30 +0530
Subject: [PATCH 087/192] fix(connection): prefer relay port/json-version and
 preflight extension status

---
 bin.js                 |  11 ++++-
 mcp/src/exec-engine.js | 108 +++++++++++++++++++++++++++++++++++------
 mcp/src/index.js       |   9 ++--
 3 files changed, 107 insertions(+), 21 deletions(-)

diff --git a/bin.js b/bin.js
index 3a32ced..abcaa5c 100644
--- a/bin.js
+++ b/bin.js
@@ -71,7 +71,12 @@ function httpFetch(method, url, body, authToken) {
 }
 
 async function connectBrowser() {
-  const { getCdpUrl, ensureRelay } = await import('./mcp/src/exec-engine.js');
+  const {
+    getCdpUrl,
+    ensureRelay,
+    assertExtensionConnected,
+    getRelayHttpUrlFromCdpUrl,
+  } = await import('./mcp/src/exec-engine.js');
   await ensureRelay();
   // playwright-core lives in mcp/node_modules (pnpm workspace sub-package).
   // Use createRequire from the mcp package context to locate it, then dynamic-import.
@@ -80,7 +85,9 @@ async function connectBrowser() {
   const pwPath = mReq.resolve('playwright-core');
   const { default: pw } = await import(pwPath);
   const { chromium } = pw;
-  const cdpUrl = getCdpUrl();
+  const cdpUrl = await getCdpUrl();
+  const baseUrl = getRelayHttpUrlFromCdpUrl(cdpUrl);
+  await assertExtensionConnected({ baseUrl });
   return chromium.connectOverCDP(cdpUrl);
 }
 
diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index a18c3c7..c98fe85 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -21,28 +21,106 @@ export const BF_DIR = join(homedir(), '.browserforce');
 export const CDP_URL_FILE = join(BF_DIR, 'cdp-url');
 const RELAY_SCRIPT = fileURLToPath(new URL('../../relay/src/index.js', import.meta.url));
 
-export function getCdpUrl() {
-  if (process.env.BF_CDP_URL) return process.env.BF_CDP_URL;
+function getExplicitCdpUrlOverride() {
+  const value = process.env.BF_CDP_URL;
+  if (!value) return null;
+  const trimmed = value.trim();
+  return trimmed || null;
+}
+
+function parseRelayHttpUrlFromCdpUrl(cdpUrl) {
+  try {
+    const parsed = new URL(cdpUrl);
+    if (!parsed.hostname || !parsed.port) return null;
+    return `http://${parsed.hostname}:${parsed.port}`;
+  } catch {
+    return null;
+  }
+}
+
+function readCdpUrlFromFile() {
   try {
     const url = readFileSync(CDP_URL_FILE, 'utf8').trim();
-    if (url) return url;
+    return url || null;
   } catch { /* fall through */ }
+  return null;
+}
+
+export async function getCdpUrl({ baseUrl = getRelayHttpUrl(), timeoutMs = 2000 } = {}) {
+  const explicit = getExplicitCdpUrlOverride();
+  if (explicit) return explicit;
+
+  const resolvedBaseUrl = String(baseUrl).replace(/\/+$/, '');
+  try {
+    const response = await fetch(`${resolvedBaseUrl}/json/version`, {
+      signal: AbortSignal.timeout(timeoutMs),
+    });
+    if (response.ok) {
+      const data = await response.json();
+      if (typeof data?.webSocketDebuggerUrl === 'string' && data.webSocketDebuggerUrl.trim()) {
+        return data.webSocketDebuggerUrl.trim();
+      }
+    }
+  } catch { /* fall through */ }
+
+  const legacyFileUrl = readCdpUrlFromFile();
+  if (legacyFileUrl) return legacyFileUrl;
+
   throw new Error(
     'Cannot find CDP URL. Either:\n' +
     '  1. Start the relay first: browserforce serve\n' +
-    '  2. Set BF_CDP_URL environment variable'
+    `  2. Ensure relay is reachable at ${resolvedBaseUrl}\n` +
+    '  3. Set BF_CDP_URL environment variable'
   );
 }
 
 /** Derive the relay HTTP base URL from the CDP WebSocket URL. */
 export function getRelayHttpUrl() {
-  const cdpUrl = getCdpUrl();
+  const explicit = getExplicitCdpUrlOverride();
+  if (explicit) {
+    return parseRelayHttpUrlFromCdpUrl(explicit) || `http://127.0.0.1:${getRelayPort()}`;
+  }
+  return `http://127.0.0.1:${getRelayPort()}`;
+}
+
+export function getRelayHttpUrlFromCdpUrl(cdpUrl) {
+  return parseRelayHttpUrlFromCdpUrl(cdpUrl) || getRelayHttpUrl();
+}
+
+export async function assertExtensionConnected({ baseUrl = getRelayHttpUrl(), timeoutMs = 2000 } = {}) {
+  const resolvedBaseUrl = String(baseUrl).replace(/\/+$/, '');
+  let response;
   try {
-    const parsed = new URL(cdpUrl);
-    return `http://${parsed.hostname}:${parsed.port}`;
+    response = await fetch(`${resolvedBaseUrl}/`, {
+      signal: AbortSignal.timeout(timeoutMs),
+    });
   } catch {
-    return `http://127.0.0.1:${DEFAULT_PORT}`;
+    throw new Error(
+      `Cannot reach BrowserForce relay at ${resolvedBaseUrl}. ` +
+      'Start it with `browserforce serve`.'
+    );
   }
+
+  if (!response.ok) {
+    throw new Error(
+      `Cannot reach BrowserForce relay at ${resolvedBaseUrl} (HTTP ${response.status}).`
+    );
+  }
+
+  let status;
+  try {
+    status = await response.json();
+  } catch {
+    throw new Error(`Relay at ${resolvedBaseUrl} returned invalid status JSON.`);
+  }
+
+  if (status?.extension !== true) {
+    throw new Error(
+      `BrowserForce extension is not connected to relay at ${resolvedBaseUrl}.`
+    );
+  }
+
+  return status;
 }
 
 export function isCdpBusyError(err) {
@@ -111,14 +189,10 @@ export async function connectOverCdpWithBusyRetry({
 // ─── Auto-start relay ───────────────────────────────────────────────────────
 
 function getRelayPort() {
-  if (process.env.RELAY_PORT) return parseInt(process.env.RELAY_PORT, 10);
-  try {
-    const url = readFileSync(CDP_URL_FILE, 'utf8').trim();
-    if (url) {
-      const port = new URL(url).port;
-      if (port) return parseInt(port, 10);
-    }
-  } catch { /* fall through */ }
+  if (process.env.RELAY_PORT) {
+    const parsed = parseInt(process.env.RELAY_PORT, 10);
+    if (Number.isFinite(parsed) && parsed > 0) return parsed;
+  }
   return DEFAULT_PORT;
 }
 
@@ -136,6 +210,8 @@ async function isRelayRunning(port) {
  * background process and wait for it to become reachable.
  */
 export async function ensureRelay() {
+  if (getExplicitCdpUrlOverride()) return;
+
   const port = getRelayPort();
   if (await isRelayRunning(port)) return;
 
diff --git a/mcp/src/index.js b/mcp/src/index.js
index fe5ad80..4a184d3 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -7,7 +7,8 @@ import { StdioServerTransport } from '@modelcontextprotocol/sdk/server/stdio.js'
 import { z } from 'zod';
 import { chromium } from 'playwright-core';
 import {
-  getCdpUrl, getRelayHttpUrl, ensureRelay, connectOverCdpWithBusyRetry,
+  getCdpUrl, getRelayHttpUrl, getRelayHttpUrlFromCdpUrl, assertExtensionConnected,
+  ensureRelay, connectOverCdpWithBusyRetry,
   CodeExecutionTimeoutError, buildExecContext, runCode, formatResult,
 } from './exec-engine.js';
 import { loadPlugins, buildPluginHelpers, buildPluginSkillAppendix } from './plugin-loader.js';
@@ -98,11 +99,13 @@ async function ensureBrowser() {
 
   browserConnectPromise = (async () => {
     await ensureRelay();
-    const cdpUrl = withClientLabel(getCdpUrl());
+    const cdpUrl = withClientLabel(await getCdpUrl());
+    const baseUrl = getRelayHttpUrlFromCdpUrl(cdpUrl);
+    await assertExtensionConnected({ baseUrl });
     const nextBrowser = await connectOverCdpWithBusyRetry({
       connect: (url) => chromium.connectOverCDP(url),
       cdpUrl,
-      baseUrl: getRelayHttpUrl(),
+      baseUrl,
       timeoutMs: CONNECT_RETRY_TIMEOUT_MS,
     });
     browser = nextBrowser;

From 118a09ae93dc7f6ff89d6de9296ec1314738c78b Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:16:34 +0530
Subject: [PATCH 088/192] test(exec-engine): cover relay-port default
 resolution and extension preflight

---
 mcp/test/exec-engine-plugins.test.js | 95 +++++++++++++++++++++++++++-
 1 file changed, 94 insertions(+), 1 deletion(-)

diff --git a/mcp/test/exec-engine-plugins.test.js b/mcp/test/exec-engine-plugins.test.js
index 0a22cab..71c3659 100644
--- a/mcp/test/exec-engine-plugins.test.js
+++ b/mcp/test/exec-engine-plugins.test.js
@@ -1,6 +1,13 @@
 import { test } from 'node:test';
 import assert from 'node:assert/strict';
-import { buildExecContext, runCode, formatResult } from '../src/exec-engine.js';
+import {
+  buildExecContext,
+  runCode,
+  formatResult,
+  getRelayHttpUrl,
+  getCdpUrl,
+  assertExtensionConnected,
+} from '../src/exec-engine.js';
 
 const mockPage = { isClosed: () => false, url: () => 'about:blank', title: async () => 'Test' };
 const mockCtx = { pages: () => [mockPage] };
@@ -226,3 +233,89 @@ test('pageMarkdown search resets regex state for g/y regex flags', async () => {
   assert.ok(result.includes('target on only line'));
   assert.ok(!result.includes('No matches found'));
 });
+
+test('getRelayHttpUrl defaults to localhost:19222', () => {
+  const originalPort = process.env.RELAY_PORT;
+  const originalCdp = process.env.BF_CDP_URL;
+  try {
+    delete process.env.RELAY_PORT;
+    delete process.env.BF_CDP_URL;
+    assert.equal(getRelayHttpUrl(), 'http://127.0.0.1:19222');
+  } finally {
+    if (originalPort === undefined) delete process.env.RELAY_PORT;
+    else process.env.RELAY_PORT = originalPort;
+    if (originalCdp === undefined) delete process.env.BF_CDP_URL;
+    else process.env.BF_CDP_URL = originalCdp;
+  }
+});
+
+test('getRelayHttpUrl respects BF_CDP_URL override host/port', () => {
+  const originalCdp = process.env.BF_CDP_URL;
+  try {
+    process.env.BF_CDP_URL = 'ws://127.0.0.1:19457/cdp?token=test-token';
+    assert.equal(getRelayHttpUrl(), 'http://127.0.0.1:19457');
+  } finally {
+    if (originalCdp === undefined) delete process.env.BF_CDP_URL;
+    else process.env.BF_CDP_URL = originalCdp;
+  }
+});
+
+test('getCdpUrl resolves from /json/version when BF_CDP_URL is not set', async () => {
+  const originalCdp = process.env.BF_CDP_URL;
+  const originalPort = process.env.RELAY_PORT;
+  const originalFetch = globalThis.fetch;
+
+  try {
+    delete process.env.BF_CDP_URL;
+    process.env.RELAY_PORT = '19222';
+    globalThis.fetch = async (url) => {
+      assert.equal(url, 'http://127.0.0.1:19222/json/version');
+      return {
+        ok: true,
+        json: async () => ({
+          webSocketDebuggerUrl: 'ws://127.0.0.1:19222/cdp?token=from-json-version',
+        }),
+      };
+    };
+
+    const cdpUrl = await getCdpUrl();
+    assert.equal(cdpUrl, 'ws://127.0.0.1:19222/cdp?token=from-json-version');
+  } finally {
+    if (originalCdp === undefined) delete process.env.BF_CDP_URL;
+    else process.env.BF_CDP_URL = originalCdp;
+    if (originalPort === undefined) delete process.env.RELAY_PORT;
+    else process.env.RELAY_PORT = originalPort;
+    globalThis.fetch = originalFetch;
+  }
+});
+
+test('assertExtensionConnected throws a clear error when extension is disconnected', async () => {
+  const originalFetch = globalThis.fetch;
+  try {
+    globalThis.fetch = async () => ({
+      ok: true,
+      json: async () => ({ status: 'ok', extension: false }),
+    });
+    await assert.rejects(
+      () => assertExtensionConnected({ baseUrl: 'http://127.0.0.1:19222' }),
+      /extension is not connected/i
+    );
+  } finally {
+    globalThis.fetch = originalFetch;
+  }
+});
+
+test('assertExtensionConnected succeeds when extension is connected', async () => {
+  const originalFetch = globalThis.fetch;
+  try {
+    globalThis.fetch = async () => ({
+      ok: true,
+      json: async () => ({ status: 'ok', extension: true }),
+    });
+    await assert.doesNotReject(
+      () => assertExtensionConnected({ baseUrl: 'http://127.0.0.1:19222' })
+    );
+  } finally {
+    globalThis.fetch = originalFetch;
+  }
+});

From 5adb4c2a1b65ee9369c38347b74ca4c04989b6b4 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:16:39 +0530
Subject: [PATCH 089/192] docs(readme): add simple port-conflict and
 port-switch guide

---
 README.md | 66 ++++++++++++++++++++++++++++++++++++++++++++++++++-----
 1 file changed, 61 insertions(+), 5 deletions(-)

diff --git a/README.md b/README.md
index 2b2ef55..92a6754 100644
--- a/README.md
+++ b/README.md
@@ -764,24 +764,80 @@ Everything runs on your machine. The auth token is stored at `~/.browserforce/au
 
 ## Configuration
 
-**Custom relay port:**
+### Port conflicts and switching ports
+
+Default port is `19222`.
+
+If that port is already in use, you can see:
+- relay start failure (`EADDRINUSE` / port in use)
+- extension stays disconnected (gray)
+- MCP errors like `Extension not connected`
+
+Switch all components to the same new port (example: `19333`):
+
+1. Start relay on the new port:
 
 ```bash
 RELAY_PORT=19333 browserforce serve
 ```
 
-**Extension relay URL:** Click the extension icon → change the URL → Save. Default: `ws://127.0.0.1:19222/extension`
+2. In extension popup, set relay URL to:
+
+```text
+ws://127.0.0.1:19333/extension
+```
 
-**Override CDP URL for MCP:**
+3. Start MCP on the same port:
+
+**Cursor** (`~/.cursor/mcp.json`)
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "env",
+      "args": ["RELAY_PORT=19333", "npx", "-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
+
+**Claude Code** (`~/.claude/mcp.json`)
+```json
+{
+  "mcpServers": {
+    "browserforce": {
+      "command": "env",
+      "args": ["RELAY_PORT=19333", "npx", "-y", "browserforce@latest", "mcp"]
+    }
+  }
+}
+```
 
+**OpenClaw** (`~/.openclaw/openclaw.json`)
 ```json
 {
-  "env": {
-    "BF_CDP_URL": "ws://127.0.0.1:19333/cdp?token=your-token"
+  "plugins": {
+    "entries": {
+      "mcp-adapter": {
+        "enabled": true,
+        "config": {
+          "servers": [
+            {
+              "name": "browserforce",
+              "transport": "stdio",
+              "command": "env",
+              "args": ["RELAY_PORT=19333", "npx", "-y", "browserforce@latest", "mcp"]
+            }
+          ]
+        }
+      }
+    }
   }
 }
 ```
 
+Fallback only (if you cannot pass `RELAY_PORT` in MCP config): use `BF_CDP_URL` with the exact relay port/token.
+
 **Client arbitration mode (`BF_CLIENT_MODE`):**
 
 ```bash

From 1cf026537c80228569243d0f32c42a44526eeb5a Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 17:19:00 +0530
Subject: [PATCH 090/192] mcp: trim execute prompt and refine reset guidance

---
 mcp/src/index.js | 308 +++++++++++------------------------------------
 1 file changed, 72 insertions(+), 236 deletions(-)

diff --git a/mcp/src/index.js b/mcp/src/index.js
index fe5ad80..353b110 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -7,7 +7,8 @@ import { StdioServerTransport } from '@modelcontextprotocol/sdk/server/stdio.js'
 import { z } from 'zod';
 import { chromium } from 'playwright-core';
 import {
-  getCdpUrl, getRelayHttpUrl, ensureRelay, connectOverCdpWithBusyRetry,
+  getCdpUrl, getRelayHttpUrl, getRelayHttpUrlFromCdpUrl, assertExtensionConnected,
+  ensureRelay, connectOverCdpWithBusyRetry,
   CodeExecutionTimeoutError, buildExecContext, runCode, formatResult,
 } from './exec-engine.js';
 import { loadPlugins, buildPluginHelpers, buildPluginSkillAppendix } from './plugin-loader.js';
@@ -98,11 +99,13 @@ async function ensureBrowser() {
 
   browserConnectPromise = (async () => {
     await ensureRelay();
-    const cdpUrl = withClientLabel(getCdpUrl());
+    const cdpUrl = withClientLabel(await getCdpUrl());
+    const baseUrl = getRelayHttpUrlFromCdpUrl(cdpUrl);
+    await assertExtensionConnected({ baseUrl });
     const nextBrowser = await connectOverCdpWithBusyRetry({
       connect: (url) => chromium.connectOverCDP(url),
       cdpUrl,
-      baseUrl: getRelayHttpUrl(),
+      baseUrl,
       timeoutMs: CONNECT_RETRY_TIMEOUT_MS,
     });
     browser = nextBrowser;
@@ -272,155 +275,67 @@ Globals: fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout, TextEnco
 
 IMPORTANT: Do NOT navigate the user's existing tabs. Always create or reuse a dedicated tab.
 
-On your first call, initialize state.page:
-  // Reuse an about:blank tab if one exists, otherwise create a new one
+On your first call:
   state.page = context.pages().find(p => p.url() === 'about:blank') || await context.newPage();
   await state.page.goto('https://example.com');
   await waitForPageLoad();
   return await snapshot();
 
-After setup, use state.page for ALL subsequent operations — not the default page variable.
-If state.page was closed or navigated away, recreate it:
+After setup, use state.page for all subsequent operations.
+If state.page was closed:
   if (!state.page || state.page.isClosed()) {
-    state.page = await context.newPage();
+    state.page = context.pages().find(p => p.url() === 'about:blank') || await context.newPage();
   }
 
-═══ WORKFLOW — OBSERVE → ACT → OBSERVE ═══
+═══ CORE LOOP — OBSERVE → ACT → OBSERVE ═══
 
-After every action, verify its result before proceeding:
+After every action, verify the result before proceeding.
+Each execute call should usually do one meaningful action and return verification.
+Exception: read-only bulk extraction can do multi-step execution when actions are independent.
 
-1. OBSERVE: snapshot() to understand current page state
-2. ACT: Perform ONE action (click, type, navigate, etc.)
-3. OBSERVE: snapshot() again to verify the action worked
+Recommended cycle:
+  1) OBSERVE: console.log('URL:', state.page.url()); return await snapshot();
+  2) ACT: one action (click, type, navigate, submit)
+  3) OBSERVE: snapshot() again; verify the expected change happened
 
-Never chain multiple actions blindly. If you click a button, verify it worked before clicking the next.
-Each execute call should do ONE meaningful action and return verification.
-Exception: Multi-step is allowed for read-only bulk extraction when actions are independent and no user-tab mutation occurs.
+If nothing changed, wait for load and observe again before retrying.
 
-When navigating:
-  await state.page.goto(url);
-  await waitForPageLoad();
-  return await snapshot();
-
-When clicking:
-  await state.page.locator('role=button[name="Submit"]').click();
-  await waitForPageLoad();
-  return await snapshot();
-
-When filling forms:
-  await state.page.locator('role=textbox[name="Email"]').fill('user@example.com');
-  return await snapshot();
-
-═══ SNAPSHOT FIRST ═══
-
-ALWAYS prefer snapshot() over screenshot():
-- snapshot() returns a text accessibility tree — fast, cheap, searchable
-- screenshot() returns a PNG image — expensive, requires vision processing
-
-Use snapshot() for:
-  ✓ Reading page content and text
-  ✓ Finding interactive elements (buttons, links, inputs)
-  ✓ Verifying actions succeeded
-  ✓ Checking if a page loaded correctly
-
-Use screenshot() ONLY for:
-  ✓ Visual layout verification (grids, alignment, spacing)
-  ✓ Seeing images, charts, or visual content
-  ✓ Debugging when snapshot doesn't show the issue
-
-Targeted snapshots: snapshot({ search: /pattern/i }) filters the tree.
-Scoped snapshots: snapshot({ selector: '#main' }) limits to a subtree.
-
-═══ PAGE MANAGEMENT ═══
+═══ INTERACTION RULES ═══
 
-Listing tabs:       const pages = context.pages();
-Creating a tab:     const p = await context.newPage();
-Navigating:         await state.page.goto(url);
-Current URL:        state.page.url()
-Page title:         await state.page.title()
-
-context.pages() returns ALL open tabs. Index 0 is usually the user's original tab.
-Store your working page in state.page to avoid losing track of it.
-
-For multi-tab workflows:
-  const pages = context.pages();
-  // Find a specific tab by URL
-  const gmail = pages.find(p => p.url().includes('mail.google'));
-
-═══ INTERACTING WITH ELEMENTS ═══
-
-Use Playwright locators with accessibility roles (from snapshot output):
-  await state.page.locator('role=button[name="Sign in"]').click();
-  await state.page.locator('role=textbox[name="Search"]').fill('query');
-  await state.page.locator('role=link[name="Settings"]').click();
+Selector priority:
+  1) Use fresh [ref=...] locators from snapshot output
+  2) Use role/name locators from snapshot
+  3) Use stable test IDs (data-testid)
+  4) Avoid brittle nth()/deep CSS selectors unless no stable option exists
 
-If snapshot shows [ref=e3], resolve it with refToLocator({ ref }) before acting:
+If snapshot shows [ref=e3]:
   const locator = refToLocator({ ref: 'e3' });
   if (locator) await state.page.locator(locator).click();
 
-For text content:
-  const text = await state.page.locator('role=heading').textContent();
+Before interacting, dismiss blockers:
+  await snapshot({ search: /cookie|consent|accept|reject|allow|age|verify|login|sign.in/i });
 
-Selector priority:
-  1. Use [ref=...] locators from snapshot output immediately after observing
-  2. Use role/name locators from snapshot
-  3. Use stable test IDs (data-testid) if present
-  4. Avoid brittle nth()/deep CSS selectors unless no stable option exists
-
-Before interacting, handle page blockers (cookie/consent banners, age gates, login popups):
-  const blockers = await snapshot({ search: /cookie|consent|accept|reject|allow|age|verify|login|sign.in/i });
-  // Dismiss blockers first, then continue with the main task
-
-Avoid stale locator usage:
-  // BAD: using a stale locator from an old snapshot after DOM changes
-  // GOOD: refresh observation first, then act with new refs/locators
-  await snapshot();
-
-Typing text with newlines:
-  // Use fill() for multiline blocks to avoid accidental Enter key submissions
+For multiline text, prefer fill() with \\n:
   await state.page.locator('role=textbox[name="Message"]').fill('Line 1\\nLine 2');
 
-═══ TACTICAL ANTI-PATTERNS ═══
-
-Popup control:
-  ✗ Don’t click through a popup without confirming what changed
-  ✓ Dismiss popup, then run snapshot() immediately to confirm main UI is usable
+═══ SNAPSHOT VS SCREENSHOT ═══
 
-Consent blockers:
-  ✗ Don’t continue form/page actions while consent banners block focus
-  ✓ Handle cookie/consent overlays first, then retry the intended action
+Prefer snapshot() for text/content/verification.
+Use screenshotWithAccessibilityLabels() only when visual layout or spatial relationships matter.
 
-Stale locators:
-  ✗ Don’t reuse [ref=...] values after DOM/nav updates
-  ✓ Refresh snapshot() and use the newest refs/role locators
+Use cleanHTML/pageMarkdown for extraction:
+  - snapshot() for interactive structure and refs
+  - cleanHTML() for structured DOM extraction/parsing
+  - pageMarkdown() for article-like content
 
-Newline typing:
-  ✗ Don’t use keyboard Enter loops for multiline textareas unless explicitly needed
-  ✓ Prefer locator.fill('line1\\nline2') for deterministic multiline input
+═══ PARALLEL TAB EXTRACTION ═══
 
-Raw CDP sessions:
-  ✗ Don’t call page.context().newCDPSession(page) directly
-  ✓ Use getCDPSession({ page }) for relay-safe CDP session creation
+Read browserforceSettings.executionMode before choosing strategy.
+For independent read-only extraction tasks, start with parallel tabs (cap concurrency, usually 3-8).
+On 429/challenges/timeouts: retry with lower concurrency, then sequential if needed.
+If visibility mode requires showing work (for example, rotating/foreground demos), bringing your own working tab to front is allowed.
 
-═══ EXTRACTION DECISION TREE ═══
-
-snapshot vs cleanHTML vs pageMarkdown:
-  1) Use snapshot() when you need current interactive structure, labels, and refs.
-  2) Use cleanHTML(selector?) when you need structured DOM content for parsing/extraction.
-  3) Use pageMarkdown() for article/blog/news pages where nav/ads should be removed.
-  4) Use screenshotWithAccessibilityLabels() only when layout/visual evidence is required.
-
-═══ BROWSERFORCE TAB SWARMS // PARALLEL TABS PROCESSING ═══
-
-Parallel-first policy for independent extraction:
-  Read browserforceSettings.executionMode before choosing swarm strategy. Settings are session defaults.
-  1) For count/list/extraction across independent pages, dates, or items, start with parallel tabs first.
-  2) Use Promise.all with a concurrency cap (typically 3-8; start at 5 unless site limits are known).
-  3) Keep swarm runs read-only and isolated to agent-created tabs (no checkout/purchase/send/delete/profile changes).
-  4) If you hit 429, anti-bot challenges, or repeated timeouts, automatically retry with reduced concurrency.
-  5) If reduced concurrency still fails, retry sequentially.
-
-Always return telemetry for swarm runs:
+Return telemetry for swarm runs:
   {
     peakConcurrentTasks,
     wallClockMs,
@@ -429,122 +344,43 @@ Always return telemetry for swarm runs:
     retries
   }
 
-═══ DEBUGGING WORKFLOW ═══
-
-Combine snapshot + logs:
-  1) snapshot({ search: /target text|button|error/i }) to verify element presence and naming
-  2) getLogs({ count: 30 }) for runtime/network/console errors
-  3) page.evaluate(() => { ...visibility checks... }) to validate hidden/disabled/overlay states
-
-Example visibility check:
-  return await state.page.evaluate(() => {
-    const el = document.querySelector('[data-testid="submit"]');
-    if (!el) return { found: false };
-    const s = getComputedStyle(el);
-    const r = el.getBoundingClientRect();
-    return { found: true, visible: s.display !== 'none' && s.visibility !== 'hidden' && r.width > 0 && r.height > 0 };
-  });
-
-═══ ADVANCED PATTERNS ═══
-
-Authenticated fetch:
-  // Reuse browser session cookies/headers from the current page context
-  return await state.page.evaluate(async () => {
-    const res = await fetch('/api/me', { credentials: 'include' });
-    return { status: res.status, body: await res.text() };
-  });
-
-Network interception:
-  await state.page.route('**/api/**', async (route) => {
-    const request = route.request();
-    // Inspect/modify request here if needed before continuing
-    await route.continue();
-  });
-
-Downloads:
-  // Use expect_download pattern and save path after click/navigation trigger
-  const [download] = await Promise.all([
-    state.page.waitForEvent('download'),
-    state.page.locator('role=button[name="Export CSV"]').click(),
-  ]);
-  return { suggestedFilename: download.suggestedFilename() };
-
-═══ COMMON PATTERNS ═══
-
-Navigate and read:
-  await state.page.goto('https://example.com');
-  await waitForPageLoad();
-  return await snapshot();
-
-Click and verify:
-  await state.page.locator('role=button[name="Next"]').click();
-  await waitForPageLoad();
-  return await snapshot();
+═══ DEBUGGING QUICK LOOP ═══
 
-Fill form and submit:
-  await state.page.locator('role=textbox[name="Username"]').fill('user');
-  await state.page.locator('role=textbox[name="Password"]').fill('pass');
-  await state.page.locator('role=button[name="Login"]').click();
-  await waitForPageLoad();
-  return await snapshot();
+1) snapshot({ search: /button|dialog|error|target/i })
+2) getLogs({ count: 30 })
+3) state.page.evaluate(...) for visibility/disabled/overlay checks
 
-Extract data:
-  return await state.page.evaluate(() => {
-    return document.querySelector('.price').textContent;
-  });
-
-Wait for specific element:
-  await state.page.locator('role=heading[name="Dashboard"]').waitFor();
-  return await snapshot();
+For JS-heavy or authenticated sites, stay in browser automation.
+Do not switch to raw HTTP/curl expecting fully rendered DOM state.
 
-Debug with console logs:
-  return getLogs({ count: 20 });
+═══ HARD RULES ═══
 
-When you need the full tree instead of diff output:
-  return await snapshot({ showDiffSinceLastCall: false });
-
-═══ ANTI-PATTERNS ═══
-
-✗ Don't navigate the user's existing tabs — create your own via context.newPage()
-✗ Don't screenshot() to read text — use snapshot()
-✗ Don't chain actions without verifying — observe after each action
-✗ Don't use page.waitForTimeout() — use waitForPageLoad() or waitFor()
-✗ Don't forget to return a value — every call should return verification
-✗ Don't write complex multi-step scripts by default — split into separate execute calls
-✓ Exception: Multi-step is allowed for read-only bulk extraction when actions are independent and no user-tab mutation occurs
-✗ Don't use page variable directly — use state.page after first call setup
+✗ Don't navigate the user's existing tabs
+✗ Don't screenshot to read text; use snapshot
+✗ Don't chain actions blindly without verification
+✗ Don't use page.waitForTimeout() when a deterministic wait is available
+✗ Don't use stale refs after DOM/navigation updates
+✗ Don't call page.context().newCDPSession(page); use getCDPSession({ page })
+✗ Don't call browser.close() or context.close()
+✗ Don't call page.bringToFront() by default; only use it when user asks or when visibility mode needs visible tab progression
+✗ Don't use the default page variable for ongoing work after setup; use state.page
 
 ═══ ERROR RECOVERY ═══
 
-If page closed:      state.page = await context.newPage();
-If navigation fails: Check state.page.url() to see where you actually are
-If element missing:   Use snapshot({ search: /element/ }) to find it
-If connection lost:   Call the reset tool, then re-initialize state.page
-If timeout:          Increase timeout param, or break into smaller steps
-
-═══ API REFERENCE ═══
-
-snapshot(options?)
-  options.selector  CSS selector to scope the snapshot (e.g., '#main', '.sidebar')
-  options.search    Regex string to filter tree nodes (e.g., 'button|link')
-  options.showDiffSinceLastCall  When true (default), returns a smart diff from previous snapshot when unchanged scope+search is not used
-  Returns: Text accessibility tree with interactive element refs
-
-waitForPageLoad(options?)
-  options.timeout   Max wait in ms (default: 30000)
-  Returns: { success, readyState, pendingRequests, waitTimeMs, timedOut }
-  Filters analytics/ad requests that never finish. Polls document.readyState.
-
-getLogs(options?)
-  options.count     Number of recent entries (default: all)
-  Returns: Array of "[type] message" strings from browser console
+If page closed:      recreate state.page with context.newPage() (or reuse about:blank)
+If navigation fails: check current URL, then snapshot() to re-ground state
+If element missing:  use snapshot({ search: /.../ }) with tighter patterns
+If connection lost:  call reset, then reinitialize state.page
+If timeout:          increase timeout or break work into smaller execute calls
+If Chrome/extension unavailable: ask user to open Chrome, keep at least one normal web tab open, and ensure BrowserForce extension is connected
 
-clearLogs()
-  Clears captured console logs for current page.
+═══ API QUICK REFERENCE ═══
 
-state
-  Persistent object — survives across execute calls. Cleared on reset.
-  Use state.page, state.data, state.anything to preserve working state.`;
+snapshot(options?) -> text accessibility tree with interactive refs
+waitForPageLoad(options?) -> { success, readyState, pendingRequests, waitTimeMs, timedOut }
+getLogs(options?) -> browser console log entries
+clearLogs() -> clears captured logs for current page
+state -> persistent across execute calls; cleared on reset`;
 
 function registerExecuteTool(skillAppendix = '') {
   server.tool(
@@ -578,7 +414,7 @@ function registerExecuteTool(skillAppendix = '') {
         return { content };
       } catch (err) {
         const isTimeout = err instanceof CodeExecutionTimeoutError;
-        const hint = isTimeout ? '' : '\n\n[If connection lost, call reset tool to reconnect]';
+        const hint = isTimeout ? '' : '\n\n[HINT: Call reset only for connection/internal failures (relay disconnect, page/context closed, Playwright internal/assertion issues). For normal selector/logic errors, fix and retry without reset.]';
         return {
           content: [{ type: 'text', text: `Error: ${err.message}${hint}` }],
           isError: true,
@@ -590,7 +426,7 @@ function registerExecuteTool(skillAppendix = '') {
 
 server.tool(
   'reset',
-  'Reconnects to the relay, reinitializes the browser context, and clears persistent state. Use when: connection lost, pages closed unexpectedly, or state is corrupt.',
+  'Reconnects CDP and reinitializes browser/page bindings. Use when MCP stops responding, connection errors occur, pages/context were closed, or state is inconsistent. Reset clears persistent state; reinitialize state.page after calling it.',
   {},
   async () => {
     if (browser) {

From b8afe45030b66382cdd07da6a7c189fcaa284168 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 23:32:55 +0530
Subject: [PATCH 091/192] fix(mcp): restore missing exec-engine exports for
 startup

---
 mcp/test/mcp-tools.test.js | 12 ++++++++++++
 1 file changed, 12 insertions(+)

diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 9408a87..205b502 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -56,6 +56,18 @@ describe('Tool Definitions', () => {
     assert.equal(result, '');
   });
 
+  it('index imports only exports available from exec-engine', async () => {
+    const { execSync } = await import('node:child_process');
+    const result = execSync(
+      'node --input-type=module -e "import { getCdpUrl, getRelayHttpUrl, getRelayHttpUrlFromCdpUrl, assertExtensionConnected, ensureRelay, connectOverCdpWithBusyRetry, CodeExecutionTimeoutError, buildExecContext, runCode, formatResult } from \'./src/exec-engine.js\'; console.log(typeof getRelayHttpUrlFromCdpUrl, typeof assertExtensionConnected);"',
+      {
+        cwd: join(import.meta.url.replace('file://', ''), '../../'),
+        encoding: 'utf8',
+      }
+    ).trim();
+    assert.equal(result, 'function function');
+  });
+
   it('registers exactly 2 tools: execute, reset', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),

From d123a08b3acee12f5a6befaae99461d49ee9cfc8 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 26 Feb 2026 23:51:26 +0530
Subject: [PATCH 092/192] chore: bump version to 1.0.16 in package.json

---
 package.json | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/package.json b/package.json
index 690904b..1dafb53 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce",
-  "version": "1.0.15",
+  "version": "1.0.16",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",

From 0fdf7d1fbc96bff9fab83199285adec7f5872ce7 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Fri, 27 Feb 2026 09:53:09 +0530
Subject: [PATCH 093/192] feat(mcp): introduce browserforceRestrictions to exec
 context and session management

- Added runtimeRestrictions parameter to buildExecContext for managing browserforce settings.
- Implemented normalization and caching for browserforceRestrictions in session management.
- Updated documentation to include new restrictions and their implications for execution strategies.
- Enhanced tests to validate the inclusion and behavior of browserforceRestrictions in the execution context.
---
 mcp/src/exec-engine.js     |   9 ++++
 mcp/src/index.js           | 108 +++++++++++++++++++++++++++++++++----
 mcp/test/mcp-tools.test.js |  61 +++++++++++++++++++++
 3 files changed, 167 insertions(+), 11 deletions(-)

diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index c98fe85..2169faa 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -553,6 +553,7 @@ export function buildExecContext(
   consoleHelpers = {},
   pluginHelpers = {},
   agentPreferences = {},
+  runtimeRestrictions = {},
 ) {
   const { consoleLogs, setupConsoleCapture } = consoleHelpers;
   const lastSnapshots = userState.__lastSnapshots || (userState.__lastSnapshots = new WeakMap());
@@ -660,6 +661,13 @@ export function buildExecContext(
     executionMode: agentPreferences?.executionMode === 'sequential' ? 'sequential' : 'parallel',
     parallelVisibilityMode: 'foreground-tab',
   };
+  const browserforceRestrictions = {
+    mode: runtimeRestrictions?.mode === 'manual' ? 'manual' : 'auto',
+    lockUrl: !!runtimeRestrictions?.lockUrl,
+    noNewTabs: !!runtimeRestrictions?.noNewTabs,
+    readOnly: !!runtimeRestrictions?.readOnly,
+    instructions: typeof runtimeRestrictions?.instructions === 'string' ? runtimeRestrictions.instructions : '',
+  };
 
   // Wrap plugin helpers to auto-inject (page, ctx, state) as first three args
   const wrappedPluginHelpers = {};
@@ -674,6 +682,7 @@ export function buildExecContext(
   return {
     ...wrappedPluginHelpers,           // plugin helpers spread first — built-ins always win
     browserforceSettings,
+    browserforceRestrictions,
     page: defaultPage, context: ctx, state: userState,
     snapshot, refToLocator, waitForPageLoad, getLogs, clearLogs, getCDPSession,
     screenshotWithAccessibilityLabels, cleanHTML, pageMarkdown,
diff --git a/mcp/src/index.js b/mcp/src/index.js
index 353b110..e3e73c6 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -188,7 +188,15 @@ const DEFAULT_AGENT_PREFERENCES = Object.freeze({
   executionMode: 'parallel',
   parallelVisibilityMode: 'foreground-tab',
 });
+const DEFAULT_BROWSERFORCE_RESTRICTIONS = Object.freeze({
+  mode: 'auto',
+  lockUrl: false,
+  noNewTabs: false,
+  readOnly: false,
+  instructions: '',
+});
 let cachedAgentPreferences = null;
+let cachedBrowserforceRestrictions = null;
 
 function normalizeAgentPreferences(raw) {
   const executionMode = raw?.executionMode === 'sequential' ? 'sequential' : 'parallel';
@@ -218,6 +226,37 @@ async function getAgentPreferencesForSession() {
   }
 }
 
+function normalizeRestrictions(raw) {
+  return {
+    mode: raw?.mode === 'manual' ? 'manual' : 'auto',
+    lockUrl: !!raw?.lockUrl,
+    noNewTabs: !!raw?.noNewTabs,
+    readOnly: !!raw?.readOnly,
+    instructions: typeof raw?.instructions === 'string' ? raw.instructions : '',
+  };
+}
+
+async function getBrowserforceRestrictionsForSession() {
+  if (cachedBrowserforceRestrictions) {
+    return cachedBrowserforceRestrictions;
+  }
+
+  try {
+    const response = await fetch(`${getRelayHttpUrl()}/restrictions`, {
+      signal: AbortSignal.timeout(2000),
+    });
+    if (!response.ok) {
+      throw new Error(`HTTP ${response.status}`);
+    }
+    const raw = await response.json();
+    cachedBrowserforceRestrictions = normalizeRestrictions(raw);
+    return cachedBrowserforceRestrictions;
+  } catch {
+    cachedBrowserforceRestrictions = { ...DEFAULT_BROWSERFORCE_RESTRICTIONS };
+    return cachedBrowserforceRestrictions;
+  }
+}
+
 // ─── Plugin State ────────────────────────────────────────────────────────────
 
 let plugins = [];
@@ -249,6 +288,8 @@ Variables:
   state       Persistent object across calls (cleared on reset). Store your working page here.
   browserforceSettings Session defaults loaded once per MCP session (refresh on reset).
                       Keys: executionMode, parallelVisibilityMode.
+  browserforceRestrictions Session restrictions from extension/relay.
+                      Keys: mode, lockUrl, noNewTabs, readOnly, instructions.
 
 Helpers:
   snapshot({ selector?, search?, showDiffSinceLastCall? })   Accessibility tree as text. 10-100x cheaper than screenshots.
@@ -287,11 +328,36 @@ If state.page was closed:
     state.page = context.pages().find(p => p.url() === 'about:blank') || await context.newPage();
   }
 
+═══ URL DISCOVERY (NO GUESSING) ═══
+
+Do NOT guess deep links when the site already exposes navigation links.
+When discovering a section/page:
+  1) Snapshot first and inspect visible refs.
+  2) Prefer clicking discovered links/buttons or reading hrefs from those elements.
+  3) Only construct a URL manually if there is no discoverable navigation path.
+  4) If a guessed URL fails (404/wrong content), back up and derive it from on-page links.
+
+Example href discovery:
+  const hrefs = await state.page.evaluate(() =>
+    Array.from(document.querySelectorAll('a')).map(a => ({ text: a.textContent?.trim(), href: a.getAttribute('href') }))
+  );
+
+═══ SETTINGS & STRATEGY PRECHECK ═══
+
+Read browserforceSettings + browserforceRestrictions before planning execution.
+- executionMode=sequential: do one task at a time; do not run tab swarms.
+- executionMode=parallel: parallelize only independent read-only tasks.
+- parallelVisibilityMode=foreground-tab: new tabs are visible in the current window; avoid disruptive tab choreography.
+- mode=manual or noNewTabs=true: do not create tabs, only operate on user-attached tabs.
+- lockUrl=true: do not navigate away from current URL (reload is allowed).
+- readOnly=true: no click/type/submit actions; observe with snapshot/screenshot/evaluate only.
+- instructions: treat as mandatory policy text for this session.
+
 ═══ CORE LOOP — OBSERVE → ACT → OBSERVE ═══
 
 After every action, verify the result before proceeding.
 Each execute call should usually do one meaningful action and return verification.
-Exception: read-only bulk extraction can do multi-step execution when actions are independent.
+Multi-step is allowed for read-only bulk extraction when actions are independent.
 
 Recommended cycle:
   1) OBSERVE: console.log('URL:', state.page.url()); return await snapshot();
@@ -315,23 +381,38 @@ If snapshot shows [ref=e3]:
 Before interacting, dismiss blockers:
   await snapshot({ search: /cookie|consent|accept|reject|allow|age|verify|login|sign.in/i });
 
+Handle login popups by preferring controllable tabs over blocked popup windows.
+
 For multiline text, prefer fill() with \\n:
   await state.page.locator('role=textbox[name="Message"]').fill('Line 1\\nLine 2');
 
+═══ SNAPSHOT DIFF CONTROL ═══
+
+Use snapshot({ showDiffSinceLastCall: true }) to get concise diffs when repeatedly observing the same page.
+Use snapshot({ showDiffSinceLastCall: false }) when you need full output.
+
 ═══ SNAPSHOT VS SCREENSHOT ═══
 
 Prefer snapshot() for text/content/verification.
 Use screenshotWithAccessibilityLabels() only when visual layout or spatial relationships matter.
 
-Use cleanHTML/pageMarkdown for extraction:
-  - snapshot() for interactive structure and refs
-  - cleanHTML() for structured DOM extraction/parsing
-  - pageMarkdown() for article-like content
+snapshot vs cleanHTML vs pageMarkdown:
+  - snapshot(): interactive structure, refs, quick verification
+  - cleanHTML(): structured DOM extraction/parsing
+  - pageMarkdown(): article-like content extraction
+
+Authenticated fetch:
+  Use state.page.evaluate(() => fetch(...)) when authenticated browser session context matters.
+
+Downloads:
+  Prefer browser-driven download flows for large outputs instead of printing huge payloads.
 
-═══ PARALLEL TAB EXTRACTION ═══
+═══ BROWSERFORCE TAB SWARMS // PARALLEL TABS PROCESSING ═══
 
 Read browserforceSettings.executionMode before choosing strategy.
-For independent read-only extraction tasks, start with parallel tabs (cap concurrency, usually 3-8).
+For independent read-only extraction tasks, use Promise.all with a concurrency cap (usually 3-8, start at 5).
+Never run Promise.all actions against the same Page object.
+Parallel task rule: one tab/page per task, then aggregate results.
 On 429/challenges/timeouts: retry with lower concurrency, then sequential if needed.
 If visibility mode requires showing work (for example, rotating/foreground demos), bringing your own working tab to front is allowed.
 
@@ -350,6 +431,7 @@ Return telemetry for swarm runs:
 2) getLogs({ count: 30 })
 3) state.page.evaluate(...) for visibility/disabled/overlay checks
 
+Combine snapshot + logs to debug JS-heavy failures.
 For JS-heavy or authenticated sites, stay in browser automation.
 Do not switch to raw HTTP/curl expecting fully rendered DOM state.
 
@@ -359,7 +441,7 @@ Do not switch to raw HTTP/curl expecting fully rendered DOM state.
 ✗ Don't screenshot to read text; use snapshot
 ✗ Don't chain actions blindly without verification
 ✗ Don't use page.waitForTimeout() when a deterministic wait is available
-✗ Don't use stale refs after DOM/navigation updates
+✗ Don't use stale refs after DOM/navigation updates (stale locator refs cause false actions)
 ✗ Don't call page.context().newCDPSession(page); use getCDPSession({ page })
 ✗ Don't call browser.close() or context.close()
 ✗ Don't call page.bringToFront() by default; only use it when user asks or when visibility mode needs visible tab progression
@@ -376,7 +458,7 @@ If Chrome/extension unavailable: ask user to open Chrome, keep at least one norm
 
 ═══ API QUICK REFERENCE ═══
 
-snapshot(options?) -> text accessibility tree with interactive refs
+snapshot(options?) -> text accessibility tree with interactive refs; options.showDiffSinceLastCall toggles diff/full output
 waitForPageLoad(options?) -> { success, readyState, pendingRequests, waitTimeMs, timedOut }
 getLogs(options?) -> browser console log entries
 clearLogs() -> clears captured logs for current page
@@ -393,7 +475,10 @@ function registerExecuteTool(skillAppendix = '') {
     async ({ code, timeout = 30000 }) => {
       await ensureBrowser();
       ensureAllPagesCapture();
-      const agentPreferences = await getAgentPreferencesForSession();
+      const [agentPreferences, browserforceRestrictions] = await Promise.all([
+        getAgentPreferencesForSession(),
+        getBrowserforceRestrictionsForSession(),
+      ]);
       const ctx = getContext();
       const pages = ctx.pages();
       const page = pages[0] || null;
@@ -401,7 +486,7 @@ function registerExecuteTool(skillAppendix = '') {
       if (page) setupConsoleCapture(page);
       const execCtx = buildExecContext(page, ctx, userState, {
         consoleLogs, setupConsoleCapture,
-      }, pluginHelpers, agentPreferences);
+      }, pluginHelpers, agentPreferences, browserforceRestrictions);
       try {
         const result = await runCode(code, execCtx, timeout);
         const formatted = formatResult(result);
@@ -435,6 +520,7 @@ server.tool(
     browser = null;
     userState = {};
     cachedAgentPreferences = null;
+    cachedBrowserforceRestrictions = null;
     contextListenerAttached = false;
     consoleLogs.clear();
     try {
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index 205b502..cd0b543 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -113,6 +113,7 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('page'), 'should mention page');
     assert.ok(promptBlock.includes('context'), 'should mention context');
     assert.ok(promptBlock.includes('state'), 'should mention state');
+    assert.ok(promptBlock.includes('browserforceRestrictions'), 'should mention browserforceRestrictions for extension policy');
     // Key behavioral guidance
     assert.ok(promptBlock.includes('state.page'), 'should mention state.page for page management');
     assert.ok(promptBlock.includes('snapshot'), 'should mention snapshot-first approach');
@@ -188,6 +189,29 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('retries'), 'should require retries telemetry');
   });
 
+  it('execute prompt guards against guessed URLs and unsafe single-page parallelism', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
+      'utf8'
+    );
+    const promptStart = source.indexOf('const EXECUTE_PROMPT');
+    const promptEnd = source.indexOf("server.tool(\n  'execute'");
+    const promptBlock = source.slice(promptStart, promptEnd);
+
+    assert.ok(
+      promptBlock.includes('URL DISCOVERY (NO GUESSING)'),
+      'should include explicit no-guessing URL discovery guidance'
+    );
+    assert.ok(
+      promptBlock.includes('Do NOT guess deep links when the site already exposes navigation links'),
+      'should require deriving URLs from visible on-page links first'
+    );
+    assert.ok(
+      promptBlock.includes('Never run Promise.all actions against the same Page object'),
+      'should forbid parallel interactions against one page object'
+    );
+  });
+
   it('execute tool has code and optional timeout params', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),
@@ -261,6 +285,22 @@ describe('Tool Definitions', () => {
     );
   });
 
+  it('execute context includes browserforceRestrictions', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/exec-engine.js'),
+      'utf8'
+    );
+
+    assert.ok(
+      source.includes('browserforceRestrictions'),
+      'exec context should expose browserforceRestrictions in the sandbox scope'
+    );
+    assert.ok(
+      source.includes('lockUrl') && source.includes('noNewTabs') && source.includes('readOnly'),
+      'browserforceRestrictions should include lockUrl, noNewTabs, and readOnly flags'
+    );
+  });
+
   it('MCP preferences fetch is cached once per session', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),
@@ -278,6 +318,23 @@ describe('Tool Definitions', () => {
     );
   });
 
+  it('MCP restrictions fetch is cached once per session', () => {
+    const source = readFileSync(
+      join(import.meta.url.replace('file://', ''), '../../src/index.js'),
+      'utf8'
+    );
+
+    assert.ok(source.includes('cachedBrowserforceRestrictions'), 'should track cached browserforce restrictions');
+    assert.ok(
+      source.includes('if (cachedBrowserforceRestrictions)'),
+      'should return cached restrictions without refetching'
+    );
+    assert.ok(
+      source.includes('/restrictions'),
+      'should fetch restrictions from relay /restrictions endpoint'
+    );
+  });
+
   it('reset clears cached preferences', () => {
     const source = readFileSync(
       join(import.meta.url.replace('file://', ''), '../../src/index.js'),
@@ -291,6 +348,10 @@ describe('Tool Definitions', () => {
       resetBlock.includes('cachedAgentPreferences = null'),
       'reset should clear cached agent preferences'
     );
+    assert.ok(
+      resetBlock.includes('cachedBrowserforceRestrictions = null'),
+      'reset should clear cached browserforce restrictions'
+    );
   });
 });
 

From e3f69565bd0d26ecdc8beb88220a0b257162547b Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Sun, 1 Mar 2026 14:08:51 +0530
Subject: [PATCH 094/192] feat(openclaw): auto-install scoped policy plugin in
 setup

---
 README.md                          | 42 +++++++++++--------
 bin.js                             | 26 ++++++++++++
 mcp/src/index.js                   |  5 +++
 plugins/official/openclaw/SKILL.md |  7 ++++
 plugins/official/openclaw/index.js |  6 +++
 plugins/registry.json              |  8 ++++
 test/cli.test.js                   | 65 ++++++++++++++++++++++++++++++
 7 files changed, 142 insertions(+), 17 deletions(-)
 create mode 100644 plugins/official/openclaw/SKILL.md
 create mode 100644 plugins/official/openclaw/index.js

diff --git a/README.md b/README.md
index 669914e..59ca2ba 100644
--- a/README.md
+++ b/README.md
@@ -103,6 +103,8 @@ browserforce install-extension
 browserforce setup openclaw
 ```
 
+`setup openclaw` now auto-installs the official `openclaw` BrowserForce plugin into `~/.browserforce/plugins/openclaw/` so OpenClaw gets BrowserForce-specific usage policy without affecting other agents.
+
 Optional: install the BrowserForce skill for your OpenClaw agent:
 
 ```bash
@@ -142,6 +144,19 @@ browserforce serve
 
 If your agent browses to the page and responds with the title, you're all set.
 
+#### OpenClaw + BrowserForce Flow
+
+```mermaid
+flowchart LR
+  U["User (Telegram / WhatsApp / OpenClaw UI)"] --> OA["OpenClaw Agent"]
+  OA --> MCP["BrowserForce MCP (`browserforce mcp`)"]
+  MCP --> RELAY["Relay (`127.0.0.1:19222`)"]
+  RELAY --> EXT["Chrome Extension (MV3)"]
+  EXT --> CHROME["User's Real Chrome Session"]
+  SETUP["`browserforce setup openclaw`"] --> PLUGIN["Auto-install `openclaw` plugin\n(SKILL appended to execute prompt)"]
+  PLUGIN --> MCP
+```
+
 **MCP setup (advanced):**
 
 <details>
@@ -382,6 +397,7 @@ That's it. Restart MCP (or Claude Desktop) and `highlight()` is available in eve
 | Plugin      | What it adds                                                                                   | Install                                 |
 | ----------- | ---------------------------------------------------------------------------------------------- | --------------------------------------- |
 | `highlight` | `highlight(selector, color?)` — outlines matching elements; `clearHighlights()` — removes them | `browserforce plugin install highlight` |
+| `openclaw`  | OpenClaw-specific BrowserForce tab policy (skill text only, no helper functions)              | Auto-installed by `browserforce setup openclaw` |
 
 
 ### Use an installed plugin
@@ -718,23 +734,15 @@ Get started with simple prompts. The AI generates code and does the work.
 
 ## How It Works
 
-```
-  Agent (OpenClaw, Claude, etc.)
-         │
-         ├─ MCP server (stdio)
-         ├─ CLI (browserforce -e)
-         │
-         │ CDP over WebSocket
-         ▼
-  Relay Server (localhost:19222)
-         │
-         │ WebSocket
-         ▼
-  Chrome Extension (MV3)
-         │
-         │ chrome.debugger API
-         ▼
-  Your Real Chrome Browser
+```mermaid
+flowchart TD
+  AGENT["Agent (OpenClaw, Claude, Codex, etc.)"] --> MCP["MCP Server (`browserforce mcp`)"]
+  AGENT --> CLI["CLI (`browserforce -e`)"]
+  MCP --> RELAY["Relay Server (`127.0.0.1:19222`)"]
+  CLI --> RELAY
+  RELAY --> EXT["Chrome Extension (MV3 Service Worker)"]
+  EXT --> CDP["`chrome.debugger` bridge"]
+  CDP --> BROWSER["User's Real Chrome Browser"]
 ```
 
 The **relay server** runs on your machine (localhost only). It translates between the agent's CDP commands and the extension's debugger bridge.
diff --git a/bin.js b/bin.js
index 89721e3..b08da13 100644
--- a/bin.js
+++ b/bin.js
@@ -468,6 +468,7 @@ async function cmdSetup() {
   const homeDir = homedir();
   const openclawConfigPath = join(homeDir, '.openclaw', 'openclaw.json');
   const openclawDir = dirname(openclawConfigPath);
+  const pluginsDir = process.env.BF_PLUGINS_DIR || join(homeDir, '.browserforce', 'plugins');
 
   let existingConfig = {};
   let configExisted = false;
@@ -541,6 +542,23 @@ async function cmdSetup() {
     };
   }
 
+  // OpenClaw-specific guidance should only affect OpenClaw users.
+  // Install the openclaw plugin after setup (best effort).
+  let openclawPlugin = null;
+  if (!dryRun) {
+    try {
+      const { installPlugin } = await import('./mcp/src/plugin-installer.js');
+      await installPlugin('openclaw', pluginsDir);
+      openclawPlugin = { name: 'openclaw', installed: true };
+    } catch (err) {
+      openclawPlugin = {
+        name: 'openclaw',
+        installed: false,
+        error: err?.message || String(err),
+      };
+    }
+  }
+
   const result = {
     target: 'openclaw',
     dryRun,
@@ -549,6 +567,7 @@ async function cmdSetup() {
     configExisted,
     configWritten: !dryRun,
     autostart,
+    ...(openclawPlugin ? { openclawPlugin } : {}),
   };
 
   if (values.json) {
@@ -567,6 +586,13 @@ async function cmdSetup() {
     console.log(`  autostart.platform: ${autostart.platform}`);
     console.log(`  autostart: ${dryRun ? 'dry-run (not applied)' : 'applied'}`);
   }
+  if (openclawPlugin) {
+    if (openclawPlugin.installed) {
+      console.log('  openclawPlugin: installed');
+    } else {
+      console.log(`  openclawPlugin: install failed (${openclawPlugin.error})`);
+    }
+  }
 }
 
 function cmdHelp() {
diff --git a/mcp/src/index.js b/mcp/src/index.js
index e3e73c6..541796c 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -353,6 +353,11 @@ Read browserforceSettings + browserforceRestrictions before planning execution.
 - readOnly=true: no click/type/submit actions; observe with snapshot/screenshot/evaluate only.
 - instructions: treat as mandatory policy text for this session.
 
+Empty tabs/targets handling:
+- If tabs/targets are empty, treat it as normal startup state and create/reuse a dedicated tab with context.newPage().
+- Do not ask the user to click Attach/Share by default.
+- Ask for manual Attach/Share only when mode=manual or noNewTabs=true, or when the user explicitly asks to use their current tab.
+
 ═══ CORE LOOP — OBSERVE → ACT → OBSERVE ═══
 
 After every action, verify the result before proceeding.
diff --git a/plugins/official/openclaw/SKILL.md b/plugins/official/openclaw/SKILL.md
new file mode 100644
index 0000000..4069ff9
--- /dev/null
+++ b/plugins/official/openclaw/SKILL.md
@@ -0,0 +1,7 @@
+## BrowserForce tab policy (OpenClaw)
+
+- Do not ask the user to click Attach/Share by default.
+- If tabs/targets are empty, create or reuse a dedicated tab with `context.newPage()`, then navigate and call `snapshot()`.
+- Ask for manual Attach/Share only when `mode=manual` or `noNewTabs=true`, or when the user explicitly asks to use their current tab.
+- If blocked, report one concrete reason and one concrete user action.
+- Do not repeat "click extension icon again" loops without a new error signal.
diff --git a/plugins/official/openclaw/index.js b/plugins/official/openclaw/index.js
new file mode 100644
index 0000000..eaaa1dc
--- /dev/null
+++ b/plugins/official/openclaw/index.js
@@ -0,0 +1,6 @@
+export default {
+  name: 'openclaw',
+  description: 'OpenClaw-specific BrowserForce usage policy',
+  version: '1.0.0',
+  helpers: {},
+};
diff --git a/plugins/registry.json b/plugins/registry.json
index ec5a90e..da6006e 100644
--- a/plugins/registry.json
+++ b/plugins/registry.json
@@ -8,6 +8,14 @@
       "url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/official/highlight/index.js",
       "skill_url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/official/highlight/SKILL.md",
       "sha256": "d302bd9a0f6e96bd0c7a8666b560e01ab88f9f9e4c4694f14d97019f4cc04424"
+    },
+    {
+      "name": "openclaw",
+      "description": "OpenClaw-specific BrowserForce usage policy",
+      "version": "1.0.0",
+      "url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/official/openclaw/index.js",
+      "skill_url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/official/openclaw/SKILL.md",
+      "sha256": "a7113a603aa1ee50bdb97b5b1ab50b1e36d9bcf0b7bf8b6fb244b638329f8763"
     }
   ]
 }
diff --git a/test/cli.test.js b/test/cli.test.js
index 2bdcaae..41b0b8b 100644
--- a/test/cli.test.js
+++ b/test/cli.test.js
@@ -391,6 +391,71 @@ describe('CLI setup', () => {
     rmSync(homeDir, { recursive: true, force: true });
   });
 
+  it('setup openclaw --json installs openclaw plugin for OpenClaw users', async () => {
+    const homeDir = join(tmpdir(), `bf-openclaw-home-${Math.random().toString(36).slice(2)}`);
+    mkdirSync(homeDir, { recursive: true });
+
+    const fakeRegistry = {
+      plugins: [{
+        name: 'openclaw',
+        url: 'https://example.com/plugins/openclaw/index.js',
+        skill_url: 'https://example.com/plugins/openclaw/SKILL.md',
+        sha256: null,
+      }],
+    };
+
+    const { stdout } = await exec('node', ['bin.js', 'setup', 'openclaw', '--json', '--no-autostart'], {
+      env: {
+        ...process.env,
+        HOME: homeDir,
+        BF_TEST_REGISTRY: JSON.stringify(fakeRegistry),
+        BF_TEST_PLUGIN_JS: 'export default { name: "openclaw", helpers: {} };',
+        BF_TEST_PLUGIN_SKILL: '# OpenClaw BrowserForce policy',
+      },
+    });
+
+    const result = JSON.parse(stdout);
+    assert.equal(result.target, 'openclaw');
+    assert.equal(result.dryRun, false);
+    assert.deepEqual(result.openclawPlugin, { name: 'openclaw', installed: true });
+    assert.equal(existsSync(join(homeDir, '.browserforce', 'plugins', 'openclaw', 'index.js')), true);
+    assert.equal(existsSync(join(homeDir, '.browserforce', 'plugins', 'openclaw', 'SKILL.md')), true);
+
+    rmSync(homeDir, { recursive: true, force: true });
+  });
+
+  it('setup openclaw --dry-run does not install openclaw plugin', async () => {
+    const homeDir = join(tmpdir(), `bf-openclaw-home-${Math.random().toString(36).slice(2)}`);
+    mkdirSync(homeDir, { recursive: true });
+
+    const fakeRegistry = {
+      plugins: [{
+        name: 'openclaw',
+        url: 'https://example.com/plugins/openclaw/index.js',
+        skill_url: 'https://example.com/plugins/openclaw/SKILL.md',
+        sha256: null,
+      }],
+    };
+
+    const { stdout } = await exec('node', ['bin.js', 'setup', 'openclaw', '--dry-run', '--json', '--no-autostart'], {
+      env: {
+        ...process.env,
+        HOME: homeDir,
+        BF_TEST_REGISTRY: JSON.stringify(fakeRegistry),
+        BF_TEST_PLUGIN_JS: 'export default { name: "openclaw", helpers: {} };',
+        BF_TEST_PLUGIN_SKILL: '# OpenClaw BrowserForce policy',
+      },
+    });
+
+    const result = JSON.parse(stdout);
+    assert.equal(result.target, 'openclaw');
+    assert.equal(result.dryRun, true);
+    assert.equal(result.openclawPlugin, undefined);
+    assert.equal(existsSync(join(homeDir, '.browserforce', 'plugins', 'openclaw', 'index.js')), false);
+
+    rmSync(homeDir, { recursive: true, force: true });
+  });
+
   it('setup unknown target exits non-zero with error', async () => {
     try {
       await exec('node', ['bin.js', 'setup', 'nope']);

From 1ce6b871a6c7a5634c4d595ccdffba4f15fa1e40 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Sun, 1 Mar 2026 14:09:08 +0530
Subject: [PATCH 095/192] chore: bump version to 1.0.17

---
 package.json | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/package.json b/package.json
index 1dafb53..1af8494 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce",
-  "version": "1.0.16",
+  "version": "1.0.17",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",

From d33877531823da7adba094eb581dbc3a94369a20 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Sun, 1 Mar 2026 23:26:30 +0530
Subject: [PATCH 096/192] feat(plugins): add google-sheets helpers for safe
 formatting

---
 README.md                               |  35 +-
 plugins/official/google-sheets/SKILL.md |  73 +++
 plugins/official/google-sheets/index.js | 781 ++++++++++++++++++++++++
 plugins/registry.json                   |   8 +
 4 files changed, 886 insertions(+), 11 deletions(-)
 create mode 100644 plugins/official/google-sheets/SKILL.md
 create mode 100644 plugins/official/google-sheets/index.js

diff --git a/README.md b/README.md
index 59ca2ba..b870760 100644
--- a/README.md
+++ b/README.md
@@ -105,10 +105,16 @@ browserforce setup openclaw
 
 `setup openclaw` now auto-installs the official `openclaw` BrowserForce plugin into `~/.browserforce/plugins/openclaw/` so OpenClaw gets BrowserForce-specific usage policy without affecting other agents.
 
-Optional: install the BrowserForce skill for your OpenClaw agent:
+Optional: install the BrowserForce skill for your OpenClaw agent (scoped to OpenClaw only):
 
 ```bash
-npx -y skills add ivalsaraj/browserforce
+npx -y skills add ivalsaraj/browserforce --agent openclaw --skill browserforce --yes
+```
+
+Preview only (no install):
+
+```bash
+npx -y skills add ivalsaraj/browserforce --list
 ```
 
 #### Autostart Modes
@@ -276,18 +282,24 @@ Add the same `mcpServers` entry:
 
 </details>
 
-Need deterministic single-owner handoff for sensitive workflows?
-Set `BF_CLIENT_MODE=single-active` in the MCP server command.
+<details>
+<summary><b>Client Mode: multi-client (default) vs single-active</b></summary>
 
-Why use `single-active`:
-- Prevents two MCP clients from driving the browser at the same time.
-- Makes contention explicit (`409` + `/client-slot`), which is easier to debug.
-- Better for write-heavy flows where accidental concurrent actions are risky.
+BrowserForce has two relay client arbitration modes:
 
-<details>
-<summary><b>Set BF_CLIENT_MODE=single-active (all MCP clients)</b></summary>
+- `multi-client` (default): allows multiple MCP/CDP clients at the same time.
+Use this for normal browsing, read-heavy research, and parallel workflows across tools.
+- `single-active`: allows only one active MCP/CDP client slot at a time.
+Use this for sensitive or write-heavy workflows where you need deterministic single-owner control.
+
+When `single-active` is enabled, additional clients get `409 Conflict` while the slot is busy, and can wait/retry using `/client-slot`.
+
+Set mode in the MCP server command:
+
+- Explicit default mode: `env BF_CLIENT_MODE=multi-client npx -y browserforce@latest mcp`
+- Single-owner mode: `env BF_CLIENT_MODE=single-active npx -y browserforce@latest mcp`
 
-These examples use the POSIX `env` wrapper. If your MCP client supports an `env` object/map, set `BF_CLIENT_MODE=single-active` there instead.
+Examples below use the POSIX `env` wrapper. If your MCP client supports an `env` map, set `BF_CLIENT_MODE` there instead.
 
 **OpenClaw (MCP adapter):**
 
@@ -397,6 +409,7 @@ That's it. Restart MCP (or Claude Desktop) and `highlight()` is available in eve
 | Plugin      | What it adds                                                                                   | Install                                 |
 | ----------- | ---------------------------------------------------------------------------------------------- | --------------------------------------- |
 | `highlight` | `highlight(selector, color?)` — outlines matching elements; `clearHighlights()` — removes them | `browserforce plugin install highlight` |
+| `google-sheets` | `gsReadContiguousRows()`; `gsFormatBulletsInRange()`; `gsSplitBulletsInRange()`; `gsRebalanceBoldInRange()`; `gsLogIssue()` | `browserforce plugin install google-sheets` |
 | `openclaw`  | OpenClaw-specific BrowserForce tab policy (skill text only, no helper functions)              | Auto-installed by `browserforce setup openclaw` |
 
 
diff --git a/plugins/official/google-sheets/SKILL.md b/plugins/official/google-sheets/SKILL.md
new file mode 100644
index 0000000..e96f2c9
--- /dev/null
+++ b/plugins/official/google-sheets/SKILL.md
@@ -0,0 +1,73 @@
+## google-sheets plugin
+
+Use Google Sheets helpers when work involves reading or structuring sheet content reliably without guesswork.
+
+Available helpers:
+- `gsGetMeta()` → current spreadsheet id + gid + title + URL
+- `gsGotoCell(cellRef)` → jump to a cell using the Sheets name box
+- `gsReadCell(cellRef, options?)` → read cell text through the in-cell editor
+- `gsReadContiguousRows(options?)` → detect used rows without hard-scanning arbitrary ranges
+- `gsSplitBulletsInRange(rangeRef, options?)` → replace in-cell bullet separators with real new lines
+- `gsRebalanceBoldInRange(rangeRef, options?)` → sparse bolding (default: max 1 bold segment per line)
+- `gsFormatBulletsInRange(rangeRef, options?)` → split bullets + rebalance bold in one pass
+- `gsLogIssue(summary, details?, options?)` → append a JSONL issue entry
+- `gsIssueLogPath()` → return default issue log path
+
+## Reliability Rules
+
+- Never hardcode long row scans (`1..80`, `1..200`) when structure is contiguous.
+- Use `gsReadContiguousRows({ columns: ['A','B'], startRow: 1, maxRows: 30, emptyStreakStop: 2 })`.
+- Always report `scannedRows`, `usedRowCount`, and `stopReason` when summarizing extraction.
+- Prefer `gsFormatBulletsInRange()` for multi-cell content cleanup tasks.
+- Use `dryRun: true` first for formatting helpers when changing many cells.
+- Log every process failure or unexpected behavior with `gsLogIssue(...)`.
+
+## Example: Read Guidelines Table
+
+```js
+const meta = await gsGetMeta();
+const result = await gsReadContiguousRows({
+  columns: ['A', 'B'],
+  startRow: 1,
+  maxRows: 30,
+  emptyStreakStop: 2
+});
+
+return {
+  sheet: meta,
+  scan: {
+    scannedRows: result.scannedRows,
+    usedRowCount: result.usedRowCount,
+    stopReason: result.stopReason
+  },
+  rows: result.rows
+};
+```
+
+## Example: Log a Failure Pattern
+
+```js
+await gsLogIssue(
+  'Overscan loop on Google Sheets',
+  {
+    symptom: 'Looped to row 80 while table ended at row 9',
+    fix: 'Use gsReadContiguousRows with emptyStreakStop=2',
+    impact: 'Reduced scans and prevented wasted actions'
+  }
+);
+```
+
+## Example: Split + Sparse Bold in One Call
+
+```js
+const result = await gsFormatBulletsInRange('D2:D11', {
+  maxBoldPerLine: 1,
+  preferredPhrasesByCell: {
+    D2: ['review-ready PRs', 'sprint timeline', 'Escalates blockers'],
+    D3: ['Consistent quality', 'Review feedback', 'precise ETA']
+  },
+  verify: true
+});
+
+return result.summary || result;
+```
diff --git a/plugins/official/google-sheets/index.js b/plugins/official/google-sheets/index.js
new file mode 100644
index 0000000..750173a
--- /dev/null
+++ b/plugins/official/google-sheets/index.js
@@ -0,0 +1,781 @@
+import { appendFile, mkdir } from 'node:fs/promises';
+import { dirname, join } from 'node:path';
+import { homedir } from 'node:os';
+
+const DEFAULT_SCAN_MAX_ROWS = 30;
+const DEFAULT_EMPTY_STREAK_STOP = 2;
+const DEFAULT_EDITOR_WAIT_MS = 35;
+const DEFAULT_LOG_PATH = join(homedir(), '.browserforce', 'logs', 'google-sheets-issues.jsonl');
+const SHEETS_URL_RE = /^https:\/\/docs\.google\.com\/spreadsheets\//;
+
+function assertPage(page, helperName) {
+  if (!page || typeof page.url !== 'function') {
+    throw new Error(`${helperName}() requires an active page`);
+  }
+}
+
+function assertGoogleSheet(page, helperName) {
+  assertPage(page, helperName);
+  const url = String(page.url() || '');
+  if (!SHEETS_URL_RE.test(url)) {
+    throw new Error(`${helperName}() requires a Google Sheets page, got: ${url || 'unknown URL'}`);
+  }
+}
+
+function normalizeCellRef(cellRef) {
+  const ref = String(cellRef || '').toUpperCase().trim();
+  if (!/^[A-Z]+[1-9][0-9]*$/.test(ref)) {
+    throw new Error(`Invalid cell reference: "${cellRef}"`);
+  }
+  return ref;
+}
+
+function normalizeColumns(columns) {
+  if (!Array.isArray(columns) || columns.length === 0) {
+    throw new Error('columns must be a non-empty array like ["A", "B"]');
+  }
+  return columns.map((value) => {
+    const col = String(value || '').toUpperCase().trim();
+    if (!/^[A-Z]+$/.test(col)) {
+      throw new Error(`Invalid column value: "${value}"`);
+    }
+    return col;
+  });
+}
+
+function columnToIndex(column) {
+  let value = 0;
+  for (const ch of String(column || '')) {
+    value = value * 26 + (ch.charCodeAt(0) - 64);
+  }
+  return value;
+}
+
+function indexToColumn(index) {
+  let n = Number(index);
+  let out = '';
+  while (n > 0) {
+    const rem = (n - 1) % 26;
+    out = String.fromCharCode(65 + rem) + out;
+    n = Math.floor((n - 1) / 26);
+  }
+  return out;
+}
+
+function parseA1Range(rangeRef) {
+  const ref = String(rangeRef || '').toUpperCase().trim();
+  const m = ref.match(/^([A-Z]+)([1-9][0-9]*)(?::([A-Z]+)([1-9][0-9]*))?$/);
+  if (!m) throw new Error(`Invalid A1 range: "${rangeRef}"`);
+
+  const startCol = m[1];
+  const startRow = Number(m[2]);
+  const endCol = m[3] || startCol;
+  const endRow = m[4] ? Number(m[4]) : startRow;
+
+  const startColIdx = columnToIndex(startCol);
+  const endColIdx = columnToIndex(endCol);
+  const colMin = Math.min(startColIdx, endColIdx);
+  const colMax = Math.max(startColIdx, endColIdx);
+  const rowMin = Math.min(startRow, endRow);
+  const rowMax = Math.max(startRow, endRow);
+
+  return {
+    startCol: indexToColumn(colMin),
+    endCol: indexToColumn(colMax),
+    startColIdx: colMin,
+    endColIdx: colMax,
+    startRow: rowMin,
+    endRow: rowMax,
+  };
+}
+
+function expandA1Range(rangeRef) {
+  const parsed = parseA1Range(rangeRef);
+  const refs = [];
+  for (let r = parsed.startRow; r <= parsed.endRow; r += 1) {
+    for (let c = parsed.startColIdx; c <= parsed.endColIdx; c += 1) {
+      refs.push(`${indexToColumn(c)}${r}`);
+    }
+  }
+  return refs;
+}
+
+function parseSheetMeta(urlRaw) {
+  const fallback = {
+    spreadsheetId: null,
+    gid: null,
+    url: String(urlRaw || ''),
+  };
+  try {
+    const url = new URL(String(urlRaw || ''));
+    const match = url.pathname.match(/\/spreadsheets\/d\/([^/]+)/);
+    const gidFromQuery = url.searchParams.get('gid');
+    const gidFromHash = (url.hash.match(/gid=(\d+)/) || [])[1] || null;
+    return {
+      spreadsheetId: match ? match[1] : null,
+      gid: gidFromQuery || gidFromHash || null,
+      url: url.toString(),
+    };
+  } catch {
+    return fallback;
+  }
+}
+
+async function pause(page, ms = DEFAULT_EDITOR_WAIT_MS) {
+  if (!ms || ms < 1) return;
+  if (typeof page.waitForTimeout === 'function') {
+    await page.waitForTimeout(ms);
+    return;
+  }
+  await new Promise((resolve) => setTimeout(resolve, ms));
+}
+
+async function gotoCell(page, cellRef) {
+  const ref = normalizeCellRef(cellRef);
+  const box = page.locator('#t-name-box');
+  await box.click();
+  await box.fill(ref);
+  await page.keyboard.press('Enter');
+  return ref;
+}
+
+async function openEditorAtCell(page, cellRef, waitMs = DEFAULT_EDITOR_WAIT_MS) {
+  const ref = await gotoCell(page, cellRef);
+  await page.keyboard.press('F2');
+  await pause(page, waitMs);
+  return ref;
+}
+
+async function closeEditor(page, commit = false) {
+  await page.keyboard.press(commit ? 'Enter' : 'Escape');
+}
+
+function mergeRanges(ranges = []) {
+  const normalized = ranges
+    .filter((r) => Number.isInteger(r.start) && Number.isInteger(r.end) && r.end > r.start)
+    .map((r) => ({ start: r.start, end: r.end }))
+    .sort((a, b) => a.start - b.start || a.end - b.end);
+
+  const merged = [];
+  for (const r of normalized) {
+    if (!merged.length || r.start > merged[merged.length - 1].end) {
+      merged.push({ ...r });
+    } else {
+      merged[merged.length - 1].end = Math.max(merged[merged.length - 1].end, r.end);
+    }
+  }
+  return merged;
+}
+
+function rangesEqual(a = [], b = []) {
+  if (a.length !== b.length) return false;
+  for (let i = 0; i < a.length; i += 1) {
+    if (a[i].start !== b[i].start || a[i].end !== b[i].end) return false;
+  }
+  return true;
+}
+
+function overlapsAny(range, ranges) {
+  return ranges.some((r) => !(range.end <= r.start || range.start >= r.end));
+}
+
+function findNonOverlappingRanges(text, phrases = []) {
+  if (!Array.isArray(phrases) || phrases.length === 0) return [];
+  const occ = [];
+  for (const rawPhrase of phrases) {
+    const phrase = String(rawPhrase || '');
+    if (!phrase) continue;
+    let idx = 0;
+    while (idx <= text.length) {
+      const found = text.indexOf(phrase, idx);
+      if (found === -1) break;
+      occ.push({ start: found, end: found + phrase.length });
+      idx = found + phrase.length;
+    }
+  }
+  occ.sort((a, b) => a.start - b.start || b.end - a.end);
+  const chosen = [];
+  for (const r of occ) {
+    if (!overlapsAny(r, chosen)) chosen.push(r);
+  }
+  return mergeRanges(chosen);
+}
+
+function splitLinesWithOffsets(text) {
+  const lines = String(text || '').split('\n');
+  const out = [];
+  let start = 0;
+  for (let i = 0; i < lines.length; i += 1) {
+    const line = lines[i];
+    const end = start + line.length;
+    out.push({ index: i, text: line, start, end });
+    start = end + 1; // account for newline
+  }
+  return out;
+}
+
+function getRangesWithinLine(ranges, line) {
+  return ranges.filter((r) => r.start >= line.start && r.end <= line.end);
+}
+
+function selectSparseBoldRanges(text, existingRanges, preferredPhrases, maxBoldPerLine, keepExistingFallback = true) {
+  const maxPerLine = Number.isInteger(maxBoldPerLine) && maxBoldPerLine > 0 ? maxBoldPerLine : 1;
+  const preferredRanges = findNonOverlappingRanges(text, preferredPhrases || []);
+  const existingMerged = mergeRanges(existingRanges || []);
+  const lines = splitLinesWithOffsets(text);
+
+  const chosen = [];
+  for (const line of lines) {
+    const picks = [];
+
+    const fromPreferred = getRangesWithinLine(preferredRanges, line);
+    for (const r of fromPreferred) {
+      if (picks.length >= maxPerLine) break;
+      if (!overlapsAny(r, picks)) picks.push(r);
+    }
+
+    if (keepExistingFallback && picks.length < maxPerLine) {
+      const fromExisting = getRangesWithinLine(existingMerged, line);
+      for (const r of fromExisting) {
+        if (picks.length >= maxPerLine) break;
+        if (!overlapsAny(r, picks)) picks.push(r);
+      }
+    }
+
+    chosen.push(...picks);
+  }
+
+  return mergeRanges(chosen);
+}
+
+function splitBulletsText(text, options = {}) {
+  const pattern = options.separatorPattern || '\\s-\\s';
+  const flags = options.separatorFlags || 'g';
+  const replacement = options.replacement || '\n- ';
+  const re = new RegExp(pattern, flags);
+  return String(text || '').replace(re, replacement);
+}
+
+function defaultStyle() {
+  return "font-size:13px;color:#000000;font-weight:normal;text-decoration:none;font-family:'Arial';font-style:normal;text-decoration-skip-ink:none;";
+}
+
+async function readEditorText(page, { trim = true } = {}) {
+  const raw = await page.evaluate(() => {
+    const editor = document.querySelector('#waffle-rich-text-editor');
+    if (!editor) return null;
+    return editor.innerText.replace(/\n+$/g, '');
+  });
+  if (raw === null) {
+    throw new Error('Cannot read Google Sheets editor (#waffle-rich-text-editor not found)');
+  }
+  return trim ? raw.trim() : raw;
+}
+
+async function readEditorSnapshot(page) {
+  const data = await page.evaluate(() => {
+    const editor = document.querySelector('#waffle-rich-text-editor');
+    if (!editor) return null;
+
+    const text = editor.innerText.replace(/\n+$/g, '');
+    const firstSpan = editor.querySelector('span');
+    const baseStyle = firstSpan?.getAttribute('style') || '';
+
+    const isBoldElement = (el) => {
+      if (!el || el.nodeType !== 1) return false;
+      if (el.tagName === 'B' || el.tagName === 'STRONG') return true;
+      const fw = getComputedStyle(el).fontWeight;
+      return fw === 'bold' || Number(fw) >= 600;
+    };
+
+    const walker = document.createTreeWalker(editor, NodeFilter.SHOW_TEXT);
+    const ranges = [];
+    let node;
+    let offset = 0;
+    while ((node = walker.nextNode())) {
+      const t = node.textContent || '';
+      const start = offset;
+      const end = offset + t.length;
+      let p = node.parentElement;
+      let bold = false;
+      while (p && p !== editor) {
+        if (isBoldElement(p)) {
+          bold = true;
+          break;
+        }
+        p = p.parentElement;
+      }
+      if (bold && t.length) ranges.push({ start, end });
+      offset = end;
+    }
+
+    ranges.sort((a, b) => a.start - b.start || a.end - b.end);
+    const merged = [];
+    for (const r of ranges) {
+      if (!merged.length || r.start > merged[merged.length - 1].end) {
+        merged.push({ start: r.start, end: r.end });
+      } else {
+        merged[merged.length - 1].end = Math.max(merged[merged.length - 1].end, r.end);
+      }
+    }
+
+    return {
+      text,
+      baseStyle,
+      boldRanges: merged,
+      lineCount: text.split('\n').length,
+    };
+  });
+
+  if (!data) throw new Error('Cannot read Google Sheets editor snapshot');
+  return {
+    text: data.text,
+    baseStyle: data.baseStyle || defaultStyle(),
+    boldRanges: mergeRanges(data.boldRanges),
+    lineCount: data.lineCount,
+  };
+}
+
+async function writeEditorWithRanges(page, text, boldRanges, baseStyle) {
+  const result = await page.evaluate(({ textValue, ranges, style }) => {
+    const editor = document.querySelector('#waffle-rich-text-editor');
+    if (!editor) return null;
+
+    const normalizedStyle = style || "font-size:13px;color:#000000;font-weight:normal;text-decoration:none;font-family:'Arial';font-style:normal;text-decoration-skip-ink:none;";
+    const boldStyle = /font-weight\s*:/i.test(normalizedStyle)
+      ? normalizedStyle.replace(/font-weight\s*:\s*[^;]+/i, 'font-weight:bold')
+      : `${normalizedStyle};font-weight:bold`;
+
+    const merged = [];
+    const sorted = (ranges || [])
+      .filter((r) => Number.isInteger(r.start) && Number.isInteger(r.end) && r.end > r.start)
+      .sort((a, b) => a.start - b.start || a.end - b.end);
+    for (const r of sorted) {
+      const start = Math.max(0, Math.min(textValue.length, r.start));
+      const end = Math.max(0, Math.min(textValue.length, r.end));
+      if (end <= start) continue;
+      if (!merged.length || start > merged[merged.length - 1].end) {
+        merged.push({ start, end });
+      } else {
+        merged[merged.length - 1].end = Math.max(merged[merged.length - 1].end, end);
+      }
+    }
+
+    while (editor.firstChild) editor.removeChild(editor.firstChild);
+
+    const appendChunk = (chunk, useBold) => {
+      const lines = chunk.split('\n');
+      for (let i = 0; i < lines.length; i += 1) {
+        if (lines[i].length > 0) {
+          const span = document.createElement('span');
+          span.setAttribute('style', useBold ? boldStyle : normalizedStyle);
+          span.appendChild(document.createTextNode(lines[i]));
+          editor.appendChild(span);
+        }
+        if (i < lines.length - 1) editor.appendChild(document.createElement('br'));
+      }
+    };
+
+    let pos = 0;
+    for (const r of merged) {
+      if (r.start > pos) appendChunk(textValue.slice(pos, r.start), false);
+      appendChunk(textValue.slice(r.start, r.end), true);
+      pos = r.end;
+    }
+    if (pos < textValue.length) appendChunk(textValue.slice(pos), false);
+    editor.appendChild(document.createElement('br'));
+
+    editor.dispatchEvent(new InputEvent('input', { bubbles: true }));
+    const after = editor.innerText.replace(/\n+$/g, '');
+    return { after, lineCount: after.split('\n').length };
+  }, { textValue: text, ranges: boldRanges, style: baseStyle });
+
+  if (!result) throw new Error('Failed to write Google Sheets editor content');
+  return result;
+}
+
+async function readCell(page, cellRef, options = {}) {
+  const { trim = true, waitMs = DEFAULT_EDITOR_WAIT_MS } = options;
+  const ref = await gotoCell(page, cellRef);
+  await page.keyboard.press('F2');
+  await pause(page, waitMs);
+  let value;
+  try {
+    value = await readEditorText(page, { trim });
+  } finally {
+    await page.keyboard.press('Escape');
+  }
+  return { ref, value };
+}
+
+async function readRow(page, row, columns, options) {
+  const cells = {};
+  for (const col of columns) {
+    const { value } = await readCell(page, `${col}${row}`, options);
+    cells[col] = value;
+  }
+  return cells;
+}
+
+function hasData(cells) {
+  return Object.values(cells).some((value) => Boolean(String(value || '').trim()));
+}
+
+export default {
+  name: 'google-sheets',
+  description: 'Google Sheets helpers for reliable row scanning, cell reads, and issue logging',
+  version: '1.0.0',
+  helpers: {
+    gsGetMeta: async (page) => {
+      assertGoogleSheet(page, 'gsGetMeta');
+      const title = await page.title();
+      const meta = parseSheetMeta(page.url());
+      return { ...meta, title };
+    },
+
+    gsGotoCell: async (page, ctx, state, cellRef) => {
+      assertGoogleSheet(page, 'gsGotoCell');
+      const ref = await gotoCell(page, cellRef);
+      return { ok: true, ref };
+    },
+
+    gsReadCell: async (page, ctx, state, cellRef, options = {}) => {
+      assertGoogleSheet(page, 'gsReadCell');
+      const { ref, value } = await readCell(page, cellRef, options);
+      return { ref, value };
+    },
+
+    gsReadContiguousRows: async (page, ctx, state, options = {}) => {
+      assertGoogleSheet(page, 'gsReadContiguousRows');
+
+      const columns = normalizeColumns(options.columns || ['A', 'B']);
+      const startRow = Number.isInteger(options.startRow) && options.startRow > 0 ? options.startRow : 1;
+      const maxRows = Number.isInteger(options.maxRows) && options.maxRows > 0
+        ? options.maxRows
+        : DEFAULT_SCAN_MAX_ROWS;
+      const emptyStreakStop = Number.isInteger(options.emptyStreakStop) && options.emptyStreakStop > 0
+        ? options.emptyStreakStop
+        : DEFAULT_EMPTY_STREAK_STOP;
+
+      const rows = [];
+      let scannedRows = 0;
+      let seenData = false;
+      let emptyStreak = 0;
+      let stopReason = 'max_rows_reached';
+
+      for (let i = 0; i < maxRows; i += 1) {
+        const row = startRow + i;
+        const cells = await readRow(page, row, columns, options);
+        scannedRows += 1;
+
+        if (hasData(cells)) {
+          rows.push({ row, cells });
+          seenData = true;
+          emptyStreak = 0;
+          continue;
+        }
+
+        if (seenData) {
+          emptyStreak += 1;
+          if (emptyStreak >= emptyStreakStop) {
+            stopReason = 'empty_streak_stop';
+            break;
+          }
+        }
+      }
+
+      return {
+        rows,
+        scannedRows,
+        usedRowCount: rows.length,
+        stopReason,
+        config: { columns, startRow, maxRows, emptyStreakStop },
+      };
+    },
+
+    gsLogIssue: async (page, ctx, state, summary, details = {}, options = {}) => {
+      const text = String(summary || '').trim();
+      if (!text) throw new Error('gsLogIssue() requires a non-empty summary');
+
+      const logPath = String(options.logPath || DEFAULT_LOG_PATH);
+      const entry = {
+        ts: new Date().toISOString(),
+        summary: text,
+        details: details && typeof details === 'object' ? details : { note: String(details) },
+        pageUrl: page && typeof page.url === 'function' ? page.url() : null,
+        pageTitle: page && typeof page.title === 'function' ? await page.title() : null,
+      };
+
+      await mkdir(dirname(logPath), { recursive: true });
+      await appendFile(logPath, `${JSON.stringify(entry)}\n`, 'utf8');
+      return { ok: true, logPath, entry };
+    },
+
+    gsIssueLogPath: async () => ({ logPath: DEFAULT_LOG_PATH }),
+
+    gsSplitBulletsInRange: async (page, ctx, state, rangeRef, options = {}) => {
+      assertGoogleSheet(page, 'gsSplitBulletsInRange');
+      const cells = expandA1Range(rangeRef);
+      const dryRun = options.dryRun === true;
+      const verify = options.verify !== false;
+      const waitMs = Number.isInteger(options.waitMs) && options.waitMs >= 0 ? options.waitMs : DEFAULT_EDITOR_WAIT_MS;
+
+      const results = [];
+      for (const ref of cells) {
+        try {
+          await openEditorAtCell(page, ref, waitMs);
+          const snapshot = await readEditorSnapshot(page);
+          const transformed = splitBulletsText(snapshot.text, options);
+          const changed = transformed !== snapshot.text;
+
+          if (!changed) {
+            await closeEditor(page, false);
+            results.push({ ref, status: 'unchanged', changed: false, beforeLines: snapshot.lineCount, afterLines: snapshot.lineCount });
+            continue;
+          }
+
+          const preservedRanges = transformed.length === snapshot.text.length ? snapshot.boldRanges : [];
+          if (dryRun) {
+            await closeEditor(page, false);
+            results.push({
+              ref,
+              status: 'dry_run',
+              changed: true,
+              beforeLines: snapshot.lineCount,
+              afterLines: transformed.split('\n').length,
+              droppedBoldRanges: transformed.length !== snapshot.text.length,
+            });
+            continue;
+          }
+
+          const write = await writeEditorWithRanges(page, transformed, preservedRanges, snapshot.baseStyle);
+          if (write.after !== transformed) {
+            await closeEditor(page, false);
+            results.push({ ref, status: 'error', changed: true, error: 'text_mismatch_after_write' });
+            continue;
+          }
+
+          await closeEditor(page, true);
+
+          let verifyOk = true;
+          if (verify) {
+            await openEditorAtCell(page, ref, waitMs);
+            const verifySnapshot = await readEditorSnapshot(page);
+            await closeEditor(page, false);
+            verifyOk = verifySnapshot.text === transformed;
+          }
+
+          results.push({
+            ref,
+            status: verifyOk ? 'ok' : 'verify_failed',
+            changed: true,
+            beforeLines: snapshot.lineCount,
+            afterLines: transformed.split('\n').length,
+          });
+        } catch (err) {
+          try { await closeEditor(page, false); } catch { /* ignore */ }
+          results.push({ ref, status: 'error', changed: false, error: String(err?.message || err) });
+        }
+      }
+
+      return {
+        rangeRef: String(rangeRef),
+        total: results.length,
+        changed: results.filter((r) => r.changed).length,
+        unchanged: results.filter((r) => r.status === 'unchanged').length,
+        ok: results.filter((r) => r.status === 'ok' || r.status === 'dry_run').length,
+        failed: results.filter((r) => r.status === 'error' || r.status === 'verify_failed').length,
+        results,
+      };
+    },
+
+    gsRebalanceBoldInRange: async (page, ctx, state, rangeRef, options = {}) => {
+      assertGoogleSheet(page, 'gsRebalanceBoldInRange');
+      const cells = expandA1Range(rangeRef);
+      const dryRun = options.dryRun === true;
+      const verify = options.verify !== false;
+      const waitMs = Number.isInteger(options.waitMs) && options.waitMs >= 0 ? options.waitMs : DEFAULT_EDITOR_WAIT_MS;
+      const maxBoldPerLine = Number.isInteger(options.maxBoldPerLine) && options.maxBoldPerLine > 0 ? options.maxBoldPerLine : 1;
+      const keepExistingFallback = options.keepExistingFallback !== false;
+      const preferredGlobal = Array.isArray(options.preferredPhrases) ? options.preferredPhrases : [];
+      const preferredByCell = options.preferredPhrasesByCell && typeof options.preferredPhrasesByCell === 'object'
+        ? options.preferredPhrasesByCell
+        : {};
+
+      const results = [];
+      for (const ref of cells) {
+        try {
+          await openEditorAtCell(page, ref, waitMs);
+          const snapshot = await readEditorSnapshot(page);
+          const preferredLocal = Array.isArray(preferredByCell[ref]) ? preferredByCell[ref] : preferredGlobal;
+          const targetRanges = selectSparseBoldRanges(
+            snapshot.text,
+            snapshot.boldRanges,
+            preferredLocal,
+            maxBoldPerLine,
+            keepExistingFallback
+          );
+
+          const changed = !rangesEqual(snapshot.boldRanges, targetRanges);
+          if (!changed) {
+            await closeEditor(page, false);
+            results.push({ ref, status: 'unchanged', changed: false, lineCount: snapshot.lineCount, boldSegments: targetRanges.length });
+            continue;
+          }
+
+          if (dryRun) {
+            await closeEditor(page, false);
+            results.push({
+              ref,
+              status: 'dry_run',
+              changed: true,
+              lineCount: snapshot.lineCount,
+              beforeBoldSegments: snapshot.boldRanges.length,
+              afterBoldSegments: targetRanges.length,
+            });
+            continue;
+          }
+
+          const write = await writeEditorWithRanges(page, snapshot.text, targetRanges, snapshot.baseStyle);
+          if (write.after !== snapshot.text) {
+            await closeEditor(page, false);
+            results.push({ ref, status: 'error', changed: true, error: 'text_changed_while_rebalancing' });
+            continue;
+          }
+
+          await closeEditor(page, true);
+
+          let verifyOk = true;
+          if (verify) {
+            await openEditorAtCell(page, ref, waitMs);
+            const verifySnapshot = await readEditorSnapshot(page);
+            await closeEditor(page, false);
+            verifyOk = verifySnapshot.text === snapshot.text && rangesEqual(verifySnapshot.boldRanges, targetRanges);
+          }
+
+          results.push({
+            ref,
+            status: verifyOk ? 'ok' : 'verify_failed',
+            changed: true,
+            lineCount: snapshot.lineCount,
+            beforeBoldSegments: snapshot.boldRanges.length,
+            afterBoldSegments: targetRanges.length,
+          });
+        } catch (err) {
+          try { await closeEditor(page, false); } catch { /* ignore */ }
+          results.push({ ref, status: 'error', changed: false, error: String(err?.message || err) });
+        }
+      }
+
+      return {
+        rangeRef: String(rangeRef),
+        total: results.length,
+        changed: results.filter((r) => r.changed).length,
+        unchanged: results.filter((r) => r.status === 'unchanged').length,
+        ok: results.filter((r) => r.status === 'ok' || r.status === 'dry_run').length,
+        failed: results.filter((r) => r.status === 'error' || r.status === 'verify_failed').length,
+        results,
+      };
+    },
+
+    gsFormatBulletsInRange: async (page, ctx, state, rangeRef, options = {}) => {
+      assertGoogleSheet(page, 'gsFormatBulletsInRange');
+      const cells = expandA1Range(rangeRef);
+      const dryRun = options.dryRun === true;
+      const verify = options.verify !== false;
+      const waitMs = Number.isInteger(options.waitMs) && options.waitMs >= 0 ? options.waitMs : DEFAULT_EDITOR_WAIT_MS;
+      const maxBoldPerLine = Number.isInteger(options.maxBoldPerLine) && options.maxBoldPerLine > 0 ? options.maxBoldPerLine : 1;
+      const keepExistingFallback = options.keepExistingFallback !== false;
+      const preferredGlobal = Array.isArray(options.preferredPhrases) ? options.preferredPhrases : [];
+      const preferredByCell = options.preferredPhrasesByCell && typeof options.preferredPhrasesByCell === 'object'
+        ? options.preferredPhrasesByCell
+        : {};
+
+      const results = [];
+      for (const ref of cells) {
+        try {
+          await openEditorAtCell(page, ref, waitMs);
+          const snapshot = await readEditorSnapshot(page);
+          const transformed = splitBulletsText(snapshot.text, options);
+          const postSplitBaseRanges = transformed.length === snapshot.text.length ? snapshot.boldRanges : [];
+          const preferredLocal = Array.isArray(preferredByCell[ref]) ? preferredByCell[ref] : preferredGlobal;
+          const targetRanges = selectSparseBoldRanges(
+            transformed,
+            postSplitBaseRanges,
+            preferredLocal,
+            maxBoldPerLine,
+            keepExistingFallback
+          );
+
+          const textChanged = transformed !== snapshot.text;
+          const boldChanged = !rangesEqual(snapshot.boldRanges, targetRanges);
+          const changed = textChanged || boldChanged;
+
+          if (!changed) {
+            await closeEditor(page, false);
+            results.push({ ref, status: 'unchanged', changed: false, beforeLines: snapshot.lineCount, afterLines: snapshot.lineCount });
+            continue;
+          }
+
+          if (dryRun) {
+            await closeEditor(page, false);
+            results.push({
+              ref,
+              status: 'dry_run',
+              changed: true,
+              textChanged,
+              boldChanged,
+              beforeLines: snapshot.lineCount,
+              afterLines: transformed.split('\n').length,
+              beforeBoldSegments: snapshot.boldRanges.length,
+              afterBoldSegments: targetRanges.length,
+            });
+            continue;
+          }
+
+          const write = await writeEditorWithRanges(page, transformed, targetRanges, snapshot.baseStyle);
+          if (write.after !== transformed) {
+            await closeEditor(page, false);
+            results.push({ ref, status: 'error', changed: true, error: 'text_mismatch_after_write' });
+            continue;
+          }
+
+          await closeEditor(page, true);
+
+          let verifyOk = true;
+          if (verify) {
+            await openEditorAtCell(page, ref, waitMs);
+            const verifySnapshot = await readEditorSnapshot(page);
+            await closeEditor(page, false);
+            verifyOk = verifySnapshot.text === transformed && rangesEqual(verifySnapshot.boldRanges, targetRanges);
+          }
+
+          results.push({
+            ref,
+            status: verifyOk ? 'ok' : 'verify_failed',
+            changed: true,
+            textChanged,
+            boldChanged,
+            beforeLines: snapshot.lineCount,
+            afterLines: transformed.split('\n').length,
+            beforeBoldSegments: snapshot.boldRanges.length,
+            afterBoldSegments: targetRanges.length,
+          });
+        } catch (err) {
+          try { await closeEditor(page, false); } catch { /* ignore */ }
+          results.push({ ref, status: 'error', changed: false, error: String(err?.message || err) });
+        }
+      }
+
+      return {
+        rangeRef: String(rangeRef),
+        total: results.length,
+        changed: results.filter((r) => r.changed).length,
+        unchanged: results.filter((r) => r.status === 'unchanged').length,
+        ok: results.filter((r) => r.status === 'ok' || r.status === 'dry_run').length,
+        failed: results.filter((r) => r.status === 'error' || r.status === 'verify_failed').length,
+        results,
+      };
+    },
+  },
+};
diff --git a/plugins/registry.json b/plugins/registry.json
index da6006e..052dc57 100644
--- a/plugins/registry.json
+++ b/plugins/registry.json
@@ -9,6 +9,14 @@
       "skill_url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/official/highlight/SKILL.md",
       "sha256": "d302bd9a0f6e96bd0c7a8666b560e01ab88f9f9e4c4694f14d97019f4cc04424"
     },
+    {
+      "name": "google-sheets",
+      "description": "Google Sheets helpers for reliable row scanning, cell reads, and issue logging",
+      "version": "1.0.0",
+      "url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/official/google-sheets/index.js",
+      "skill_url": "https://raw.githubusercontent.com/ivalsaraj/browserforce/main/plugins/official/google-sheets/SKILL.md",
+      "sha256": "93853d5991b576e2989ae4f0dc011f9adf4d7c4dadcf44870f66f52e4a38d3c5"
+    },
     {
       "name": "openclaw",
       "description": "OpenClaw-specific BrowserForce usage policy",

From 10f1dd5a385013b223221481bfdb6981de14dac8 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Sun, 1 Mar 2026 23:26:35 +0530
Subject: [PATCH 097/192] docs: add google sheets automation issue log

---
 docs/google-sheets-issues.md | 37 ++++++++++++++++++++++++++++++++++++
 1 file changed, 37 insertions(+)
 create mode 100644 docs/google-sheets-issues.md

diff --git a/docs/google-sheets-issues.md b/docs/google-sheets-issues.md
new file mode 100644
index 0000000..64dcf12
--- /dev/null
+++ b/docs/google-sheets-issues.md
@@ -0,0 +1,37 @@
+# Google Sheets Automation Issue Log
+
+Append-only log for BrowserForce Google Sheets workflow failures, root causes, and fixes.
+
+Use this format for each new entry:
+
+## YYYY-MM-DD — [TAG] Short Title
+- Symptom:
+- Root cause:
+- Fix:
+- Rule:
+
+---
+
+## 2026-03-01 — [SCAN] Overscan Loop Beyond Used Rows
+- Symptom: Automation loop kept scanning up to row 80 while the table only had data through row 9.
+- Root cause: Fixed upper-bound scanning was used without detecting the contiguous used range first.
+- Fix: Added contiguous scanning with early-stop using empty streak detection.
+- Rule: Discover used rows first; never default to high hardcoded row caps.
+
+## 2026-03-01 — [FORMAT] Over-Highlighting Reduced Signal
+- Symptom: Too many bold segments made the full column look uniformly emphasized.
+- Root cause: Highlight heuristic selected many phrases per line without a density limit.
+- Fix: Reduced to one key bold phrase per bullet line.
+- Rule: Emphasis must stay sparse and intentional; cap highlights per line.
+
+## 2026-03-01 — [DOM] Trusted Types Blocked innerHTML Assignment
+- Symptom: Direct `innerHTML` rewrite failed with Trusted Types enforcement.
+- Root cause: Google Sheets editor enforces TrustedHTML assignment policies.
+- Fix: Switched to DOM node construction via `createElement` and `createTextNode`.
+- Rule: Prefer node-based DOM updates over raw HTML assignment in locked editors.
+
+## 2026-03-01 — [DISCOVERY] Prior-Art Check Before New Skill Logic
+- Symptom: Risk of rebuilding behavior that already exists in official integrations or MCP servers.
+- Root cause: Feature work started before surveying existing Claude and MCP Google Sheets solutions.
+- Fix: Added a mandatory pre-build lookup step against official docs + known MCP repositories.
+- Rule: Before expanding Sheets automation behavior, check official support and existing MCP implementations.

From 81d04196180a5009d75b09bd387eb49153656ae9 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 13:44:10 +0530
Subject: [PATCH 098/192] fix(agent): fail fast on corrupt session index and
 write index atomically

---
 agent/src/session-store.js       | 239 +++++++++++++++++++++++++++++++
 test/agent/session-store.test.js |  79 ++++++++++
 2 files changed, 318 insertions(+)
 create mode 100644 agent/src/session-store.js
 create mode 100644 test/agent/session-store.test.js

diff --git a/agent/src/session-store.js b/agent/src/session-store.js
new file mode 100644
index 0000000..d9e15ad
--- /dev/null
+++ b/agent/src/session-store.js
@@ -0,0 +1,239 @@
+import { promises as fs } from 'node:fs';
+import { homedir } from 'node:os';
+import { dirname, join } from 'node:path';
+import { randomUUID } from 'node:crypto';
+
+const DEFAULT_STORAGE_ROOT = join(homedir(), '.browserforce', 'agent', 'sessions');
+const INDEX_FILE = 'index.json';
+const SESSION_ID_RE = /^[A-Za-z0-9_-]{1,128}$/;
+const MODEL_ID_RE = /^[A-Za-z0-9._:/-]{1,128}$/;
+const indexWriteQueues = new Map();
+
+function resolveStorageRoot(storageRoot) {
+  return storageRoot || DEFAULT_STORAGE_ROOT;
+}
+
+function indexPath(storageRoot) {
+  return join(storageRoot, INDEX_FILE);
+}
+
+function messageLogPath(storageRoot, sessionId) {
+  return join(storageRoot, `${sessionId}.jsonl`);
+}
+
+export function isValidSessionId(sessionId) {
+  return typeof sessionId === 'string' && SESSION_ID_RE.test(sessionId);
+}
+
+export function isValidModelId(model) {
+  return typeof model === 'string' && MODEL_ID_RE.test(model);
+}
+
+function assertValidSessionId(sessionId, fnName) {
+  if (!isValidSessionId(sessionId)) {
+    throw new Error(`${fnName} requires a safe sessionId`);
+  }
+}
+
+async function ensureStorageRoot(storageRoot) {
+  await fs.mkdir(storageRoot, { recursive: true });
+}
+
+async function readIndex(storageRoot) {
+  const path = indexPath(storageRoot);
+  let raw;
+  try {
+    raw = await fs.readFile(path, 'utf8');
+  } catch (error) {
+    if (error && error.code === 'ENOENT') return [];
+    throw error;
+  }
+
+  let parsed;
+  try {
+    parsed = JSON.parse(raw);
+  } catch (error) {
+    throw new Error(`invalid session index: ${error?.message || 'unable to parse json'}`);
+  }
+
+  if (!parsed || typeof parsed !== 'object' || !Array.isArray(parsed.sessions)) {
+    throw new Error('invalid session index: missing sessions array');
+  }
+
+  return parsed.sessions;
+}
+
+async function writeIndex(storageRoot, sessions) {
+  const path = indexPath(storageRoot);
+  const tmpPath = `${path}.tmp`;
+  await fs.mkdir(dirname(path), { recursive: true });
+  await fs.writeFile(tmpPath, `${JSON.stringify({ sessions }, null, 2)}\n`, 'utf8');
+  await fs.rename(tmpPath, path);
+}
+
+async function withIndexWriteLock(storageRoot, operation) {
+  const queue = indexWriteQueues.get(storageRoot) || Promise.resolve();
+  const next = queue.then(operation, operation);
+  indexWriteQueues.set(storageRoot, next.catch(() => {}));
+  return next;
+}
+
+function normalizeModel(model) {
+  if (model == null) return null;
+  const trimmed = String(model).trim();
+  if (!trimmed) return null;
+  if (!isValidModelId(trimmed)) {
+    throw new Error('model must be a safe model id');
+  }
+  return trimmed;
+}
+
+function sortSessionsNewestFirst(a, b) {
+  const aTs = Date.parse(a.updatedAt || a.createdAt || 0);
+  const bTs = Date.parse(b.updatedAt || b.createdAt || 0);
+  return bTs - aTs;
+}
+
+export async function createSession({ title = 'New chat', model = null, storageRoot } = {}) {
+  const root = resolveStorageRoot(storageRoot);
+  await ensureStorageRoot(root);
+
+  const sessionId = randomUUID();
+  const now = new Date().toISOString();
+  const session = {
+    sessionId,
+    title,
+    model: normalizeModel(model),
+    createdAt: now,
+    updatedAt: now,
+  };
+
+  await withIndexWriteLock(root, async () => {
+    const sessions = await readIndex(root);
+    sessions.push(session);
+    await writeIndex(root, sessions);
+  });
+
+  return session;
+}
+
+export async function listSessions({ limit = 50, storageRoot } = {}) {
+  const root = resolveStorageRoot(storageRoot);
+  await ensureStorageRoot(root);
+  const sessions = await readIndex(root);
+  return sessions
+    .slice()
+    .sort(sortSessionsNewestFirst)
+    .slice(0, limit);
+}
+
+export async function getSession({ sessionId, storageRoot } = {}) {
+  assertValidSessionId(sessionId, 'getSession');
+  const root = resolveStorageRoot(storageRoot);
+  await ensureStorageRoot(root);
+  const sessions = await readIndex(root);
+  return sessions.find((row) => row.sessionId === sessionId) || null;
+}
+
+export async function updateSession({ sessionId, patch = {}, storageRoot } = {}) {
+  assertValidSessionId(sessionId, 'updateSession');
+  if (!patch || typeof patch !== 'object') {
+    throw new Error('updateSession requires patch');
+  }
+
+  const root = resolveStorageRoot(storageRoot);
+  await ensureStorageRoot(root);
+  const now = new Date().toISOString();
+
+  return withIndexWriteLock(root, async () => {
+    const sessions = await readIndex(root);
+    const idx = sessions.findIndex((row) => row.sessionId === sessionId);
+    if (idx === -1) return null;
+
+    const current = sessions[idx];
+    const next = { ...current };
+    if (typeof patch.title === 'string') {
+      next.title = patch.title.trim() || current.title || 'New chat';
+    }
+    if (Object.prototype.hasOwnProperty.call(patch, 'model')) {
+      next.model = normalizeModel(patch.model);
+    }
+    next.updatedAt = now;
+    sessions[idx] = next;
+    await writeIndex(root, sessions);
+    return next;
+  });
+}
+
+export async function appendMessage({ sessionId, role, text, storageRoot } = {}) {
+  assertValidSessionId(sessionId, 'appendMessage');
+  if (!role) throw new Error('appendMessage requires role');
+  if (typeof text !== 'string') throw new Error('appendMessage requires text');
+
+  const root = resolveStorageRoot(storageRoot);
+  await ensureStorageRoot(root);
+
+  const now = new Date().toISOString();
+  const entry = {
+    id: randomUUID(),
+    sessionId,
+    role,
+    text,
+    createdAt: now,
+  };
+
+  const logPath = messageLogPath(root, sessionId);
+  await fs.appendFile(logPath, `${JSON.stringify(entry)}\n`, 'utf8');
+
+  await withIndexWriteLock(root, async () => {
+    const sessions = await readIndex(root);
+    const idx = sessions.findIndex((s) => s.sessionId === sessionId);
+    if (idx === -1) {
+      sessions.push({
+        sessionId,
+        title: 'Recovered chat',
+        createdAt: now,
+        updatedAt: now,
+      });
+    } else {
+      sessions[idx] = {
+        ...sessions[idx],
+        updatedAt: now,
+      };
+    }
+    await writeIndex(root, sessions);
+  });
+
+  return entry;
+}
+
+export async function readMessages({ sessionId, limit = 100, storageRoot } = {}) {
+  assertValidSessionId(sessionId, 'readMessages');
+
+  const root = resolveStorageRoot(storageRoot);
+  await ensureStorageRoot(root);
+
+  const logPath = messageLogPath(root, sessionId);
+  let raw;
+  try {
+    raw = await fs.readFile(logPath, 'utf8');
+  } catch (error) {
+    if (error && error.code === 'ENOENT') return [];
+    throw error;
+  }
+
+  const rows = raw
+    .split('\n')
+    .filter(Boolean)
+    .map((line) => {
+      try {
+        return JSON.parse(line);
+      } catch {
+        return null;
+      }
+    })
+    .filter(Boolean);
+
+  if (rows.length <= limit) return rows;
+  return rows.slice(rows.length - limit);
+}
diff --git a/test/agent/session-store.test.js b/test/agent/session-store.test.js
new file mode 100644
index 0000000..1816e8d
--- /dev/null
+++ b/test/agent/session-store.test.js
@@ -0,0 +1,79 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import { mkdtempSync, rmSync, writeFileSync } from 'node:fs';
+import { tmpdir } from 'node:os';
+import { join } from 'node:path';
+import {
+  createSession,
+  listSessions,
+  appendMessage,
+  readMessages,
+  updateSession,
+} from '../../agent/src/session-store.js';
+
+let storageRoot;
+
+test.before(() => {
+  storageRoot = mkdtempSync(join(tmpdir(), 'bf-sessions-'));
+});
+
+test.after(() => {
+  rmSync(storageRoot, { recursive: true, force: true });
+});
+
+test('createSession generates unique session ids', async () => {
+  const a = await createSession({ title: 'A', storageRoot });
+  const b = await createSession({ title: 'B', storageRoot });
+  assert.notEqual(a.sessionId, b.sessionId);
+});
+
+test('messages are stored and loaded by sessionId', async () => {
+  const { sessionId } = await createSession({ title: 'Test', storageRoot });
+  await appendMessage({ sessionId, role: 'user', text: 'hello', storageRoot });
+  const rows = await readMessages({ sessionId, limit: 20, storageRoot });
+  assert.equal(rows.at(-1).text, 'hello');
+});
+
+test('rejects unsafe session ids', async () => {
+  await assert.rejects(
+    appendMessage({ sessionId: '../escape', role: 'user', text: 'x', storageRoot }),
+    /safe sessionId/,
+  );
+});
+
+test('listSessions returns newest first', async () => {
+  const older = await createSession({ title: 'Older', storageRoot });
+  await new Promise((resolve) => setTimeout(resolve, 5));
+  const newer = await createSession({ title: 'Newer', storageRoot });
+  const rows = await listSessions({ limit: 10, storageRoot });
+  const olderIdx = rows.findIndex((row) => row.sessionId === older.sessionId);
+  const newerIdx = rows.findIndex((row) => row.sessionId === newer.sessionId);
+  assert.ok(newerIdx !== -1);
+  assert.ok(olderIdx !== -1);
+  assert.ok(newerIdx < olderIdx);
+});
+
+test('updateSession persists per-session model and title', async () => {
+  const created = await createSession({ title: 'Before', storageRoot });
+  const updated = await updateSession({
+    sessionId: created.sessionId,
+    patch: { title: 'After', model: 'gpt-5' },
+    storageRoot,
+  });
+
+  assert.equal(updated?.title, 'After');
+  assert.equal(updated?.model, 'gpt-5');
+
+  const rows = await listSessions({ limit: 10, storageRoot });
+  const row = rows.find((item) => item.sessionId === created.sessionId);
+  assert.equal(row?.title, 'After');
+  assert.equal(row?.model, 'gpt-5');
+});
+
+test('listSessions fails fast on corrupted index metadata', async () => {
+  writeFileSync(join(storageRoot, 'index.json'), '{this-is-not-json\n', 'utf8');
+  await assert.rejects(
+    listSessions({ limit: 10, storageRoot }),
+    /invalid session index/i,
+  );
+});

From 282e8dfc2e2806448de350e296f8a3c2167dd3ea Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 13:49:25 +0530
Subject: [PATCH 099/192] fix(agent): use unique temp index files for
 concurrent atomic writes

---
 agent/src/session-store.js | 10 +++++++---
 1 file changed, 7 insertions(+), 3 deletions(-)

diff --git a/agent/src/session-store.js b/agent/src/session-store.js
index d9e15ad..8713a93 100644
--- a/agent/src/session-store.js
+++ b/agent/src/session-store.js
@@ -65,10 +65,14 @@ async function readIndex(storageRoot) {
 
 async function writeIndex(storageRoot, sessions) {
   const path = indexPath(storageRoot);
-  const tmpPath = `${path}.tmp`;
+  const tmpPath = `${path}.${process.pid}.${Date.now()}.${Math.random().toString(16).slice(2)}.tmp`;
   await fs.mkdir(dirname(path), { recursive: true });
-  await fs.writeFile(tmpPath, `${JSON.stringify({ sessions }, null, 2)}\n`, 'utf8');
-  await fs.rename(tmpPath, path);
+  try {
+    await fs.writeFile(tmpPath, `${JSON.stringify({ sessions }, null, 2)}\n`, 'utf8');
+    await fs.rename(tmpPath, path);
+  } finally {
+    try { await fs.unlink(tmpPath); } catch {}
+  }
 }
 
 async function withIndexWriteLock(storageRoot, operation) {

From 2e86b6d21242ac34d04f49fbac6cb74f3fc9efb4 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 13:49:31 +0530
Subject: [PATCH 100/192] fix(chatd): clean up failed run starts to avoid
 leaked run state

---
 agent/src/chatd.js           | 491 +++++++++++++++++++++++++++++++++++
 test/agent/chatd-api.test.js | 193 ++++++++++++++
 2 files changed, 684 insertions(+)
 create mode 100644 agent/src/chatd.js
 create mode 100644 test/agent/chatd-api.test.js

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
new file mode 100644
index 0000000..28ff864
--- /dev/null
+++ b/agent/src/chatd.js
@@ -0,0 +1,491 @@
+import http from 'node:http';
+import { randomBytes } from 'node:crypto';
+import { promises as fs } from 'node:fs';
+import { homedir } from 'node:os';
+import { dirname, join } from 'node:path';
+import { fileURLToPath } from 'node:url';
+
+import { pickChatdPort } from './port-resolver.js';
+import { isAllowedOrigin, verifyBearer } from './auth.js';
+import { startCodexRun } from './codex-runner.js';
+import {
+  appendMessage,
+  createSession,
+  getSession,
+  isValidSessionId,
+  listSessions,
+  readMessages,
+  updateSession,
+} from './session-store.js';
+
+const BF_DIR = join(homedir(), '.browserforce');
+const CHATD_URL_PATH = join(BF_DIR, 'chatd-url.json');
+
+function nowIso() {
+  return new Date().toISOString();
+}
+
+function json(res, statusCode, body) {
+  res.statusCode = statusCode;
+  res.setHeader('content-type', 'application/json');
+  res.end(JSON.stringify(body));
+}
+
+function safeDecodeComponent(value) {
+  try {
+    return decodeURIComponent(value);
+  } catch {
+    return null;
+  }
+}
+
+async function readJsonBody(req) {
+  const chunks = [];
+  for await (const chunk of req) chunks.push(chunk);
+  if (chunks.length === 0) return {};
+  const raw = Buffer.concat(chunks).toString('utf8');
+  if (!raw.trim()) return {};
+  return JSON.parse(raw);
+}
+
+function buildEvent({ event, runId, sessionId, payload }) {
+  return {
+    event,
+    runId,
+    sessionId,
+    payload: payload || {},
+    timestamp: nowIso(),
+  };
+}
+
+async function writeChatdUrlFile({ port, token, writeChatdUrl = true, urlPath = CHATD_URL_PATH }) {
+  if (!writeChatdUrl) return;
+  await fs.mkdir(dirname(urlPath), { recursive: true });
+  await fs.writeFile(urlPath, `${JSON.stringify({ port, token })}\n`, { mode: 0o600 });
+}
+
+async function clearChatdUrlFile({ writeChatdUrl = true, urlPath = CHATD_URL_PATH }) {
+  if (!writeChatdUrl) return;
+  try {
+    await fs.unlink(urlPath);
+  } catch (error) {
+    if (error && error.code !== 'ENOENT') throw error;
+  }
+}
+
+function createDefaultRunExecutor({ codexCwd } = {}) {
+  return ({ runId, sessionId, message, model, onEvent, onExit, onError }) => startCodexRun({
+    runId,
+    sessionId,
+    prompt: message,
+    model,
+    cwd: codexCwd,
+    onEvent,
+    onExit,
+    onError,
+  });
+}
+
+export async function startChatd(opts = {}) {
+  const writeChatdUrl = opts.writeChatdUrl !== false;
+  const storageRoot = opts.storageRoot;
+  const token = opts.token || process.env.BF_CHATD_TOKEN || randomBytes(32).toString('base64url');
+  const chatdUrlPath = opts.chatdUrlPath || CHATD_URL_PATH;
+  const runExecutor = opts.runExecutor || createDefaultRunExecutor({ codexCwd: opts.codexCwd || process.cwd() });
+
+  let desiredPort = Number.isFinite(opts.port) ? Number(opts.port) : Number(process.env.BF_CHATD_PORT || 0);
+  if (!Number.isInteger(desiredPort) || desiredPort < 0) desiredPort = 0;
+
+  if (desiredPort === 0) {
+    desiredPort = await pickChatdPort({
+      envPort: Number(process.env.BF_CHATD_PORT || 0),
+      rangeStart: 19280,
+      rangeEnd: 19320,
+    }).catch(() => 0);
+  }
+
+  const startedAt = Date.now();
+  const sseClients = new Set();
+  const runs = new Map();
+
+  const broadcast = (evt) => {
+    const line = `data: ${JSON.stringify(evt)}\n\n`;
+    for (const client of sseClients) {
+      if (client.sessionId && client.sessionId !== evt.sessionId) continue;
+      try {
+        client.res.write(line);
+      } catch {
+        sseClients.delete(client);
+      }
+    }
+  };
+
+  async function finalizeRun(run, finalText) {
+    if (!run || run.status !== 'running' || run.finalSent) return;
+    run.finalSent = true;
+    run.status = 'done';
+    await appendMessage({ sessionId: run.sessionId, role: 'assistant', text: finalText, storageRoot });
+    broadcast(buildEvent({ event: 'chat.final', runId: run.runId, sessionId: run.sessionId, payload: { text: finalText } }));
+    runs.delete(run.runId);
+  }
+
+  function failRun(run, errorMessage) {
+    if (!run || run.status !== 'running') return;
+    run.status = 'error';
+    broadcast(buildEvent({
+      event: 'run.error',
+      runId: run.runId,
+      sessionId: run.sessionId,
+      payload: { error: errorMessage || 'Run failed' },
+    }));
+    runs.delete(run.runId);
+  }
+
+  const server = http.createServer(async (req, res) => {
+    try {
+      const base = `http://${req.headers.host || '127.0.0.1'}`;
+      const url = new URL(req.url || '/', base);
+
+      if (url.pathname === '/health' && req.method === 'GET') {
+        json(res, 200, {
+          ok: true,
+          pid: process.pid,
+          port: server.address()?.port || desiredPort,
+          uptimeMs: Date.now() - startedAt,
+        });
+        return;
+      }
+
+      if (url.pathname.startsWith('/v1/')) {
+        const origin = req.headers.origin;
+        if (!isAllowedOrigin(origin)) {
+          json(res, 403, { error: 'Forbidden - invalid origin' });
+          return;
+        }
+        if (!verifyBearer(req, token)) {
+          json(res, 401, { error: 'Unauthorized' });
+          return;
+        }
+      }
+
+      if (url.pathname === '/v1/sessions' && req.method === 'GET') {
+        const sessions = await listSessions({ storageRoot });
+        json(res, 200, { sessions });
+        return;
+      }
+
+      if (url.pathname === '/v1/sessions' && req.method === 'POST') {
+        let body = {};
+        try {
+          body = await readJsonBody(req);
+        } catch {
+          json(res, 400, { error: 'Invalid JSON body' });
+          return;
+        }
+        try {
+          const session = await createSession({
+            title: body.title || 'New chat',
+            model: body.model ?? null,
+            storageRoot,
+          });
+          json(res, 201, session);
+        } catch (error) {
+          json(res, 400, { error: error?.message || 'Invalid session body' });
+        }
+        return;
+      }
+
+      const sessionMatch = url.pathname.match(/^\/v1\/sessions\/([^/]+)$/);
+      if (sessionMatch && req.method === 'PATCH') {
+        const decodedSessionId = safeDecodeComponent(sessionMatch[1]);
+        if (!decodedSessionId || !isValidSessionId(decodedSessionId)) {
+          json(res, 400, { error: 'Invalid sessionId' });
+          return;
+        }
+
+        let body = {};
+        try {
+          body = await readJsonBody(req);
+        } catch {
+          json(res, 400, { error: 'Invalid JSON body' });
+          return;
+        }
+
+        try {
+          const updated = await updateSession({
+            sessionId: decodedSessionId,
+            patch: {
+              ...(Object.prototype.hasOwnProperty.call(body, 'title') ? { title: body.title } : {}),
+              ...(Object.prototype.hasOwnProperty.call(body, 'model') ? { model: body.model } : {}),
+            },
+            storageRoot,
+          });
+          if (!updated) {
+            json(res, 404, { error: 'Session not found' });
+            return;
+          }
+          json(res, 200, updated);
+        } catch (error) {
+          json(res, 400, { error: error?.message || 'Invalid session patch' });
+        }
+        return;
+      }
+
+      const messagesMatch = url.pathname.match(/^\/v1\/sessions\/([^/]+)\/messages$/);
+      if (messagesMatch && req.method === 'GET') {
+        const decodedSessionId = safeDecodeComponent(messagesMatch[1]);
+        if (!decodedSessionId || !isValidSessionId(decodedSessionId)) {
+          json(res, 400, { error: 'Invalid sessionId' });
+          return;
+        }
+        const limit = Number(url.searchParams.get('limit') || 100);
+        const messages = await readMessages({ sessionId: decodedSessionId, limit, storageRoot });
+        json(res, 200, { sessionId: decodedSessionId, messages });
+        return;
+      }
+
+      if (url.pathname === '/v1/events' && req.method === 'GET') {
+        const sessionId = url.searchParams.get('sessionId') || null;
+        if (sessionId && !isValidSessionId(sessionId)) {
+          json(res, 400, { error: 'Invalid sessionId' });
+          return;
+        }
+
+        res.writeHead(200, {
+          'content-type': 'text/event-stream; charset=utf-8',
+          'cache-control': 'no-cache, no-transform',
+          connection: 'keep-alive',
+        });
+        res.write(': connected\n\n');
+
+        const client = {
+          res,
+          sessionId,
+          heartbeat: setInterval(() => {
+            try {
+              res.write(': ping\n\n');
+            } catch {
+              // closed socket
+            }
+          }, 15000),
+        };
+        sseClients.add(client);
+
+        req.on('close', () => {
+          clearInterval(client.heartbeat);
+          sseClients.delete(client);
+        });
+        return;
+      }
+
+      if (url.pathname === '/v1/runs' && req.method === 'POST') {
+        let body;
+        try {
+          body = await readJsonBody(req);
+        } catch {
+          json(res, 400, { error: 'Invalid JSON body' });
+          return;
+        }
+
+        const { sessionId, message } = body || {};
+        if (!sessionId || typeof sessionId !== 'string') {
+          json(res, 400, { error: 'sessionId is required' });
+          return;
+        }
+        if (!isValidSessionId(sessionId)) {
+          json(res, 400, { error: 'sessionId is invalid' });
+          return;
+        }
+        if (!message || typeof message !== 'string') {
+          json(res, 400, { error: 'message is required' });
+          return;
+        }
+        const session = await getSession({ sessionId, storageRoot });
+        if (!session) {
+          json(res, 404, { error: 'Session not found' });
+          return;
+        }
+
+        const runId = randomBytes(12).toString('base64url');
+        const run = {
+          runId,
+          sessionId,
+          status: 'running',
+          abort: null,
+          assistantBuffer: '',
+          finalSent: false,
+          queue: Promise.resolve(),
+        };
+
+        const enqueue = (fn) => {
+          run.queue = run.queue.then(fn, fn);
+        };
+
+        try {
+          await appendMessage({ sessionId, role: 'user', text: message, storageRoot });
+          runs.set(runId, run);
+
+          const handle = runExecutor({
+            runId,
+            sessionId,
+            message,
+            model: session.model || null,
+            onEvent: (evt) => {
+              enqueue(async () => {
+                const active = runs.get(runId);
+                if (!active || active.status !== 'running') return;
+
+                if (evt.event === 'chat.delta') {
+                  const delta = evt.payload?.delta || '';
+                  if (delta) {
+                    active.assistantBuffer += delta;
+                    broadcast(buildEvent({ event: 'chat.delta', runId, sessionId, payload: { delta } }));
+                  }
+                  return;
+                }
+
+                if (evt.event === 'chat.final') {
+                  const text = evt.payload?.text || active.assistantBuffer || '';
+                  await finalizeRun(active, text);
+                  return;
+                }
+
+                if (evt.event === 'run.error') {
+                  failRun(active, evt.payload?.error || 'Run failed');
+                  return;
+                }
+
+                if (evt.event === 'run.started') {
+                  return;
+                }
+
+                broadcast(buildEvent({ event: evt.event, runId, sessionId, payload: evt.payload }));
+              });
+            },
+            onExit: ({ code, signal }) => {
+              enqueue(async () => {
+                const active = runs.get(runId);
+                if (!active || active.status !== 'running') return;
+
+                if (signal === 'SIGTERM' || active.status === 'aborted') return;
+
+                if (active.assistantBuffer) {
+                  await finalizeRun(active, active.assistantBuffer);
+                  return;
+                }
+
+                if (code === 0) {
+                  await finalizeRun(active, '');
+                  return;
+                }
+
+                failRun(active, `codex exited with code ${code ?? 'unknown'}`);
+              });
+            },
+            onError: (error) => {
+              enqueue(() => {
+                const active = runs.get(runId);
+                failRun(active, error?.message || 'Failed to start codex');
+              });
+            },
+          });
+
+          run.abort = handle?.abort || null;
+          broadcast(buildEvent({ event: 'run.started', runId, sessionId, payload: { message, model: session.model || null } }));
+          json(res, 202, { ok: true, runId, sessionId });
+        } catch (error) {
+          runs.delete(runId);
+          json(res, 500, { error: error?.message || 'Failed to start run' });
+        }
+        return;
+      }
+
+      const abortMatch = url.pathname.match(/^\/v1\/runs\/([^/]+)\/abort$/);
+      if (abortMatch && (req.method === 'DELETE' || req.method === 'POST')) {
+        const decodedRunId = safeDecodeComponent(abortMatch[1]);
+        if (!decodedRunId) {
+          json(res, 400, { error: 'Invalid runId' });
+          return;
+        }
+
+        const run = runs.get(decodedRunId);
+        if (!run) {
+          json(res, 404, { error: 'Run not found' });
+          return;
+        }
+
+        run.status = 'aborted';
+        run.abort?.();
+        runs.delete(decodedRunId);
+        broadcast(buildEvent({ event: 'run.aborted', runId: decodedRunId, sessionId: run.sessionId, payload: {} }));
+        json(res, 200, { ok: true, runId: decodedRunId, aborted: true });
+        return;
+      }
+
+      json(res, 404, { error: 'Not found' });
+    } catch (error) {
+      json(res, 500, { error: error?.message || 'Internal server error' });
+    }
+  });
+
+  await new Promise((resolve, reject) => {
+    server.once('error', reject);
+    server.listen(desiredPort, '127.0.0.1', resolve);
+  });
+
+  const port = server.address().port;
+  await writeChatdUrlFile({ port, token, writeChatdUrl, urlPath: chatdUrlPath });
+
+  const stop = async () => {
+    for (const run of runs.values()) {
+      run.status = 'aborted';
+      run.abort?.();
+    }
+    runs.clear();
+
+    for (const client of sseClients) {
+      clearInterval(client.heartbeat);
+      try {
+        client.res.end();
+      } catch {
+        // ignore
+      }
+    }
+    sseClients.clear();
+
+    await new Promise((resolve) => server.close(resolve));
+    await clearChatdUrlFile({ writeChatdUrl, urlPath: chatdUrlPath });
+  };
+
+  return {
+    token,
+    port,
+    baseUrl: `http://127.0.0.1:${port}`,
+    stop,
+  };
+}
+
+async function main() {
+  const daemon = await startChatd({
+    port: Number(process.env.BF_CHATD_PORT || 0),
+    token: process.env.BF_CHATD_TOKEN,
+    writeChatdUrl: true,
+  });
+
+  const shutdown = async () => {
+    await daemon.stop();
+    process.exit(0);
+  };
+
+  process.on('SIGTERM', shutdown);
+  process.on('SIGINT', shutdown);
+}
+
+if (process.argv[1] === fileURLToPath(import.meta.url)) {
+  main().catch((error) => {
+    console.error(`[chatd] ${error.stack || error.message}`);
+    process.exit(1);
+  });
+}
+
+export { CHATD_URL_PATH };
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
new file mode 100644
index 0000000..ad4e1a8
--- /dev/null
+++ b/test/agent/chatd-api.test.js
@@ -0,0 +1,193 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import { existsSync, mkdtempSync, rmSync } from 'node:fs';
+import { tmpdir } from 'node:os';
+import { join } from 'node:path';
+import { startChatd } from '../../agent/src/chatd.js';
+
+async function fetchWithRetry(url, init, attempts = 3) {
+  let lastError;
+  for (let i = 0; i < attempts; i += 1) {
+    try {
+      return await fetch(url, init);
+    } catch (error) {
+      lastError = error;
+      await new Promise((resolve) => setTimeout(resolve, 20));
+    }
+  }
+  throw lastError;
+}
+
+test('GET /health returns daemon metadata', async () => {
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const res = await fetch(`${daemon.baseUrl}/health`);
+    const body = await res.json();
+    assert.equal(body.ok, true);
+    assert.ok(typeof body.port === 'number');
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('POST /v1/runs requires explicit sessionId', async () => {
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const res = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ message: 'hello' }),
+    });
+    assert.equal(res.status, 400);
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('GET /v1/sessions/:id/messages rejects malformed encoded id', async () => {
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const res = await fetch(`${daemon.baseUrl}/v1/sessions/%E0/messages`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(res.status, 400);
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('POST /v1/runs rejects unsafe sessionId', async () => {
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const res = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: '../escape', message: 'hello' }),
+    });
+    assert.equal(res.status, 400);
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('stop removes chatd-url metadata file when enabled', async () => {
+  const tempDir = mkdtempSync(join(tmpdir(), 'bf-chatd-url-'));
+  const urlPath = join(tempDir, 'chatd-url.json');
+  const daemon = await startChatd({ port: 0, writeChatdUrl: true, chatdUrlPath: urlPath });
+  assert.equal(existsSync(urlPath), true);
+  await daemon.stop();
+  assert.equal(existsSync(urlPath), false);
+  rmSync(tempDir, { recursive: true, force: true });
+});
+
+test('POST /v1/runs uses injected run executor and persists assistant output', async () => {
+  const seenRuns = [];
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    runExecutor: ({ runId, sessionId, model, onEvent, onExit }) => {
+      seenRuns.push({ runId, sessionId, model });
+      setTimeout(() => {
+        onEvent({ event: 'chat.delta', runId, sessionId, payload: { delta: 'hel' } });
+      }, 10);
+      setTimeout(() => {
+        onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'hello' } });
+      }, 20);
+      setTimeout(() => onExit({ code: 0 }), 25);
+      return { abort() {} };
+    },
+  });
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'T' }),
+    }).then((res) => res.json());
+
+    const patched = await fetch(`${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}`, {
+      method: 'PATCH',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ model: 'gpt-5' }),
+    });
+    assert.equal(patched.status, 200);
+
+    const runRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'hi' }),
+    });
+    assert.equal(runRes.status, 202);
+
+    await new Promise((resolve) => setTimeout(resolve, 60));
+    assert.equal(seenRuns.at(-1)?.model, 'gpt-5');
+
+    const messagesBody = await fetch(
+      `${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}/messages`,
+      { headers: { authorization: `Bearer ${daemon.token}` } },
+    ).then((res) => res.json());
+    const messages = messagesBody.messages || [];
+    assert.equal(messages.at(-1).text, 'hello');
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('runExecutor synchronous failure does not leak abortable run', async () => {
+  const storageRoot = mkdtempSync(join(tmpdir(), 'bf-chatd-run-fail-'));
+  let attemptedRunId = null;
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    storageRoot,
+    runExecutor: ({ runId }) => {
+      attemptedRunId = runId;
+      throw new Error('runner boot failed');
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'T' }),
+    }).then((res) => res.json());
+
+    const runRes = await fetchWithRetry(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'hi' }),
+    });
+    assert.equal(runRes.status, 500);
+    assert.equal(typeof attemptedRunId, 'string');
+
+    const abortRes = await fetch(`${daemon.baseUrl}/v1/runs/${encodeURIComponent(attemptedRunId)}/abort`, {
+      method: 'DELETE',
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(abortRes.status, 404);
+  } finally {
+    await daemon.stop();
+    rmSync(storageRoot, { recursive: true, force: true });
+  }
+});

From 3027c79a8ae47fb41bce304eeced0f9a46f65141 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 13:51:26 +0530
Subject: [PATCH 101/192] fix(sidepanel): preserve draft and fail fast on
 send/bootstrap errors

---
 extension/agent-panel.js                     | 462 +++++++++++++++++++
 test/agent/agent-panel-send-contract.test.js |  17 +
 2 files changed, 479 insertions(+)
 create mode 100644 extension/agent-panel.js
 create mode 100644 test/agent/agent-panel-send-contract.test.js

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
new file mode 100644
index 0000000..9031732
--- /dev/null
+++ b/extension/agent-panel.js
@@ -0,0 +1,462 @@
+import { applyEvent, initialState, reduceState } from './agent-panel-state.js';
+import {
+  assignSessionRunId,
+  clearSessionRunId,
+  getSessionRunId,
+  shouldApplySessionSelection,
+} from './agent-panel-runtime.js';
+
+const MODEL_PRESETS = [
+  { value: null, label: 'Default' },
+  { value: 'gpt-5', label: 'GPT-5' },
+  { value: 'gpt-5-mini', label: 'GPT-5 Mini' },
+  { value: 'o3', label: 'o3' },
+  { value: 'o4-mini', label: 'o4-mini' },
+];
+
+const state = {
+  value: initialState,
+  auth: null,
+  currentRunBySession: {},
+  eventController: null,
+  eventLoopToken: 0,
+  sessionSelectionToken: 0,
+  popover: 'none',
+};
+
+const statusEl = document.getElementById('bf-agent-status');
+const statusIconEl = document.getElementById('bf-agent-status-icon');
+const statusTextEl = document.getElementById('bf-agent-status-text');
+const modelTriggerBtn = document.getElementById('bf-model-trigger');
+const sessionTriggerBtn = document.getElementById('bf-session-trigger');
+const newSessionBtn = document.getElementById('bf-new-session');
+const popoverBackdropEl = document.getElementById('bf-popover-backdrop');
+const modelPanelEl = document.getElementById('bf-model-panel');
+const sessionPanelEl = document.getElementById('bf-session-panel');
+const modelListEl = document.getElementById('bf-model-list');
+const switchSessionListEl = document.getElementById('bf-switch-session-list');
+const transcriptEl = document.getElementById('bf-transcript');
+const chatFormEl = document.getElementById('bf-chat-form');
+const chatInputEl = document.getElementById('bf-chat-input');
+const stopRunBtn = document.getElementById('bf-stop-run');
+const sendBtn = chatFormEl.querySelector('button[type="submit"]');
+
+function setStatus(kind, text) {
+  statusTextEl.textContent = text;
+  statusEl.classList.toggle('error', kind === 'error');
+  statusIconEl.textContent = kind === 'error' ? '!' : '●';
+}
+
+function setComposerEnabled(enabled) {
+  chatInputEl.disabled = !enabled;
+  stopRunBtn.disabled = !enabled;
+  sendBtn.disabled = !enabled;
+}
+
+function dispatch(action) {
+  state.value = reduceState(state.value, action);
+  render();
+}
+
+function dispatchEvent(evt) {
+  state.value = applyEvent(state.value, evt);
+  if (evt?.event === 'run.started' && evt.sessionId && evt.runId) {
+    state.currentRunBySession = assignSessionRunId(state.currentRunBySession, evt.sessionId, evt.runId);
+  }
+  if (evt?.sessionId && evt?.runId && (evt.event === 'chat.final' || evt.event === 'run.error' || evt.event === 'run.aborted')) {
+    state.currentRunBySession = clearSessionRunId(state.currentRunBySession, evt.sessionId, evt.runId);
+  }
+  render();
+}
+
+function getActiveSession() {
+  return state.value.sessions.find((item) => item.sessionId === state.value.activeSessionId) || null;
+}
+
+function getActiveMessages() {
+  return state.value.messagesBySession[state.value.activeSessionId] || [];
+}
+
+function formatModelLabel(model) {
+  return model && String(model).trim() ? model : 'Default';
+}
+
+function renderSelectors() {
+  const activeSession = getActiveSession();
+  modelTriggerBtn.textContent = `Model: ${formatModelLabel(activeSession?.model)}`;
+  sessionTriggerBtn.textContent = activeSession?.title || 'Session';
+}
+
+function renderModelList() {
+  const activeSession = getActiveSession();
+  const activeModel = activeSession?.model || null;
+
+  const rows = MODEL_PRESETS.map((preset) => {
+    const active = (preset.value || null) === activeModel ? 'active' : '';
+    return `<li><button type="button" data-model="${escapeHtml(preset.value || '')}" class="${active}">${escapeHtml(preset.label)}</button></li>`;
+  });
+  rows.push('<li><button type="button" data-model-custom="1">Custom...</button></li>');
+
+  modelListEl.innerHTML = rows.join('');
+
+  modelListEl.querySelectorAll('button[data-model]').forEach((button) => {
+    button.addEventListener('click', () => {
+      const model = button.dataset.model || null;
+      updateActiveSessionModel(model).catch((error) => {
+        setStatus('error', error.message || 'Unable to update model');
+      });
+    });
+  });
+
+  const customBtn = modelListEl.querySelector('button[data-model-custom]');
+  if (customBtn) {
+    customBtn.addEventListener('click', async () => {
+      const current = activeModel || '';
+      const value = window.prompt('Enter model id', current);
+      if (value === null) return;
+      const model = value.trim() || null;
+      try {
+        await updateActiveSessionModel(model);
+      } catch (error) {
+        setStatus('error', error.message || 'Unable to update model');
+      }
+    });
+  }
+}
+
+function renderSessions() {
+  const sessions = state.value.sessions;
+  if (!sessions.length) {
+    switchSessionListEl.innerHTML = '<li class="empty-item">No sessions</li>';
+    return;
+  }
+
+  switchSessionListEl.innerHTML = sessions
+    .map((session) => {
+      const active = session.sessionId === state.value.activeSessionId ? 'active' : '';
+      const title = session.title || session.sessionId;
+      return `<li><button type="button" data-session-id="${session.sessionId}" class="${active}">${escapeHtml(title)}</button></li>`;
+    })
+    .join('');
+
+  switchSessionListEl.querySelectorAll('button[data-session-id]').forEach((button) => {
+    button.addEventListener('click', async () => {
+      await selectSession(button.dataset.sessionId);
+      setPopover('none');
+    });
+  });
+}
+
+function renderTranscript() {
+  const messages = getActiveMessages();
+  const sessionId = state.value.activeSessionId;
+  const sessionRunId = getSessionRunId(state.currentRunBySession, sessionId);
+  const run = sessionRunId ? state.value.runs[sessionRunId] : null;
+
+  const chunks = messages.map((msg) => {
+    const role = msg.role || 'assistant';
+    return `<article class="message ${role}">${escapeHtml(msg.text || '')}</article>`;
+  });
+
+  if (run && !run.done) {
+    chunks.push(`<article class="message assistant">${escapeHtml(run.text || '')}</article>`);
+  }
+
+  transcriptEl.innerHTML = chunks.join('') || '<article class="message assistant">No messages yet.</article>';
+  transcriptEl.scrollTop = transcriptEl.scrollHeight;
+}
+
+function setPopover(popover) {
+  state.popover = popover;
+  renderPopovers();
+}
+
+function renderPopovers() {
+  const modelOpen = state.popover === 'model';
+  const sessionOpen = state.popover === 'session';
+  const anyOpen = modelOpen || sessionOpen;
+
+  modelTriggerBtn.setAttribute('aria-expanded', modelOpen ? 'true' : 'false');
+  sessionTriggerBtn.setAttribute('aria-expanded', sessionOpen ? 'true' : 'false');
+  popoverBackdropEl.classList.toggle('hidden', !anyOpen);
+  modelPanelEl.classList.toggle('hidden', !modelOpen);
+  sessionPanelEl.classList.toggle('hidden', !sessionOpen);
+}
+
+function render() {
+  renderSelectors();
+  renderModelList();
+  renderSessions();
+  renderTranscript();
+  renderPopovers();
+}
+
+function escapeHtml(value) {
+  const div = document.createElement('div');
+  div.textContent = value;
+  return div.innerHTML;
+}
+
+function sleep(ms) {
+  return new Promise((resolve) => setTimeout(resolve, ms));
+}
+
+async function getRelayHttpUrl() {
+  const stored = await chrome.storage.local.get(['relayUrl']);
+  const relayUrl = stored.relayUrl || 'ws://127.0.0.1:19222/extension';
+  if (relayUrl.startsWith('ws://')) return relayUrl.replace('ws://', 'http://').replace('/extension', '');
+  if (relayUrl.startsWith('wss://')) return relayUrl.replace('wss://', 'https://').replace('/extension', '');
+  return 'http://127.0.0.1:19222';
+}
+
+async function loadAuth() {
+  const relayHttpUrl = await getRelayHttpUrl();
+  const res = await fetch(`${relayHttpUrl}/chatd-url`);
+  if (!res.ok) throw new Error('daemon_unavailable');
+  const body = await res.json();
+  state.auth = {
+    baseUrl: `http://127.0.0.1:${body.port}`,
+    token: body.token,
+  };
+}
+
+async function api(path, init = {}) {
+  const headers = {
+    'content-type': 'application/json',
+    authorization: `Bearer ${state.auth.token}`,
+    ...(init.headers || {}),
+  };
+  return fetch(`${state.auth.baseUrl}${path}`, { ...init, headers });
+}
+
+async function readJsonOrEmpty(response) {
+  try {
+    return await response.json();
+  } catch {
+    return {};
+  }
+}
+
+async function ensureOk(response, fallbackMessage) {
+  if (response.ok) return response;
+  const body = await readJsonOrEmpty(response);
+  throw new Error(body.error || `${fallbackMessage} (${response.status})`);
+}
+
+async function loadSessions(preferredSessionId = null) {
+  const res = await api('/v1/sessions');
+  await ensureOk(res, 'Failed to load sessions');
+  const body = await readJsonOrEmpty(res);
+  const sessions = body.sessions || [];
+  const activeFromPreference = preferredSessionId && sessions.some((s) => s.sessionId === preferredSessionId)
+    ? preferredSessionId
+    : null;
+  dispatch({
+    type: 'session.list.loaded',
+    sessions,
+    activeSessionId: activeFromPreference || sessions[0]?.sessionId || null,
+  });
+}
+
+async function loadMessages(sessionId) {
+  const res = await api(`/v1/sessions/${encodeURIComponent(sessionId)}/messages?limit=200`, {
+    method: 'GET',
+    headers: {},
+  });
+  await ensureOk(res, 'Failed to load messages');
+  const body = await readJsonOrEmpty(res);
+  dispatch({ type: 'messages.loaded', sessionId, messages: body.messages || [] });
+}
+
+async function selectSession(sessionId) {
+  state.sessionSelectionToken += 1;
+  const selectionToken = state.sessionSelectionToken;
+  dispatch({ type: 'session.selected', sessionId });
+  await loadMessages(sessionId);
+  if (!shouldApplySessionSelection({
+    requestToken: selectionToken,
+    latestRequestToken: state.sessionSelectionToken,
+    requestedSessionId: sessionId,
+    activeSessionId: state.value.activeSessionId,
+  })) {
+    return;
+  }
+  connectEvents(sessionId);
+}
+
+async function createSession() {
+  const res = await api('/v1/sessions', {
+    method: 'POST',
+    body: JSON.stringify({ title: 'New Session' }),
+  });
+  await ensureOk(res, 'Failed to create session');
+  const created = await readJsonOrEmpty(res);
+  await loadSessions(created.sessionId);
+  await selectSession(created.sessionId);
+}
+
+async function updateActiveSessionModel(model) {
+  const sessionId = state.value.activeSessionId;
+  if (!sessionId) return;
+
+  const res = await api(`/v1/sessions/${encodeURIComponent(sessionId)}`, {
+    method: 'PATCH',
+    body: JSON.stringify({ model }),
+  });
+  if (!res.ok) {
+    const body = await res.json().catch(() => ({}));
+    throw new Error(body.error || 'Unable to update model');
+  }
+
+  await loadSessions(sessionId);
+  setPopover('none');
+  setStatus('ready', 'Ready');
+}
+
+async function consumeEventStream(body, loopToken) {
+  if (!body) return;
+  const reader = body.getReader();
+  const decoder = new TextDecoder();
+  let buffer = '';
+
+  while (state.eventLoopToken === loopToken) {
+    const { done, value } = await reader.read();
+    if (done) break;
+    buffer += decoder.decode(value, { stream: true });
+    const frames = buffer.split('\n\n');
+    buffer = frames.pop() || '';
+    for (const frame of frames) {
+      for (const line of frame.split('\n')) {
+        if (!line.startsWith('data: ')) continue;
+        try {
+          const evt = JSON.parse(line.slice(6));
+          dispatchEvent(evt);
+        } catch {
+          // ignore malformed event
+        }
+      }
+    }
+  }
+}
+
+function connectEvents(sessionId) {
+  state.eventLoopToken += 1;
+  const loopToken = state.eventLoopToken;
+  if (state.eventController) state.eventController.abort();
+
+  (async () => {
+    let backoffMs = 250;
+    while (state.eventLoopToken === loopToken && state.value.activeSessionId === sessionId) {
+      const controller = new AbortController();
+      state.eventController = controller;
+
+      try {
+        const response = await fetch(
+          `${state.auth.baseUrl}/v1/events?sessionId=${encodeURIComponent(sessionId)}`,
+          {
+            headers: { authorization: `Bearer ${state.auth.token}` },
+            signal: controller.signal,
+          },
+        );
+
+        if (!response.ok) {
+          throw new Error(`Event stream failed (${response.status})`);
+        }
+
+        backoffMs = 250;
+        await consumeEventStream(response.body, loopToken);
+      } catch {
+        if (controller.signal.aborted || state.eventLoopToken !== loopToken) break;
+        const jitter = Math.floor(Math.random() * 150);
+        await sleep(backoffMs + jitter);
+        backoffMs = Math.min(backoffMs * 2, 4000);
+      }
+    }
+  })().catch(() => {
+    // no-op
+  });
+}
+
+async function sendMessage(text) {
+  const sessionId = state.value.activeSessionId;
+  if (!sessionId || !text.trim()) return;
+
+  const existing = getActiveMessages();
+  dispatch({ type: 'messages.loaded', sessionId, messages: [...existing, { role: 'user', text }] });
+
+  const res = await api('/v1/runs', {
+    method: 'POST',
+    body: JSON.stringify({ sessionId, message: text }),
+  });
+  if (!res.ok) {
+    dispatch({ type: 'messages.loaded', sessionId, messages: existing });
+    const body = await readJsonOrEmpty(res);
+    throw new Error(body.error || `Failed to send message (${res.status})`);
+  }
+  const body = await readJsonOrEmpty(res);
+  if (body.runId) {
+    state.currentRunBySession = assignSessionRunId(state.currentRunBySession, sessionId, body.runId);
+  }
+}
+
+async function stopRun() {
+  const sessionId = state.value.activeSessionId;
+  const runId = getSessionRunId(state.currentRunBySession, sessionId);
+  if (!runId) return;
+  await api(`/v1/runs/${encodeURIComponent(runId)}/abort`, {
+    method: 'DELETE',
+    headers: {},
+  });
+}
+
+chatFormEl.addEventListener('submit', async (event) => {
+  event.preventDefault();
+  const text = chatInputEl.value;
+  try {
+    await sendMessage(text);
+    chatInputEl.value = '';
+  } catch (error) {
+    chatInputEl.value = text;
+    setStatus('error', error?.message || 'Failed to send message');
+  }
+});
+
+newSessionBtn.addEventListener('click', () => {
+  createSession()
+    .then(() => setPopover('none'))
+    .catch((err) => setStatus('error', err.message || 'Unable to create session'));
+});
+
+stopRunBtn.addEventListener('click', () => {
+  stopRun().catch((err) => setStatus('error', err.message || 'Unable to stop run'));
+});
+
+modelTriggerBtn.addEventListener('click', () => {
+  setPopover(state.popover === 'model' ? 'none' : 'model');
+});
+
+sessionTriggerBtn.addEventListener('click', () => {
+  setPopover(state.popover === 'session' ? 'none' : 'session');
+});
+
+popoverBackdropEl.addEventListener('click', () => {
+  setPopover('none');
+});
+
+(async function init() {
+  try {
+    setStatus('info', 'Connecting...');
+    await loadAuth();
+    await loadSessions();
+    if (!state.value.activeSessionId) {
+      await createSession();
+    } else {
+      await selectSession(state.value.activeSessionId);
+    }
+    setComposerEnabled(true);
+    setStatus('ready', 'Ready');
+  } catch {
+    setComposerEnabled(false);
+    setStatus('error', 'Daemon unavailable');
+  }
+})();
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
new file mode 100644
index 0000000..e7301c1
--- /dev/null
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -0,0 +1,17 @@
+import fs from 'node:fs';
+import test from 'node:test';
+import assert from 'node:assert/strict';
+
+const js = fs.readFileSync('extension/agent-panel.js', 'utf8');
+
+test('sendMessage validates run creation response status', () => {
+  assert.match(js, /async function sendMessage[\s\S]*if \(!res\.ok\)/);
+  assert.match(js, /Failed to send message/);
+  assert.match(js, /messages: existing/);
+});
+
+test('submit handler preserves draft on send failure', () => {
+  assert.match(js, /chatFormEl\.addEventListener\('submit'/);
+  assert.match(js, /try\s*\{\s*await sendMessage\(text\);[\s\S]*chatInputEl\.value = '';/);
+  assert.match(js, /catch\s*\(\w+\)\s*\{[\s\S]*chatInputEl\.value = text;/);
+});

From cf63486c4a676726e3cd48d16816bba5a53ad644 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 13:52:25 +0530
Subject: [PATCH 102/192] fix(auth): require strict localhost origin matching

---
 agent/src/auth.js       | 22 ++++++++++++++++++++++
 test/agent/auth.test.js | 37 +++++++++++++++++++++++++++++++++++++
 2 files changed, 59 insertions(+)
 create mode 100644 agent/src/auth.js
 create mode 100644 test/agent/auth.test.js

diff --git a/agent/src/auth.js b/agent/src/auth.js
new file mode 100644
index 0000000..e104851
--- /dev/null
+++ b/agent/src/auth.js
@@ -0,0 +1,22 @@
+export function verifyBearer(req, expectedToken) {
+  const header = req?.headers?.authorization || req?.headers?.Authorization || '';
+  if (!header.startsWith('Bearer ')) return false;
+  const token = header.slice(7);
+  return token === expectedToken;
+}
+
+export function isAllowedOrigin(origin) {
+  if (!origin) return true;
+  let parsed;
+  try {
+    parsed = new URL(origin);
+  } catch {
+    return false;
+  }
+
+  if (parsed.protocol === 'chrome-extension:') return true;
+  if (parsed.protocol !== 'http:' && parsed.protocol !== 'https:') return false;
+
+  const host = parsed.hostname;
+  return host === '127.0.0.1' || host === 'localhost' || host === '::1' || host === '[::1]';
+}
diff --git a/test/agent/auth.test.js b/test/agent/auth.test.js
new file mode 100644
index 0000000..cdcc5e2
--- /dev/null
+++ b/test/agent/auth.test.js
@@ -0,0 +1,37 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import { verifyBearer, isAllowedOrigin } from '../../agent/src/auth.js';
+
+test('rejects missing bearer token', () => {
+  assert.equal(verifyBearer({ headers: {} }, 'secret'), false);
+});
+
+test('rejects wrong bearer token', () => {
+  assert.equal(verifyBearer({ headers: { authorization: 'Bearer wrong' } }, 'secret'), false);
+});
+
+test('accepts valid bearer token', () => {
+  assert.equal(verifyBearer({ headers: { authorization: 'Bearer secret' } }, 'secret'), true);
+});
+
+test('allows chrome-extension origin', () => {
+  assert.equal(isAllowedOrigin('chrome-extension://abc123'), true);
+});
+
+test('allows requests with no origin (CLI)', () => {
+  assert.equal(isAllowedOrigin(undefined), true);
+});
+
+test('allows localhost origins', () => {
+  assert.equal(isAllowedOrigin('http://127.0.0.1:3000'), true);
+  assert.equal(isAllowedOrigin('http://localhost:3000'), true);
+});
+
+test('rejects external origins', () => {
+  assert.equal(isAllowedOrigin('https://evil.com'), false);
+});
+
+test('rejects deceptive localhost-like origins', () => {
+  assert.equal(isAllowedOrigin('http://localhost.evil.com:3000'), false);
+  assert.equal(isAllowedOrigin('https://127.0.0.1.attacker.tld'), false);
+});

From 890a9652c0c1f7380124d5ff3af6562e711a80af Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 13:53:10 +0530
Subject: [PATCH 103/192] fix(chatd): honor BF_CHATD_URL_PATH in daemon path
 resolution

---
 agent/src/chatd.js           |  2 +-
 test/agent/chatd-api.test.js | 17 +++++++++++++++++
 2 files changed, 18 insertions(+), 1 deletion(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 28ff864..3e26f07 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -90,7 +90,7 @@ export async function startChatd(opts = {}) {
   const writeChatdUrl = opts.writeChatdUrl !== false;
   const storageRoot = opts.storageRoot;
   const token = opts.token || process.env.BF_CHATD_TOKEN || randomBytes(32).toString('base64url');
-  const chatdUrlPath = opts.chatdUrlPath || CHATD_URL_PATH;
+  const chatdUrlPath = opts.chatdUrlPath || process.env.BF_CHATD_URL_PATH || CHATD_URL_PATH;
   const runExecutor = opts.runExecutor || createDefaultRunExecutor({ codexCwd: opts.codexCwd || process.cwd() });
 
   let desiredPort = Number.isFinite(opts.port) ? Number(opts.port) : Number(process.env.BF_CHATD_PORT || 0);
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index ad4e1a8..8d046a5 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -86,6 +86,23 @@ test('stop removes chatd-url metadata file when enabled', async () => {
   rmSync(tempDir, { recursive: true, force: true });
 });
 
+test('daemon honors BF_CHATD_URL_PATH when no explicit path option is provided', async () => {
+  const tempDir = mkdtempSync(join(tmpdir(), 'bf-chatd-env-url-'));
+  const envUrlPath = join(tempDir, 'chatd-env-url.json');
+  const previous = process.env.BF_CHATD_URL_PATH;
+  process.env.BF_CHATD_URL_PATH = envUrlPath;
+
+  const daemon = await startChatd({ port: 0, writeChatdUrl: true });
+  try {
+    assert.equal(existsSync(envUrlPath), true);
+  } finally {
+    await daemon.stop();
+    if (previous == null) delete process.env.BF_CHATD_URL_PATH;
+    else process.env.BF_CHATD_URL_PATH = previous;
+    rmSync(tempDir, { recursive: true, force: true });
+  }
+});
+
 test('POST /v1/runs uses injected run executor and persists assistant output', async () => {
   const seenRuns = [];
   const daemon = await startChatd({

From 306c492f210464a95a0831549526861c8bddf88e Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 13:54:24 +0530
Subject: [PATCH 104/192] fix(extension): rollback relay URL when reconnect
 attempt fails

---
 extension/background.js                       | 95 +++++++++++++++++--
 .../relay-url-reconnect-contract.test.js      | 20 ++++
 2 files changed, 106 insertions(+), 9 deletions(-)
 create mode 100644 test/agent/relay-url-reconnect-contract.test.js

diff --git a/extension/background.js b/extension/background.js
index 9f1e1c7..adf8d50 100644
--- a/extension/background.js
+++ b/extension/background.js
@@ -36,8 +36,7 @@ let restrictionExplained = false;
 
 (async function init() {
   const stored = await chrome.storage.local.get(['relayUrl']);
-  const relayUrl = stored.relayUrl || RELAY_URL_DEFAULT;
-  currentRelayUrl = relayUrl;
+  currentRelayUrl = stored.relayUrl || RELAY_URL_DEFAULT;
 
   // Register debugger listeners once (persists across reconnections)
   chrome.debugger.onEvent.addListener(onDebuggerEvent);
@@ -51,22 +50,22 @@ let restrictionExplained = false;
   chrome.alarms.create('bf-reconnect', { periodInMinutes: 0.5 });
   chrome.alarms.onAlarm.addListener((alarm) => {
     if (alarm.name === 'bf-reconnect' && !ws) {
-      startMaintainLoop(relayUrl);
+      startMaintainLoop();
     }
   });
 
-  startMaintainLoop(relayUrl);
+  startMaintainLoop();
 })();
 
 // ─── Connection Management ───────────────────────────────────────────────────
 
-function startMaintainLoop(relayUrl) {
+function startMaintainLoop() {
   if (maintainLoopActive) return;
   maintainLoopActive = true;
-  maintainConnection(relayUrl);
+  maintainConnection();
 }
 
-async function maintainConnection(relayUrl) {
+async function maintainConnection() {
   while (true) {
     if (!ws || ws.readyState !== WebSocket.OPEN) {
       if (connectionState !== 'connecting') {
@@ -75,7 +74,7 @@ async function maintainConnection(relayUrl) {
       }
 
       try {
-        await connect(relayUrl);
+        await connect(currentRelayUrl);
       } catch {
         connectionState = 'disconnected';
         updateBadge();
@@ -141,6 +140,41 @@ function connect(relayUrl) {
   });
 }
 
+function requestRelayReconnect() {
+  if (connectionState !== 'connecting') {
+    connectionState = 'connecting';
+    updateBadge();
+  }
+
+  if (ws) {
+    try {
+      ws.close();
+    } catch {
+      // ignore close races
+    }
+  } else if (!maintainLoopActive) {
+    startMaintainLoop();
+  }
+}
+
+async function waitForConnectionState(timeoutMs = 5000) {
+  const started = Date.now();
+  while (Date.now() - started < timeoutMs) {
+    if (connectionState === 'connected') return connectionState;
+    await sleep(200);
+  }
+  return connectionState;
+}
+
+function isValidRelayUrl(relayUrl) {
+  try {
+    const parsed = new URL(relayUrl);
+    return parsed.protocol === 'ws:' || parsed.protocol === 'wss:';
+  } catch {
+    return false;
+  }
+}
+
 // ─── Relay Message Handling ──────────────────────────────────────────────────
 
 function handleRelayMessage(msg) {
@@ -614,7 +648,11 @@ async function checkInactiveTabs() {
 
 chrome.storage.onChanged.addListener(async (changes) => {
   if (changes.relayUrl) {
-    currentRelayUrl = changes.relayUrl.newValue || RELAY_URL_DEFAULT;
+    const nextRelayUrl = changes.relayUrl.newValue || RELAY_URL_DEFAULT;
+    if (nextRelayUrl !== currentRelayUrl) {
+      currentRelayUrl = nextRelayUrl;
+      requestRelayReconnect();
+    }
   }
 
   if (changes.autoDetachMinutes || changes.autoCloseMinutes) {
@@ -799,6 +837,45 @@ chrome.runtime.onMessage.addListener((msg, _sender, sendResponse) => {
     return true; // async sendResponse
   }
 
+  if (msg.type === 'updateRelayUrl') {
+    const relayUrl = typeof msg.relayUrl === 'string' ? msg.relayUrl.trim() : '';
+    if (!relayUrl) {
+      sendResponse({ error: 'Relay URL is required' });
+      return false;
+    }
+    if (!isValidRelayUrl(relayUrl)) {
+      sendResponse({ error: 'Relay URL must start with ws:// or wss://' });
+      return false;
+    }
+
+    const previousRelayUrl = currentRelayUrl;
+    currentRelayUrl = relayUrl;
+    requestRelayReconnect();
+    waitForConnectionState(5000).then((settledState) => {
+      if (settledState !== 'connected') {
+        currentRelayUrl = previousRelayUrl;
+        requestRelayReconnect();
+        waitForConnectionState(5000).then((fallbackState) => {
+          sendResponse({ error: 'Connection failed', connectionState: fallbackState });
+        });
+        return;
+      }
+
+      chrome.storage.local.set({ relayUrl }, () => {
+        if (chrome.runtime.lastError) {
+          sendResponse({ error: chrome.runtime.lastError.message || 'Failed to save relay URL' });
+          return;
+        }
+        sendResponse({ ok: true, connectionState: settledState });
+      });
+    }).catch(() => {
+      currentRelayUrl = previousRelayUrl;
+      requestRelayReconnect();
+      sendResponse({ error: 'Connection failed', connectionState: connectionState });
+    });
+    return true; // async sendResponse
+  }
+
   if (msg.type === 'attachCurrentTab') {
     chrome.tabs.query({ active: true, currentWindow: true }, async (tabs) => {
       const tab = tabs[0];
diff --git a/test/agent/relay-url-reconnect-contract.test.js b/test/agent/relay-url-reconnect-contract.test.js
new file mode 100644
index 0000000..bd8c8c9
--- /dev/null
+++ b/test/agent/relay-url-reconnect-contract.test.js
@@ -0,0 +1,20 @@
+import fs from 'node:fs';
+import test from 'node:test';
+import assert from 'node:assert/strict';
+
+const popupJs = fs.readFileSync('extension/popup.js', 'utf8');
+const backgroundJs = fs.readFileSync('extension/background.js', 'utf8');
+
+test('popup save flow requests relay reconnect and shows feedback states', () => {
+  assert.match(popupJs, /Connecting\.\.\./);
+  assert.match(popupJs, /Connected/);
+  assert.match(popupJs, /Connection failed/);
+  assert.match(popupJs, /type:\s*'updateRelayUrl'/);
+});
+
+test('background handles updateRelayUrl message and triggers reconnect', () => {
+  assert.match(backgroundJs, /msg\.type === 'updateRelayUrl'/);
+  assert.match(backgroundJs, /requestRelayReconnect\(/);
+  assert.match(backgroundJs, /const previousRelayUrl = currentRelayUrl/);
+  assert.match(backgroundJs, /currentRelayUrl = previousRelayUrl/);
+});

From 911326e0ef61cdcdfa0ee9169bc7cd12c5a40752 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 13:55:57 +0530
Subject: [PATCH 105/192] test(agent): avoid fixed-port collisions in port
 resolver fallback test

---
 test/agent/port-resolver.test.js | 54 ++++++++++++++++++++++++++++++++
 1 file changed, 54 insertions(+)
 create mode 100644 test/agent/port-resolver.test.js

diff --git a/test/agent/port-resolver.test.js b/test/agent/port-resolver.test.js
new file mode 100644
index 0000000..66b3ddd
--- /dev/null
+++ b/test/agent/port-resolver.test.js
@@ -0,0 +1,54 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import net from 'node:net';
+import { pickChatdPort } from '../../agent/src/port-resolver.js';
+
+async function bindPort(port) {
+  return new Promise((resolve, reject) => {
+    const server = net.createServer();
+    server.once('error', reject);
+    server.listen(port, '127.0.0.1', () => resolve(server));
+  });
+}
+
+async function getEphemeralPort() {
+  const server = await bindPort(0);
+  const port = server.address().port;
+  await new Promise((resolve) => server.close(resolve));
+  return port;
+}
+
+async function findConsecutiveFreeRange(size = 3) {
+  for (let attempt = 0; attempt < 40; attempt += 1) {
+    const start = await getEphemeralPort();
+    const blockers = [];
+    let ok = true;
+    for (let i = 1; i < size; i += 1) {
+      try {
+        blockers.push(await bindPort(start + i));
+      } catch {
+        ok = false;
+        break;
+      }
+    }
+    await Promise.all(blockers.map((server) => new Promise((resolve) => server.close(resolve))));
+    if (ok) return start;
+  }
+  throw new Error('unable to find a consecutive free port range for test');
+}
+
+test('prefers BF_CHATD_PORT when free', async () => {
+  const port = await pickChatdPort({ envPort: 19301, rangeStart: 19280, rangeEnd: 19320 });
+  assert.equal(port, 19301);
+});
+
+test('falls back to first free range port when env port unavailable', async () => {
+  const rangeStart = await findConsecutiveFreeRange(3);
+  const blocker = await bindPort(rangeStart);
+  try {
+    const port = await pickChatdPort({ envPort: rangeStart, rangeStart, rangeEnd: rangeStart + 2 });
+    assert.equal(port, rangeStart + 1);
+  } finally {
+    await new Promise((resolve) => blocker.close(resolve));
+  }
+});

From d0b9b59c8a34a008110de852aa0ee711154edfda Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 20:18:38 +0530
Subject: [PATCH 106/192] fix(chatd): keep runs alive across transient codex
 transport errors

---
 agent/src/codex-runner.js       | 165 ++++++++++++++++++++++++++++++++
 test/agent/codex-runner.test.js |  57 +++++++++++
 2 files changed, 222 insertions(+)
 create mode 100644 agent/src/codex-runner.js
 create mode 100644 test/agent/codex-runner.test.js

diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
new file mode 100644
index 0000000..96d6088
--- /dev/null
+++ b/agent/src/codex-runner.js
@@ -0,0 +1,165 @@
+import { spawn } from 'node:child_process';
+import readline from 'node:readline';
+
+function envelope({ event, runId, sessionId, payload }) {
+  return {
+    event,
+    runId,
+    sessionId,
+    payload: payload || {},
+    timestamp: new Date().toISOString(),
+  };
+}
+
+function safeParse(line) {
+  if (typeof line !== 'string') return null;
+  try {
+    return JSON.parse(line);
+  } catch {
+    return null;
+  }
+}
+
+export function normalizeCodexLine({ runId, sessionId, line }) {
+  const parsed = safeParse(line);
+  if (!parsed || typeof parsed !== 'object') {
+    return envelope({ event: 'chat.delta', runId, sessionId, payload: { delta: String(line || '') } });
+  }
+
+  const type = String(parsed.type || '').toLowerCase();
+
+  if (type === 'delta' || type === 'text_delta') {
+    return envelope({ event: 'chat.delta', runId, sessionId, payload: { delta: String(parsed.text || '') } });
+  }
+
+  if (type === 'final' || type === 'done' || type === 'text_final') {
+    return envelope({ event: 'chat.final', runId, sessionId, payload: { text: String(parsed.text || '') } });
+  }
+
+  if (type === 'thread.started' || type === 'turn.started' || type === 'run_started') {
+    return envelope({ event: 'run.started', runId, sessionId, payload: parsed });
+  }
+
+  if (type === 'item.completed') {
+    const itemType = parsed.item?.type || '';
+    if (itemType === 'agent_message') {
+      return envelope({
+        event: 'chat.final',
+        runId,
+        sessionId,
+        payload: { text: String(parsed.item?.text || '') },
+      });
+    }
+    if (itemType === 'reasoning') {
+      return envelope({ event: 'tool.delta', runId, sessionId, payload: parsed.item || parsed });
+    }
+  }
+
+  if (type === 'error') {
+    return envelope({
+      event: 'tool.delta',
+      runId,
+      sessionId,
+      payload: { level: 'warning', message: parsed.message || parsed.error || 'unknown warning' },
+    });
+  }
+
+  if (type === 'run_error' || type === 'thread.error') {
+    return envelope({
+      event: 'run.error',
+      runId,
+      sessionId,
+      payload: { error: parsed.error || parsed.message || 'unknown error' },
+    });
+  }
+
+  if (type === 'run_aborted' || type === 'aborted') {
+    return envelope({ event: 'run.aborted', runId, sessionId, payload: parsed });
+  }
+
+  if (type === 'tool_start') {
+    return envelope({ event: 'tool.started', runId, sessionId, payload: parsed });
+  }
+
+  if (type === 'tool_delta') {
+    return envelope({ event: 'tool.delta', runId, sessionId, payload: parsed });
+  }
+
+  if (type === 'tool_end') {
+    return envelope({ event: 'tool.final', runId, sessionId, payload: parsed });
+  }
+
+  return envelope({ event: 'run.event', runId, sessionId, payload: parsed });
+}
+
+export function buildCodexExecArgs({ prompt, model, args } = {}) {
+  if (Array.isArray(args) && args.length > 0) return args;
+  const resolved = ['exec', '--json'];
+  if (typeof model === 'string' && model.trim()) {
+    resolved.push('--model', model.trim());
+  }
+  resolved.push(prompt || '');
+  return resolved;
+}
+
+export function startCodexRun({
+  runId,
+  sessionId,
+  prompt,
+  cwd,
+  onEvent,
+  onExit,
+  onError,
+  command,
+  args,
+  model,
+} = {}) {
+  const cmd = command || process.env.BF_CHATD_CODEX_COMMAND || 'codex';
+  const argv = buildCodexExecArgs({ prompt, model, args });
+
+  const child = spawn(cmd, argv, {
+    cwd,
+    env: process.env,
+    stdio: ['ignore', 'pipe', 'pipe'],
+  });
+
+  const stdoutLines = readline.createInterface({ input: child.stdout });
+  stdoutLines.on('line', (line) => {
+    try {
+      const evt = normalizeCodexLine({ runId, sessionId, line });
+      onEvent?.(evt);
+    } catch (error) {
+      onError?.(error);
+    }
+  });
+
+  const stderrLines = readline.createInterface({ input: child.stderr });
+  stderrLines.on('line', (line) => {
+    if (!line) return;
+    onEvent?.(envelope({
+      event: 'tool.delta',
+      runId,
+      sessionId,
+      payload: { stream: 'stderr', text: line },
+    }));
+  });
+
+  child.on('error', (error) => {
+    onError?.(error);
+  });
+
+  child.on('close', (code, signal) => {
+    onExit?.({ code, signal });
+  });
+
+  return {
+    pid: child.pid,
+    abort() {
+      try {
+        child.kill('SIGTERM');
+      } catch {
+        // ignore kill races
+      }
+    },
+  };
+}
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
new file mode 100644
index 0000000..d0ce6ad
--- /dev/null
+++ b/test/agent/codex-runner.test.js
@@ -0,0 +1,57 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import { buildCodexExecArgs, normalizeCodexLine } from '../../agent/src/codex-runner.js';
+
+test('maps text delta line to chat.delta event', () => {
+  const evt = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: '{"type":"delta","text":"hi"}',
+  });
+
+  assert.equal(evt.event, 'chat.delta');
+  assert.equal(evt.payload.delta, 'hi');
+});
+
+test('maps final line to chat.final event', () => {
+  const evt = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: '{"type":"final","text":"done"}',
+  });
+
+  assert.equal(evt.event, 'chat.final');
+  assert.equal(evt.payload.text, 'done');
+});
+
+test('maps codex item.completed agent_message to chat.final', () => {
+  const evt = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: '{"type":"item.completed","item":{"type":"agent_message","text":"hello"}}',
+  });
+  assert.equal(evt.event, 'chat.final');
+  assert.equal(evt.payload.text, 'hello');
+});
+
+test('buildCodexExecArgs includes --model when session model is set', () => {
+  const args = buildCodexExecArgs({ prompt: 'hi', model: 'gpt-5' });
+  assert.deepEqual(args, ['exec', '--json', '--model', 'gpt-5', 'hi']);
+});
+
+test('buildCodexExecArgs omits --model when model is empty', () => {
+  const args = buildCodexExecArgs({ prompt: 'hi', model: '' });
+  assert.deepEqual(args, ['exec', '--json', 'hi']);
+});
+
+test('maps transient codex error line to non-fatal tool event', () => {
+  const evt = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: '{"type":"error","message":"Reconnecting... 2/5"}',
+  });
+
+  assert.equal(evt.event, 'tool.delta');
+  assert.equal(evt.payload.level, 'warning');
+  assert.match(evt.payload.message, /Reconnecting/);
+});

From 7a30164e9695385da045c70d6dddeec8e0bbfbaf Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Mon, 2 Mar 2026 20:18:47 +0530
Subject: [PATCH 107/192] fix(sidepanel): load codex model presets and reduce
 noisy duplicate session labels

---
 agent/src/chatd.js           | 76 +++++++++++++++++++++++++++++++++++-
 extension/agent-panel.js     | 56 ++++++++++++++++++++------
 test/agent/chatd-api.test.js | 19 +++++++++
 3 files changed, 138 insertions(+), 13 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 3e26f07..eb52cd7 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -1,7 +1,7 @@
 import http from 'node:http';
 import { randomBytes } from 'node:crypto';
 import { promises as fs } from 'node:fs';
-import { homedir } from 'node:os';
+import { homedir, tmpdir } from 'node:os';
 import { dirname, join } from 'node:path';
 import { fileURLToPath } from 'node:url';
 
@@ -12,6 +12,7 @@ import {
   appendMessage,
   createSession,
   getSession,
+  isValidModelId,
   isValidSessionId,
   listSessions,
   readMessages,
@@ -20,6 +21,65 @@ import {
 
 const BF_DIR = join(homedir(), '.browserforce');
 const CHATD_URL_PATH = join(BF_DIR, 'chatd-url.json');
+const CODEX_CONFIG_PATH = join(homedir(), '.codex', 'config.toml');
+
+function parseTopLevelTomlString(raw, key) {
+  const lines = String(raw || '').split(/\r?\n/);
+  for (const rawLine of lines) {
+    const line = rawLine.trim();
+    if (!line || line.startsWith('#')) continue;
+    if (line.startsWith('[')) break;
+
+    const doubleQuoted = line.match(new RegExp(`^${key}\\s*=\\s*\"([^\"]+)\"(?:\\s*#.*)?$`));
+    if (doubleQuoted) return doubleQuoted[1].trim();
+
+    const singleQuoted = line.match(new RegExp(`^${key}\\s*=\\s*'([^']+)'(?:\\s*#.*)?$`));
+    if (singleQuoted) return singleQuoted[1].trim();
+  }
+  return null;
+}
+
+async function resolveConfiguredModel() {
+  const envModel = String(process.env.BF_CHATD_DEFAULT_MODEL || '').trim();
+  if (envModel && isValidModelId(envModel)) return envModel;
+
+  try {
+    const raw = await fs.readFile(CODEX_CONFIG_PATH, 'utf8');
+    const model = parseTopLevelTomlString(raw, 'model');
+    if (model && isValidModelId(model)) return model;
+  } catch {
+    // no local codex config is fine
+  }
+  return null;
+}
+
+function dedupeModelRows(rows) {
+  const seen = new Set();
+  const out = [{ value: null, label: 'Default' }];
+  for (const row of rows) {
+    if (!row || typeof row.value !== 'string') continue;
+    const value = row.value.trim();
+    if (!value || seen.has(value) || !isValidModelId(value)) continue;
+    seen.add(value);
+    out.push({ value, label: row.label || value });
+  }
+  return out;
+}
+
+async function listModelPresets({ storageRoot } = {}) {
+  const configuredModel = await resolveConfiguredModel();
+  const sessions = await listSessions({ limit: 200, storageRoot });
+  const sessionRows = sessions
+    .map((session) => String(session?.model || '').trim())
+    .filter(Boolean)
+    .map((value) => ({ value, label: value }));
+
+  const configuredRow = configuredModel
+    ? [{ value: configuredModel, label: `${configuredModel} (Configured)` }]
+    : [];
+
+  return dedupeModelRows([...configuredRow, ...sessionRows]);
+}
 
 function nowIso() {
   return new Date().toISOString();
@@ -88,7 +148,10 @@ function createDefaultRunExecutor({ codexCwd } = {}) {
 
 export async function startChatd(opts = {}) {
   const writeChatdUrl = opts.writeChatdUrl !== false;
-  const storageRoot = opts.storageRoot;
+  const ephemeralStorageRoot = (!opts.storageRoot && !writeChatdUrl)
+    ? await fs.mkdtemp(join(tmpdir(), 'bf-chatd-'))
+    : null;
+  const storageRoot = opts.storageRoot || ephemeralStorageRoot;
   const token = opts.token || process.env.BF_CHATD_TOKEN || randomBytes(32).toString('base64url');
   const chatdUrlPath = opts.chatdUrlPath || process.env.BF_CHATD_URL_PATH || CHATD_URL_PATH;
   const runExecutor = opts.runExecutor || createDefaultRunExecutor({ codexCwd: opts.codexCwd || process.cwd() });
@@ -174,6 +237,12 @@ export async function startChatd(opts = {}) {
         return;
       }
 
+      if (url.pathname === '/v1/models' && req.method === 'GET') {
+        const models = await listModelPresets({ storageRoot });
+        json(res, 200, { models });
+        return;
+      }
+
       if (url.pathname === '/v1/sessions' && req.method === 'POST') {
         let body = {};
         try {
@@ -455,6 +524,9 @@ export async function startChatd(opts = {}) {
 
     await new Promise((resolve) => server.close(resolve));
     await clearChatdUrlFile({ writeChatdUrl, urlPath: chatdUrlPath });
+    if (ephemeralStorageRoot) {
+      await fs.rm(ephemeralStorageRoot, { recursive: true, force: true });
+    }
   };
 
   return {
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 9031732..b758618 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -6,17 +6,10 @@ import {
   shouldApplySessionSelection,
 } from './agent-panel-runtime.js';
 
-const MODEL_PRESETS = [
-  { value: null, label: 'Default' },
-  { value: 'gpt-5', label: 'GPT-5' },
-  { value: 'gpt-5-mini', label: 'GPT-5 Mini' },
-  { value: 'o3', label: 'o3' },
-  { value: 'o4-mini', label: 'o4-mini' },
-];
-
 const state = {
   value: initialState,
   auth: null,
+  modelPresets: [{ value: null, label: 'Default' }],
   currentRunBySession: {},
   eventController: null,
   eventLoopToken: 0,
@@ -91,7 +84,7 @@ function renderModelList() {
   const activeSession = getActiveSession();
   const activeModel = activeSession?.model || null;
 
-  const rows = MODEL_PRESETS.map((preset) => {
+  const rows = state.modelPresets.map((preset) => {
     const active = (preset.value || null) === activeModel ? 'active' : '';
     return `<li><button type="button" data-model="${escapeHtml(preset.value || '')}" class="${active}">${escapeHtml(preset.label)}</button></li>`;
   });
@@ -131,11 +124,18 @@ function renderSessions() {
     return;
   }
 
+  const titleCounts = new Map();
+  for (const session of sessions) {
+    const title = (session.title || '').trim() || session.sessionId;
+    titleCounts.set(title, (titleCounts.get(title) || 0) + 1);
+  }
+
   switchSessionListEl.innerHTML = sessions
     .map((session) => {
       const active = session.sessionId === state.value.activeSessionId ? 'active' : '';
       const title = session.title || session.sessionId;
-      return `<li><button type="button" data-session-id="${session.sessionId}" class="${active}">${escapeHtml(title)}</button></li>`;
+      const suffix = (titleCounts.get(title) || 0) > 1 ? ` · ${session.sessionId.slice(0, 8)}` : '';
+      return `<li><button type="button" data-session-id="${session.sessionId}" class="${active}">${escapeHtml(`${title}${suffix}`)}</button></li>`;
     })
     .join('');
 
@@ -211,7 +211,10 @@ async function getRelayHttpUrl() {
 
 async function loadAuth() {
   const relayHttpUrl = await getRelayHttpUrl();
-  const res = await fetch(`${relayHttpUrl}/chatd-url`);
+  const extensionId = chrome?.runtime?.id;
+  const res = await fetch(`${relayHttpUrl}/chatd-url`, {
+    headers: extensionId ? { 'x-browserforce-extension-id': extensionId } : {},
+  });
   if (!res.ok) throw new Error('daemon_unavailable');
   const body = await res.json();
   state.auth = {
@@ -258,6 +261,31 @@ async function loadSessions(preferredSessionId = null) {
   });
 }
 
+function normalizeModelRows(input) {
+  const source = Array.isArray(input) ? input : [];
+  const seen = new Set(['__default__']);
+  const rows = [{ value: null, label: 'Default' }];
+  for (const row of source) {
+    if (!row || typeof row !== 'object') continue;
+    const value = row.value == null ? null : String(row.value).trim();
+    const key = value || '__default__';
+    if (seen.has(key)) continue;
+    seen.add(key);
+    rows.push({
+      value,
+      label: row.label && String(row.label).trim() ? String(row.label).trim() : (value || 'Default'),
+    });
+  }
+  return rows;
+}
+
+async function loadModelPresets() {
+  const res = await api('/v1/models', { method: 'GET', headers: {} });
+  await ensureOk(res, 'Failed to load models');
+  const body = await readJsonOrEmpty(res);
+  state.modelPresets = normalizeModelRows(body.models);
+}
+
 async function loadMessages(sessionId) {
   const res = await api(`/v1/sessions/${encodeURIComponent(sessionId)}/messages?limit=200`, {
     method: 'GET',
@@ -308,6 +336,7 @@ async function updateActiveSessionModel(model) {
     throw new Error(body.error || 'Unable to update model');
   }
 
+  await loadModelPresets().catch(() => {});
   await loadSessions(sessionId);
   setPopover('none');
   setStatus('ready', 'Ready');
@@ -447,6 +476,11 @@ popoverBackdropEl.addEventListener('click', () => {
   try {
     setStatus('info', 'Connecting...');
     await loadAuth();
+    try {
+      await loadModelPresets();
+    } catch {
+      state.modelPresets = [{ value: null, label: 'Default' }];
+    }
     await loadSessions();
     if (!state.value.activeSessionId) {
       await createSession();
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 8d046a5..d2d558f 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -30,6 +30,25 @@ test('GET /health returns daemon metadata', async () => {
   }
 });
 
+test('GET /v1/models returns default and configured model', async () => {
+  const previous = process.env.BF_CHATD_DEFAULT_MODEL;
+  process.env.BF_CHATD_DEFAULT_MODEL = 'gpt-5.3-codex';
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const res = await fetch(`${daemon.baseUrl}/v1/models`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(res.status, 200);
+    const body = await res.json();
+    assert.deepEqual(body.models[0], { value: null, label: 'Default' });
+    assert.equal(body.models.some((row) => row.value === 'gpt-5.3-codex'), true);
+  } finally {
+    await daemon.stop();
+    if (previous == null) delete process.env.BF_CHATD_DEFAULT_MODEL;
+    else process.env.BF_CHATD_DEFAULT_MODEL = previous;
+  }
+});
+
 test('POST /v1/runs requires explicit sessionId', async () => {
   const daemon = await startChatd({ port: 0, writeChatdUrl: false });
   try {

From c00e26f25d1b390996cc99915537ee5f66ec12b9 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 11:21:11 +0530
Subject: [PATCH 108/192] feat(chatd): source sidepanel models from live codex
 app-server catalog

---
 agent/src/chatd.js           | 145 ++++++++++++++++++++++++++++++++++-
 test/agent/chatd-api.test.js |  34 +++++++-
 2 files changed, 172 insertions(+), 7 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index eb52cd7..070945c 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -1,4 +1,5 @@
 import http from 'node:http';
+import { spawn } from 'node:child_process';
 import { randomBytes } from 'node:crypto';
 import { promises as fs } from 'node:fs';
 import { homedir, tmpdir } from 'node:os';
@@ -22,6 +23,7 @@ import {
 const BF_DIR = join(homedir(), '.browserforce');
 const CHATD_URL_PATH = join(BF_DIR, 'chatd-url.json');
 const CODEX_CONFIG_PATH = join(homedir(), '.codex', 'config.toml');
+const MODEL_LIST_TIMEOUT_MS = 5000;
 
 function parseTopLevelTomlString(raw, key) {
   const lines = String(raw || '').split(/\r?\n/);
@@ -66,7 +68,138 @@ function dedupeModelRows(rows) {
   return out;
 }
 
-async function listModelPresets({ storageRoot } = {}) {
+function safeParseJsonLine(line) {
+  if (typeof line !== 'string') return null;
+  try {
+    return JSON.parse(line);
+  } catch {
+    return null;
+  }
+}
+
+function normalizeModelCatalogRows(models) {
+  return (Array.isArray(models) ? models : [])
+    .filter((row) => row && typeof row === 'object' && !row.hidden)
+    .map((row) => {
+      const value = String(row.model || row.id || '').trim();
+      const label = String(row.displayName || row.model || row.id || '').trim();
+      if (!value || !isValidModelId(value)) return null;
+      return { value, label: label || value };
+    })
+    .filter(Boolean);
+}
+
+async function fetchCodexModelCatalog({
+  command = process.env.BF_CHATD_CODEX_COMMAND || 'codex',
+  timeoutMs = MODEL_LIST_TIMEOUT_MS,
+} = {}) {
+  return new Promise((resolve, reject) => {
+    const child = spawn(command, ['app-server', '--listen', 'stdio://'], {
+      stdio: ['pipe', 'pipe', 'pipe'],
+      env: process.env,
+    });
+
+    let settled = false;
+    let stderrText = '';
+    let stdoutBuffer = '';
+
+    const finish = (error, models = []) => {
+      if (settled) return;
+      settled = true;
+      clearTimeout(timer);
+      try { child.kill('SIGTERM'); } catch {}
+      if (error) reject(error);
+      else resolve(models);
+    };
+
+    const timer = setTimeout(() => {
+      finish(new Error('Timed out while loading Codex models'));
+    }, timeoutMs);
+
+    child.stderr.setEncoding('utf8');
+    child.stderr.on('data', (chunk) => {
+      stderrText += String(chunk || '');
+    });
+
+    child.stdout.setEncoding('utf8');
+    child.stdout.on('data', (chunk) => {
+      stdoutBuffer += String(chunk || '');
+      let idx = stdoutBuffer.indexOf('\n');
+      while (idx !== -1) {
+        const line = stdoutBuffer.slice(0, idx).trim();
+        stdoutBuffer = stdoutBuffer.slice(idx + 1);
+        idx = stdoutBuffer.indexOf('\n');
+        if (!line) continue;
+
+        const msg = safeParseJsonLine(line);
+        if (!msg || typeof msg !== 'object') continue;
+
+        if (msg.id === 1 && msg.error) {
+          finish(new Error(msg.error?.message || 'Codex initialize failed'));
+          return;
+        }
+        if (msg.id === 1 && msg.result) {
+          try {
+            child.stdin.write(`${JSON.stringify({ jsonrpc: '2.0', method: 'initialized' })}\n`);
+            child.stdin.write(`${JSON.stringify({
+              jsonrpc: '2.0',
+              id: 2,
+              method: 'model/list',
+              params: { includeHidden: false, limit: 100 },
+            })}\n`);
+          } catch {
+            finish(new Error('Failed to request Codex model list'));
+          }
+          continue;
+        }
+
+        if (msg.id === 2 && msg.error) {
+          finish(new Error(msg.error?.message || 'Codex model/list failed'));
+          return;
+        }
+
+        if (msg.id === 2 && msg.result) {
+          finish(null, msg.result?.data || []);
+        }
+      }
+    });
+
+    child.on('error', (error) => {
+      finish(error);
+    });
+
+    child.on('exit', (code) => {
+      if (settled) return;
+      finish(new Error(`Codex app-server exited before model/list (${code ?? 'unknown'}) ${stderrText}`.trim()));
+    });
+
+    try {
+      child.stdin.write(`${JSON.stringify({
+        jsonrpc: '2.0',
+        id: 1,
+        method: 'initialize',
+        params: {
+          clientInfo: { name: 'browserforce-chatd', version: '1.0.0' },
+          capabilities: { experimentalApi: false },
+        },
+      })}\n`);
+    } catch {
+      finish(new Error('Failed to initialize Codex app-server'));
+    }
+  });
+}
+
+async function listModelPresets({ storageRoot, modelFetcher } = {}) {
+  let liveRows = [];
+  if (typeof modelFetcher === 'function') {
+    try {
+      const liveModels = await modelFetcher();
+      liveRows = normalizeModelCatalogRows(liveModels);
+    } catch {
+      liveRows = [];
+    }
+  }
+
   const configuredModel = await resolveConfiguredModel();
   const sessions = await listSessions({ limit: 200, storageRoot });
   const sessionRows = sessions
@@ -74,11 +207,11 @@ async function listModelPresets({ storageRoot } = {}) {
     .filter(Boolean)
     .map((value) => ({ value, label: value }));
 
-  const configuredRow = configuredModel
+  const configuredRow = configuredModel && !liveRows.some((row) => row.value === configuredModel)
     ? [{ value: configuredModel, label: `${configuredModel} (Configured)` }]
     : [];
 
-  return dedupeModelRows([...configuredRow, ...sessionRows]);
+  return dedupeModelRows([...liveRows, ...configuredRow, ...sessionRows]);
 }
 
 function nowIso() {
@@ -155,6 +288,10 @@ export async function startChatd(opts = {}) {
   const token = opts.token || process.env.BF_CHATD_TOKEN || randomBytes(32).toString('base64url');
   const chatdUrlPath = opts.chatdUrlPath || process.env.BF_CHATD_URL_PATH || CHATD_URL_PATH;
   const runExecutor = opts.runExecutor || createDefaultRunExecutor({ codexCwd: opts.codexCwd || process.cwd() });
+  const modelFetcher = opts.modelFetcher || (() => fetchCodexModelCatalog({
+    command: opts.codexCommand || process.env.BF_CHATD_CODEX_COMMAND || 'codex',
+    timeoutMs: Number(process.env.BF_CHATD_MODEL_LIST_TIMEOUT_MS || MODEL_LIST_TIMEOUT_MS),
+  }));
 
   let desiredPort = Number.isFinite(opts.port) ? Number(opts.port) : Number(process.env.BF_CHATD_PORT || 0);
   if (!Number.isInteger(desiredPort) || desiredPort < 0) desiredPort = 0;
@@ -238,7 +375,7 @@ export async function startChatd(opts = {}) {
       }
 
       if (url.pathname === '/v1/models' && req.method === 'GET') {
-        const models = await listModelPresets({ storageRoot });
+        const models = await listModelPresets({ storageRoot, modelFetcher });
         json(res, 200, { models });
         return;
       }
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index d2d558f..577fef8 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -30,17 +30,45 @@ test('GET /health returns daemon metadata', async () => {
   }
 });
 
-test('GET /v1/models returns default and configured model', async () => {
+test('GET /v1/models returns Codex live model list from model fetcher', async () => {
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    modelFetcher: async () => ([
+      { model: 'gpt-5.3-codex', displayName: 'GPT-5.3 Codex', isDefault: true, hidden: false },
+      { model: 'gpt-5.1-codex-mini', displayName: 'GPT-5.1 Codex Mini', isDefault: false, hidden: false },
+    ]),
+  });
+  try {
+    const res = await fetch(`${daemon.baseUrl}/v1/models`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(res.status, 200);
+    const body = await res.json();
+    assert.deepEqual(body.models[0], { value: null, label: 'Default' });
+    assert.deepEqual(body.models[1], { value: 'gpt-5.3-codex', label: 'GPT-5.3 Codex' });
+    assert.deepEqual(body.models[2], { value: 'gpt-5.1-codex-mini', label: 'GPT-5.1 Codex Mini' });
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('GET /v1/models falls back to configured model when model fetcher fails', async () => {
   const previous = process.env.BF_CHATD_DEFAULT_MODEL;
   process.env.BF_CHATD_DEFAULT_MODEL = 'gpt-5.3-codex';
-  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    modelFetcher: async () => {
+      throw new Error('model list unavailable');
+    },
+  });
   try {
     const res = await fetch(`${daemon.baseUrl}/v1/models`, {
       headers: { authorization: `Bearer ${daemon.token}` },
     });
     assert.equal(res.status, 200);
     const body = await res.json();
-    assert.deepEqual(body.models[0], { value: null, label: 'Default' });
     assert.equal(body.models.some((row) => row.value === 'gpt-5.3-codex'), true);
   } finally {
     await daemon.stop();

From 72f1e023b7a07bae6d57cb025f30485be041e2ae Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 11:29:03 +0530
Subject: [PATCH 109/192] feat(sidepanel): add claude-style transcript layout
 and expandable run steps timeline

---
 extension/agent-panel-state.js | 224 ++++++++++++++++++++++++
 extension/agent-panel.css      | 301 +++++++++++++++++++++++++++++++++
 extension/agent-panel.js       |  56 +++++-
 test/agent/sse-events.test.js  |  56 ++++++
 4 files changed, 634 insertions(+), 3 deletions(-)
 create mode 100644 extension/agent-panel-state.js
 create mode 100644 extension/agent-panel.css
 create mode 100644 test/agent/sse-events.test.js

diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
new file mode 100644
index 0000000..eab2998
--- /dev/null
+++ b/extension/agent-panel-state.js
@@ -0,0 +1,224 @@
+export const initialState = {
+  sessions: [],
+  activeSessionId: null,
+  messagesBySession: {},
+  runs: {},
+};
+
+function firstString(values) {
+  for (const value of values) {
+    if (typeof value === 'string' && value.trim()) return value.trim();
+  }
+  return '';
+}
+
+function trimStepLabel(label) {
+  const text = String(label || '').trim();
+  if (!text) return '';
+  return text.length > 160 ? `${text.slice(0, 157)}...` : text;
+}
+
+function pushStep(run, step) {
+  const steps = Array.isArray(run?.steps) ? run.steps.slice() : [];
+  const normalized = {
+    kind: step.kind || 'reasoning',
+    status: step.status || 'running',
+    label: trimStepLabel(step.label),
+  };
+  if (!normalized.label) return steps;
+  const last = steps[steps.length - 1];
+  if (last && last.label === normalized.label && last.kind === normalized.kind && last.status === normalized.status) {
+    return steps;
+  }
+  steps.push(normalized);
+  if (steps.length > 100) steps.shift();
+  return steps;
+}
+
+function stepLabelForToolEvent(evt) {
+  const payload = evt?.payload || {};
+  if (evt.event === 'tool.started') {
+    return firstString([
+      payload.title,
+      payload.name,
+      payload.tool,
+      payload.toolName,
+      payload.command,
+    ]) || 'Tool call started';
+  }
+  if (evt.event === 'tool.final') {
+    return firstString([
+      payload.title,
+      payload.name,
+      payload.tool,
+      payload.toolName,
+      payload.command,
+    ]) || 'Tool call completed';
+  }
+  if (evt.event === 'tool.delta') {
+    return firstString([
+      payload.text,
+      payload.message,
+      payload.delta,
+      payload.command,
+      payload.name,
+      payload.tool,
+      payload.toolName,
+      payload.type === 'reasoning' ? 'Reasoning' : '',
+    ]) || 'Working...';
+  }
+  return '';
+}
+
+function upsertRun(state, runId, patch) {
+  return {
+    ...state.runs,
+    [runId]: {
+      ...(state.runs[runId] || { runId, text: '', done: false, steps: [] }),
+      ...patch,
+    },
+  };
+}
+
+export function reduceState(state = initialState, action = {}) {
+  if (action.type === 'session.list.loaded') {
+    const sessions = Array.isArray(action.sessions) ? action.sessions : [];
+    const activeSessionId = action.activeSessionId
+      || state.activeSessionId
+      || sessions[0]?.sessionId
+      || null;
+    return {
+      ...state,
+      sessions,
+      activeSessionId,
+    };
+  }
+
+  if (action.type === 'session.selected') {
+    return {
+      ...state,
+      activeSessionId: action.sessionId,
+    };
+  }
+
+  if (action.type === 'messages.loaded') {
+    return {
+      ...state,
+      messagesBySession: {
+        ...state.messagesBySession,
+        [action.sessionId]: Array.isArray(action.messages) ? action.messages : [],
+      },
+    };
+  }
+
+  return state;
+}
+
+export function applyEvent(state = initialState, evt = {}) {
+  if (!evt || !evt.event) return state;
+
+  if (evt.event === 'run.started') {
+    return {
+      ...state,
+      runs: upsertRun(state, evt.runId, {
+        sessionId: evt.sessionId,
+        text: '',
+        done: false,
+        error: null,
+        steps: [],
+      }),
+    };
+  }
+
+  if (evt.event === 'chat.delta') {
+    const run = state.runs[evt.runId] || { text: '', done: false };
+    const delta = evt.payload?.delta || '';
+    return {
+      ...state,
+      runs: upsertRun(state, evt.runId, {
+        sessionId: evt.sessionId,
+        text: `${run.text || ''}${delta}`,
+      }),
+    };
+  }
+
+  if (evt.event === 'chat.final') {
+    const finalText = evt.payload?.text || state.runs[evt.runId]?.text || '';
+    const currentMessages = state.messagesBySession[evt.sessionId] || [];
+    const hasStoredFinal = currentMessages.some(
+      (message) => message.runId === evt.runId && message.role === 'assistant',
+    );
+    const nextMessages = (!hasStoredFinal && finalText)
+      ? [...currentMessages, { role: 'assistant', text: finalText, runId: evt.runId }]
+      : currentMessages;
+
+    return {
+      ...state,
+      messagesBySession: {
+        ...state.messagesBySession,
+        [evt.sessionId]: nextMessages,
+      },
+      runs: upsertRun(state, evt.runId, {
+        sessionId: evt.sessionId,
+        text: finalText,
+        done: true,
+      }),
+    };
+  }
+
+  if (evt.event === 'run.error') {
+    const run = state.runs[evt.runId] || { steps: [] };
+    const error = evt.payload?.error || 'Unknown error';
+    return {
+      ...state,
+      runs: upsertRun(state, evt.runId, {
+        sessionId: evt.sessionId,
+        done: true,
+        error,
+        steps: pushStep(run, {
+          kind: 'status',
+          status: 'failed',
+          label: `Failed: ${error}`,
+        }),
+      }),
+    };
+  }
+
+  if (evt.event === 'run.aborted') {
+    const run = state.runs[evt.runId] || { steps: [] };
+    return {
+      ...state,
+      runs: upsertRun(state, evt.runId, {
+        sessionId: evt.sessionId,
+        done: true,
+        aborted: true,
+        steps: pushStep(run, {
+          kind: 'status',
+          status: 'aborted',
+          label: 'Stopped',
+        }),
+      }),
+    };
+  }
+
+  if (evt.event === 'tool.started' || evt.event === 'tool.delta' || evt.event === 'tool.final') {
+    const run = state.runs[evt.runId] || { text: '', done: false, steps: [] };
+    const status = evt.event === 'tool.final'
+      ? 'done'
+      : 'running';
+    const kind = evt.event === 'tool.delta'
+      ? 'reasoning'
+      : 'tool';
+    const label = stepLabelForToolEvent(evt);
+    return {
+      ...state,
+      runs: upsertRun(state, evt.runId, {
+        sessionId: evt.sessionId,
+        done: false,
+        steps: pushStep(run, { kind, status, label }),
+      }),
+    };
+  }
+
+  return state;
+}
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
new file mode 100644
index 0000000..a4cf795
--- /dev/null
+++ b/extension/agent-panel.css
@@ -0,0 +1,301 @@
+:root {
+  color-scheme: light;
+  --panel-bg: linear-gradient(180deg, #f8f9fc 0%, #eef2f8 100%);
+  --card-bg: #ffffff;
+  --line: #d7ddea;
+  --text: #1f2430;
+  --muted: #5f6878;
+  --accent: #2f7bf6;
+  --menu-bg: rgba(24, 28, 36, 0.94);
+  --menu-line: rgba(255, 255, 255, 0.12);
+  --menu-text: #f4f6fb;
+}
+
+* {
+  box-sizing: border-box;
+}
+
+body {
+  margin: 0;
+  font-family: ui-sans-serif, -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif;
+  color: var(--text);
+  background: var(--panel-bg);
+}
+
+.agent-shell {
+  height: 100vh;
+  min-height: 0;
+  display: grid;
+  grid-template-rows: auto 1fr;
+  position: relative;
+}
+
+.agent-header {
+  padding: 10px 12px 8px;
+  border-bottom: 1px solid var(--line);
+  background: linear-gradient(180deg, #fff, #f9fbff);
+}
+
+.header-row {
+  display: flex;
+  align-items: center;
+  gap: 8px;
+}
+
+.selector-pill {
+  flex: 1;
+  min-height: 30px;
+  border: 1px solid var(--line);
+  border-radius: 999px;
+  background: #fff;
+  color: #2b3242;
+  font-size: 12px;
+  text-align: left;
+  padding: 0 10px;
+  white-space: nowrap;
+  overflow: hidden;
+  text-overflow: ellipsis;
+}
+
+.icon-btn {
+  min-width: 32px;
+  min-height: 30px;
+  border-radius: 8px;
+  border: 1px solid var(--line);
+  background: #fff;
+  color: #2d3342;
+  font-size: 20px;
+  line-height: 1;
+  padding: 0;
+}
+
+.status {
+  margin: 8px 0 0;
+  font-size: 12px;
+  color: var(--muted);
+  display: flex;
+  align-items: center;
+  gap: 6px;
+}
+
+.status-icon {
+  font-size: 10px;
+  color: #22a06b;
+}
+
+.status.error {
+  color: #8f2836;
+}
+
+.status.error .status-icon {
+  color: #d14357;
+}
+
+.chat {
+  min-height: 0;
+  display: grid;
+  grid-template-rows: 1fr auto;
+}
+
+.transcript {
+  overflow: auto;
+  padding: 14px;
+}
+
+.message-row {
+  display: flex;
+  flex-direction: column;
+  margin-bottom: 10px;
+  gap: 6px;
+}
+
+.message-row.user {
+  align-items: flex-end;
+}
+
+.message-row.assistant {
+  align-items: flex-start;
+}
+
+.message {
+  font-size: 13px;
+  white-space: pre-wrap;
+  line-height: 1.45;
+}
+
+.message.user {
+  background: #10131a;
+  color: #f3f6fc;
+  border: 1px solid #0c1018;
+  border-radius: 16px;
+  padding: 10px 14px;
+  max-width: min(85%, 520px);
+  box-shadow: 0 4px 12px rgba(0, 0, 0, 0.18);
+}
+
+.message.assistant {
+  background: transparent;
+  border: 0;
+  color: #1f2430;
+  padding: 0;
+  max-width: 100%;
+}
+
+.run-steps-summary {
+  display: inline-flex;
+  flex-direction: column;
+  align-items: flex-start;
+  gap: 8px;
+}
+
+.run-steps-trigger {
+  all: unset;
+  cursor: pointer;
+  color: #646f84;
+  font-size: 13px;
+  font-weight: 500;
+}
+
+.run-steps-trigger:hover {
+  color: #2f7bf6;
+}
+
+.run-steps-list {
+  list-style: none;
+  margin: 0;
+  padding: 0 0 0 10px;
+  border-left: 1px solid #d8dfed;
+  display: grid;
+  gap: 8px;
+}
+
+.run-step {
+  display: flex;
+  align-items: flex-start;
+  gap: 8px;
+  color: #4e586d;
+}
+
+.run-step-label {
+  white-space: pre-wrap;
+}
+
+.run-step-marker {
+  width: 8px;
+  height: 8px;
+  margin-top: 6px;
+  border-radius: 999px;
+  background: #96a1b8;
+  flex: 0 0 8px;
+}
+
+.run-step.tool .run-step-marker {
+  background: #2f7bf6;
+}
+
+.run-step.done .run-step-marker {
+  background: #1f9d63;
+}
+
+.run-step.failed .run-step-marker {
+  background: #d14357;
+}
+
+.composer {
+  border-top: 1px solid var(--line);
+  padding: 10px;
+  display: grid;
+  gap: 8px;
+  background: #fff;
+}
+
+.composer textarea {
+  width: 100%;
+  min-height: 72px;
+  max-height: 160px;
+  resize: vertical;
+  padding: 10px;
+  border: 1px solid var(--line);
+  border-radius: 8px;
+  font: inherit;
+}
+
+.composer-actions {
+  display: flex;
+  justify-content: flex-end;
+  gap: 8px;
+}
+
+button {
+  border: 0;
+  border-radius: 8px;
+  background: var(--accent);
+  color: #fff;
+  padding: 8px 12px;
+  cursor: pointer;
+}
+
+button.secondary {
+  background: #e7edf8;
+  color: #1c3f7e;
+}
+
+.menu-backdrop {
+  position: absolute;
+  inset: 0;
+  background: rgba(0, 0, 0, 0.15);
+  z-index: 20;
+}
+
+.popover-panel {
+  position: absolute;
+  top: 52px;
+  left: 12px;
+  right: 12px;
+  border-radius: 14px;
+  background: var(--menu-bg);
+  border: 1px solid var(--menu-line);
+  box-shadow: 0 18px 36px rgba(0, 0, 0, 0.28);
+  backdrop-filter: blur(14px);
+  z-index: 21;
+  max-height: min(360px, calc(100vh - 70px));
+  overflow: auto;
+}
+
+.popover-list {
+  list-style: none;
+  margin: 0;
+  padding: 8px;
+}
+
+.popover-list button {
+  width: 100%;
+  text-align: left;
+  border: 0;
+  background: transparent;
+  color: var(--menu-text);
+  border-radius: 10px;
+  padding: 10px 12px;
+  margin: 1px 0;
+  font-size: 14px;
+}
+
+.popover-list button.active,
+.popover-list button:hover {
+  background: rgba(255, 255, 255, 0.11);
+}
+
+.popover-list .hint {
+  font-size: 12px;
+  color: rgba(255, 255, 255, 0.66);
+  margin-top: 4px;
+}
+
+.popover-list .empty-item {
+  color: rgba(255, 255, 255, 0.75);
+  padding: 10px 12px;
+}
+
+.hidden {
+  display: none;
+}
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index b758618..8425d18 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -11,6 +11,7 @@ const state = {
   auth: null,
   modelPresets: [{ value: null, label: 'Default' }],
   currentRunBySession: {},
+  expandedRunSteps: {},
   eventController: null,
   eventLoopToken: 0,
   sessionSelectionToken: 0,
@@ -147,6 +148,48 @@ function renderSessions() {
   });
 }
 
+function isRunStepsExpanded(runId) {
+  return !!state.expandedRunSteps?.[runId];
+}
+
+function toggleRunSteps(runId) {
+  if (!runId) return;
+  state.expandedRunSteps = {
+    ...(state.expandedRunSteps || {}),
+    [runId]: !isRunStepsExpanded(runId),
+  };
+  renderTranscript();
+}
+
+function renderRunSteps(runId, run) {
+  if (!runId || !run || !Array.isArray(run.steps) || run.steps.length === 0) return '';
+  const count = run.steps.length;
+  const expanded = isRunStepsExpanded(runId);
+  const summary = `<button type="button" class="run-steps-trigger" data-run-steps-toggle="${escapeHtml(runId)}">${count} step${count === 1 ? '' : 's'}</button>`;
+  if (!expanded) {
+    return `<div class="run-steps-summary">${summary}</div>`;
+  }
+
+  const items = run.steps
+    .map((step) => {
+      const kind = step?.kind || 'reasoning';
+      const status = step?.status || 'running';
+      const label = step?.label || 'Step';
+      return `<li class="run-step ${escapeHtml(kind)} ${escapeHtml(status)}"><span class="run-step-marker" aria-hidden="true"></span><span class="run-step-label">${escapeHtml(label)}</span></li>`;
+    })
+    .join('');
+
+  return `<div class="run-steps-summary expanded">${summary}<ol class="run-steps-list">${items}</ol></div>`;
+}
+
+function bindTranscriptHandlers() {
+  transcriptEl.querySelectorAll('button[data-run-steps-toggle]').forEach((button) => {
+    button.addEventListener('click', () => {
+      toggleRunSteps(button.getAttribute('data-run-steps-toggle'));
+    });
+  });
+}
+
 function renderTranscript() {
   const messages = getActiveMessages();
   const sessionId = state.value.activeSessionId;
@@ -155,14 +198,21 @@ function renderTranscript() {
 
   const chunks = messages.map((msg) => {
     const role = msg.role || 'assistant';
-    return `<article class="message ${role}">${escapeHtml(msg.text || '')}</article>`;
+    if (role === 'user') {
+      return `<article class="message-row user"><div class="message user">${escapeHtml(msg.text || '')}</div></article>`;
+    }
+
+    const messageRun = msg.runId ? state.value.runs[msg.runId] : null;
+    return `<article class="message-row assistant">${renderRunSteps(msg.runId, messageRun)}<div class="message assistant">${escapeHtml(msg.text || '')}</div></article>`;
   });
 
   if (run && !run.done) {
-    chunks.push(`<article class="message assistant">${escapeHtml(run.text || '')}</article>`);
+    const liveText = run.text ? `<div class="message assistant">${escapeHtml(run.text || '')}</div>` : '';
+    chunks.push(`<article class="message-row assistant">${renderRunSteps(sessionRunId, run)}${liveText}</article>`);
   }
 
-  transcriptEl.innerHTML = chunks.join('') || '<article class="message assistant">No messages yet.</article>';
+  transcriptEl.innerHTML = chunks.join('') || '<article class="message-row assistant"><div class="message assistant">No messages yet.</div></article>';
+  bindTranscriptHandlers();
   transcriptEl.scrollTop = transcriptEl.scrollHeight;
 }
 
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
new file mode 100644
index 0000000..1e91059
--- /dev/null
+++ b/test/agent/sse-events.test.js
@@ -0,0 +1,56 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import { applyEvent } from '../../extension/agent-panel-state.js';
+
+const baseState = {
+  sessions: [],
+  activeSessionId: null,
+  messagesBySession: {},
+  runs: {},
+};
+
+test('chat.delta appends to in-flight run text', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const next = applyEvent(s1, { event: 'chat.delta', runId: 'r1', sessionId: 's1', payload: { delta: 'Hi' } });
+  assert.equal(next.runs.r1.text, 'Hi');
+});
+
+test('chat.final finalizes run output', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const next = applyEvent(s1, { event: 'chat.final', runId: 'r1', sessionId: 's1', payload: { text: 'Done' } });
+  assert.equal(next.runs.r1.done, true);
+  assert.equal(next.runs.r1.text, 'Done');
+  assert.equal(next.messagesBySession.s1.at(-1).text, 'Done');
+});
+
+test('run.aborted marks run terminal', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const next = applyEvent(s1, { event: 'run.aborted', runId: 'r1', sessionId: 's1', payload: {} });
+  assert.equal(next.runs.r1.done, true);
+  assert.equal(next.runs.r1.aborted, true);
+});
+
+test('tool and reasoning events are tracked as steps', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, { event: 'tool.started', runId: 'r1', sessionId: 's1', payload: { tool: 'fetch' } });
+  const s3 = applyEvent(s2, {
+    event: 'tool.delta',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: { type: 'reasoning', text: 'Planning the next action' },
+  });
+  const s4 = applyEvent(s3, { event: 'tool.final', runId: 'r1', sessionId: 's1', payload: { tool: 'fetch' } });
+
+  assert.equal(Array.isArray(s4.runs.r1.steps), true);
+  assert.equal(s4.runs.r1.steps.length, 3);
+  assert.match(s4.runs.r1.steps[0].label, /fetch/i);
+  assert.match(s4.runs.r1.steps[1].label, /Planning/);
+});
+
+test('run.error appends a final failed step', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, { event: 'run.error', runId: 'r1', sessionId: 's1', payload: { error: 'boom' } });
+  const last = s2.runs.r1.steps.at(-1);
+  assert.equal(last.status, 'failed');
+  assert.match(last.label, /boom/);
+});

From 7844b21a06ddc744dc98e2651749183cb2b6bf58 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 11:31:55 +0530
Subject: [PATCH 110/192] feat(sidepanel): render step icons for reasoning and
 tool call timeline

---
 extension/agent-panel-runtime.js       |  45 +++++++++
 extension/agent-panel.css              | 133 +++++++++++++++++++++++--
 extension/agent-panel.js               |   4 +-
 test/agent/agent-panel-runtime.test.js |  50 ++++++++++
 4 files changed, 221 insertions(+), 11 deletions(-)
 create mode 100644 extension/agent-panel-runtime.js
 create mode 100644 test/agent/agent-panel-runtime.test.js

diff --git a/extension/agent-panel-runtime.js b/extension/agent-panel-runtime.js
new file mode 100644
index 0000000..63fe634
--- /dev/null
+++ b/extension/agent-panel-runtime.js
@@ -0,0 +1,45 @@
+export function getSessionRunId(currentRunBySession, sessionId) {
+  if (!sessionId) return null;
+  return currentRunBySession?.[sessionId] || null;
+}
+
+export function assignSessionRunId(currentRunBySession, sessionId, runId) {
+  if (!sessionId || !runId) return currentRunBySession || {};
+  return {
+    ...(currentRunBySession || {}),
+    [sessionId]: runId,
+  };
+}
+
+export function clearSessionRunId(currentRunBySession, sessionId, runId) {
+  if (!sessionId) return currentRunBySession || {};
+  const next = { ...(currentRunBySession || {}) };
+  if (!runId || next[sessionId] === runId) {
+    delete next[sessionId];
+  }
+  return next;
+}
+
+export function shouldApplySessionSelection({ requestToken, latestRequestToken, requestedSessionId, activeSessionId }) {
+  return (
+    requestToken === latestRequestToken
+    && requestedSessionId === activeSessionId
+  );
+}
+
+export function classifyRunStepIcon(step = {}) {
+  const status = String(step.status || '').toLowerCase();
+  if (status === 'failed') return 'failed';
+  if (status === 'done' || /\bdone\b/.test(String(step.label || '').toLowerCase())) return 'done';
+
+  const label = String(step.label || '').toLowerCase();
+  const kind = String(step.kind || '').toLowerCase();
+
+  if (kind === 'reasoning') return 'reasoning';
+
+  if (/screenshot|screen shot|capture|image/.test(label)) return 'camera';
+  if (/extract|read|open|search|scan|inspect|lookup|page text|document/.test(label)) return 'view';
+  if (/plan|steps|todo|checklist/.test(label)) return 'plan';
+  if (kind === 'tool') return 'tool';
+  return 'reasoning';
+}
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index a4cf795..095ccb9 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -180,25 +180,138 @@ body {
   white-space: pre-wrap;
 }
 
-.run-step-marker {
+.run-step-icon {
+  width: 16px;
+  height: 16px;
+  margin-top: 2px;
+  flex: 0 0 16px;
+  color: #97a2b8;
+  position: relative;
+}
+
+.run-step-icon::before,
+.run-step-icon::after {
+  content: '';
+  position: absolute;
+}
+
+.run-step-icon.icon-reasoning::before {
+  top: 4px;
+  left: 4px;
   width: 8px;
   height: 8px;
-  margin-top: 6px;
   border-radius: 999px;
-  background: #96a1b8;
-  flex: 0 0 8px;
+  background: currentColor;
+}
+
+.run-step-icon.icon-tool::before {
+  top: 2px;
+  left: 2px;
+  width: 12px;
+  height: 12px;
+  border: 1.5px solid currentColor;
+  border-radius: 3px;
+}
+
+.run-step-icon.icon-view::before {
+  top: 5px;
+  left: 1px;
+  width: 14px;
+  height: 8px;
+  border: 1.5px solid currentColor;
+  border-radius: 10px;
+}
+
+.run-step-icon.icon-view::after {
+  top: 7px;
+  left: 6px;
+  width: 4px;
+  height: 4px;
+  border: 1.5px solid currentColor;
+  border-radius: 999px;
 }
 
-.run-step.tool .run-step-marker {
-  background: #2f7bf6;
+.run-step-icon.icon-camera::before {
+  top: 5px;
+  left: 1px;
+  width: 14px;
+  height: 9px;
+  border: 1.5px solid currentColor;
+  border-radius: 3px;
 }
 
-.run-step.done .run-step-marker {
-  background: #1f9d63;
+.run-step-icon.icon-camera::after {
+  top: 1px;
+  left: 4px;
+  width: 6px;
+  height: 4px;
+  border: 1.5px solid currentColor;
+  border-bottom: 0;
+  border-radius: 2px 2px 0 0;
 }
 
-.run-step.failed .run-step-marker {
-  background: #d14357;
+.run-step-icon.icon-plan::before {
+  top: 3px;
+  left: 3px;
+  width: 2px;
+  height: 2px;
+  border-radius: 999px;
+  background: currentColor;
+  box-shadow: 0 4px 0 currentColor, 0 8px 0 currentColor;
+}
+
+.run-step-icon.icon-plan::after {
+  top: 3px;
+  left: 7px;
+  width: 7px;
+  height: 2px;
+  border-radius: 2px;
+  background: currentColor;
+  box-shadow: 0 4px 0 currentColor, 0 8px 0 currentColor;
+}
+
+.run-step-icon.icon-done::before {
+  top: 1px;
+  left: 1px;
+  width: 12px;
+  height: 12px;
+  border: 1.5px solid currentColor;
+  border-radius: 999px;
+}
+
+.run-step-icon.icon-done::after {
+  top: 6px;
+  left: 5px;
+  width: 6px;
+  height: 3px;
+  border-left: 1.5px solid currentColor;
+  border-bottom: 1.5px solid currentColor;
+  transform: rotate(-45deg);
+}
+
+.run-step-icon.icon-failed::before {
+  top: 1px;
+  left: 1px;
+  width: 12px;
+  height: 12px;
+  border: 1.5px solid currentColor;
+  border-radius: 999px;
+}
+
+.run-step-icon.icon-failed::after {
+  top: 7px;
+  left: 4px;
+  width: 8px;
+  height: 1.5px;
+  background: currentColor;
+}
+
+.run-step.done .run-step-icon {
+  color: #1f9d63;
+}
+
+.run-step.failed .run-step-icon {
+  color: #d14357;
 }
 
 .composer {
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 8425d18..f8ded00 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -1,6 +1,7 @@
 import { applyEvent, initialState, reduceState } from './agent-panel-state.js';
 import {
   assignSessionRunId,
+  classifyRunStepIcon,
   clearSessionRunId,
   getSessionRunId,
   shouldApplySessionSelection,
@@ -175,7 +176,8 @@ function renderRunSteps(runId, run) {
       const kind = step?.kind || 'reasoning';
       const status = step?.status || 'running';
       const label = step?.label || 'Step';
-      return `<li class="run-step ${escapeHtml(kind)} ${escapeHtml(status)}"><span class="run-step-marker" aria-hidden="true"></span><span class="run-step-label">${escapeHtml(label)}</span></li>`;
+      const icon = classifyRunStepIcon(step);
+      return `<li class="run-step ${escapeHtml(kind)} ${escapeHtml(status)}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="run-step-label">${escapeHtml(label)}</span></li>`;
     })
     .join('');
 
diff --git a/test/agent/agent-panel-runtime.test.js b/test/agent/agent-panel-runtime.test.js
new file mode 100644
index 0000000..22fb8f5
--- /dev/null
+++ b/test/agent/agent-panel-runtime.test.js
@@ -0,0 +1,50 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import {
+  assignSessionRunId,
+  classifyRunStepIcon,
+  clearSessionRunId,
+  getSessionRunId,
+  shouldApplySessionSelection,
+} from '../../extension/agent-panel-runtime.js';
+
+test('run ids are scoped per session', () => {
+  let mapping = {};
+  mapping = assignSessionRunId(mapping, 's1', 'r1');
+  mapping = assignSessionRunId(mapping, 's2', 'r2');
+
+  assert.equal(getSessionRunId(mapping, 's1'), 'r1');
+  assert.equal(getSessionRunId(mapping, 's2'), 'r2');
+  assert.equal(getSessionRunId(mapping, 's3'), null);
+
+  mapping = clearSessionRunId(mapping, 's1', 'r1');
+  assert.equal(getSessionRunId(mapping, 's1'), null);
+  assert.equal(getSessionRunId(mapping, 's2'), 'r2');
+});
+
+test('stale selection requests are rejected after async load', () => {
+  const stale = shouldApplySessionSelection({
+    requestToken: 1,
+    latestRequestToken: 2,
+    requestedSessionId: 's1',
+    activeSessionId: 's2',
+  });
+  assert.equal(stale, false);
+
+  const current = shouldApplySessionSelection({
+    requestToken: 2,
+    latestRequestToken: 2,
+    requestedSessionId: 's2',
+    activeSessionId: 's2',
+  });
+  assert.equal(current, true);
+});
+
+test('classifies step icons from reasoning/tool labels', () => {
+  assert.equal(classifyRunStepIcon({ kind: 'reasoning', label: 'Let me create a plan first' }), 'reasoning');
+  assert.equal(classifyRunStepIcon({ kind: 'tool', label: 'Extract page text' }), 'view');
+  assert.equal(classifyRunStepIcon({ kind: 'tool', label: 'Take screenshot' }), 'camera');
+  assert.equal(classifyRunStepIcon({ kind: 'tool', label: 'Created a plan' }), 'plan');
+  assert.equal(classifyRunStepIcon({ kind: 'status', status: 'done', label: 'Done' }), 'done');
+  assert.equal(classifyRunStepIcon({ kind: 'status', status: 'failed', label: 'Failed' }), 'failed');
+});

From 20d977d3794d7cebd4682a78e21e6b5d9aa1cd2c Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 11:41:23 +0530
Subject: [PATCH 111/192] agent: auto-attach active tab and include tab context
 in runs

---
 agent/src/chatd.js                           | 43 ++++++++++++-
 extension/agent-panel.js                     | 68 +++++++++++++++++++-
 test/agent/agent-panel-send-contract.test.js |  8 +++
 test/agent/chatd-api.test.js                 | 49 ++++++++++++++
 4 files changed, 165 insertions(+), 3 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 070945c..cb52d94 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -232,6 +232,38 @@ function safeDecodeComponent(value) {
   }
 }
 
+function sanitizeContextText(value, maxLen = 320) {
+  if (value == null) return '';
+  const normalized = String(value).replace(/\s+/g, ' ').trim();
+  if (!normalized) return '';
+  return normalized.length > maxLen ? `${normalized.slice(0, maxLen - 3)}...` : normalized;
+}
+
+function normalizeBrowserContext(raw) {
+  if (!raw || typeof raw !== 'object') return null;
+  const tabId = Number.isInteger(raw.tabId) ? raw.tabId : null;
+  const title = sanitizeContextText(raw.title, 180);
+  const url = sanitizeContextText(raw.url, 500);
+  if (tabId == null && !title && !url) return null;
+  return { tabId, title, url };
+}
+
+function buildRunPrompt({ message, browserContext }) {
+  if (!browserContext) return message;
+
+  const lines = [
+    'BrowserForce active tab context:',
+  ];
+  if (browserContext.tabId != null) lines.push(`- Active tab id: ${browserContext.tabId}`);
+  if (browserContext.title) lines.push(`- Active tab title: ${browserContext.title}`);
+  if (browserContext.url) lines.push(`- Active tab URL: ${browserContext.url}`);
+  lines.push('Assume the user is referring to this active tab unless they explicitly say otherwise.');
+  lines.push('If the request is ambiguous or you are not sure, ask the user a clarifying question before acting.');
+  lines.push('');
+  lines.push(`User request: ${message}`);
+  return lines.join('\n');
+}
+
 async function readJsonBody(req) {
   const chunks = [];
   for await (const chunk of req) chunks.push(chunk);
@@ -511,6 +543,8 @@ export async function startChatd(opts = {}) {
           json(res, 404, { error: 'Session not found' });
           return;
         }
+        const browserContext = normalizeBrowserContext(body?.browserContext);
+        const promptMessage = buildRunPrompt({ message, browserContext });
 
         const runId = randomBytes(12).toString('base64url');
         const run = {
@@ -534,7 +568,7 @@ export async function startChatd(opts = {}) {
           const handle = runExecutor({
             runId,
             sessionId,
-            message,
+            message: promptMessage,
             model: session.model || null,
             onEvent: (evt) => {
               enqueue(async () => {
@@ -597,7 +631,12 @@ export async function startChatd(opts = {}) {
           });
 
           run.abort = handle?.abort || null;
-          broadcast(buildEvent({ event: 'run.started', runId, sessionId, payload: { message, model: session.model || null } }));
+          broadcast(buildEvent({
+            event: 'run.started',
+            runId,
+            sessionId,
+            payload: { message, model: session.model || null, browserContext },
+          }));
           json(res, 202, { ok: true, runId, sessionId });
         } catch (error) {
           runs.delete(runId);
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index f8ded00..b6c6d34 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -253,6 +253,68 @@ function sleep(ms) {
   return new Promise((resolve) => setTimeout(resolve, ms));
 }
 
+function runtimeMessage(message) {
+  return new Promise((resolve, reject) => {
+    if (!chrome?.runtime?.sendMessage) {
+      resolve(null);
+      return;
+    }
+    try {
+      chrome.runtime.sendMessage(message, (response) => {
+        if (chrome.runtime.lastError) {
+          reject(new Error(chrome.runtime.lastError.message || 'runtime message failed'));
+          return;
+        }
+        resolve(response || null);
+      });
+    } catch (error) {
+      reject(error);
+    }
+  });
+}
+
+function isIgnoredAttachError(errorMessage) {
+  const text = String(errorMessage || '').toLowerCase();
+  return (
+    text.includes('already attached')
+    || text.includes('cannot attach internal')
+    || text.includes('no active tab')
+  );
+}
+
+async function ensureCurrentTabAttached() {
+  try {
+    const response = await runtimeMessage({ type: 'attachCurrentTab' });
+    if (response?.error && !isIgnoredAttachError(response.error)) {
+      console.warn('[bf-agent] attachCurrentTab failed:', response.error);
+    }
+  } catch {
+    // best-effort only
+  }
+}
+
+async function getActiveTabContext() {
+  if (!chrome?.tabs?.query) return null;
+  try {
+    const [tab] = await chrome.tabs.query({ active: true, currentWindow: true });
+    if (!tab || typeof tab.id !== 'number') return null;
+    const title = String(tab.title || '').trim().slice(0, 180);
+    const url = String(tab.url || '').trim();
+    if (
+      !url
+      || url.startsWith('chrome://')
+      || url.startsWith('chrome-extension://')
+      || url.startsWith('edge://')
+      || url.startsWith('devtools://')
+    ) {
+      return { tabId: tab.id, title, url: null };
+    }
+    return { tabId: tab.id, title, url: url.slice(0, 500) };
+  } catch {
+    return null;
+  }
+}
+
 async function getRelayHttpUrl() {
   const stored = await chrome.storage.local.get(['relayUrl']);
   const relayUrl = stored.relayUrl || 'ws://127.0.0.1:19222/extension';
@@ -465,9 +527,12 @@ async function sendMessage(text) {
   const existing = getActiveMessages();
   dispatch({ type: 'messages.loaded', sessionId, messages: [...existing, { role: 'user', text }] });
 
+  await ensureCurrentTabAttached();
+  const browserContext = await getActiveTabContext();
+
   const res = await api('/v1/runs', {
     method: 'POST',
-    body: JSON.stringify({ sessionId, message: text }),
+    body: JSON.stringify({ sessionId, message: text, browserContext }),
   });
   if (!res.ok) {
     dispatch({ type: 'messages.loaded', sessionId, messages: existing });
@@ -528,6 +593,7 @@ popoverBackdropEl.addEventListener('click', () => {
   try {
     setStatus('info', 'Connecting...');
     await loadAuth();
+    await ensureCurrentTabAttached();
     try {
       await loadModelPresets();
     } catch {
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index e7301c1..48b172f 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -15,3 +15,11 @@ test('submit handler preserves draft on send failure', () => {
   assert.match(js, /try\s*\{\s*await sendMessage\(text\);[\s\S]*chatInputEl\.value = '';/);
   assert.match(js, /catch\s*\(\w+\)\s*\{[\s\S]*chatInputEl\.value = text;/);
 });
+
+test('sidepanel auto-attaches current tab and sends browserContext with runs', () => {
+  assert.match(js, /async function ensureCurrentTabAttached\(\)/);
+  assert.match(js, /runtimeMessage\(\{\s*type:\s*'attachCurrentTab'\s*\}\)/);
+  assert.match(js, /await ensureCurrentTabAttached\(\);/);
+  assert.match(js, /const browserContext = await getActiveTabContext\(\);/);
+  assert.match(js, /JSON\.stringify\(\{\s*sessionId,\s*message:\s*text,\s*browserContext\s*\}\)/);
+});
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 577fef8..704d6d0 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -255,3 +255,52 @@ test('runExecutor synchronous failure does not leak abortable run', async () =>
     rmSync(storageRoot, { recursive: true, force: true });
   }
 });
+
+test('POST /v1/runs includes active tab context in runExecutor prompt', async () => {
+  const seenRuns = [];
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    runExecutor: ({ runId, sessionId, message, onExit }) => {
+      seenRuns.push({ runId, sessionId, message });
+      setTimeout(() => onExit({ code: 0 }), 5);
+      return { abort() {} };
+    },
+  });
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'context' }),
+    }).then((res) => res.json());
+
+    const runRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({
+        sessionId: created.sessionId,
+        message: 'summarize this page',
+        browserContext: {
+          tabId: 42,
+          title: 'Pricing',
+          url: 'https://example.com/pricing',
+        },
+      }),
+    });
+    assert.equal(runRes.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 20));
+    const prompt = seenRuns.at(-1)?.message || '';
+    assert.match(prompt, /Active tab title: Pricing/);
+    assert.match(prompt, /Active tab URL: https:\/\/example\.com\/pricing/);
+    assert.match(prompt, /If the request is ambiguous/i);
+    assert.match(prompt, /User request:\s*summarize this page/i);
+  } finally {
+    await daemon.stop();
+  }
+});

From 525bfcefec38bf52b923ba0ccc4e451b0ba4a139 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 12:23:40 +0530
Subject: [PATCH 112/192] relay: add chatd-url endpoint with extension-id auth
 fallback

---
 relay/src/index.js              | 56 ++++++++++++++++++-
 relay/test/relay-server.test.js | 97 +++++++++++++++++++++++++++++++++
 2 files changed, 151 insertions(+), 2 deletions(-)

diff --git a/relay/src/index.js b/relay/src/index.js
index 33179d4..1335f1f 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -16,6 +16,7 @@ const DEFAULT_CDP_LOG_BUFFER_LIMIT = 10000;
 const BF_DIR = path.join(os.homedir(), '.browserforce');
 const TOKEN_FILE = path.join(BF_DIR, 'auth-token');
 const CDP_URL_FILE = path.join(BF_DIR, 'cdp-url');
+const CHATD_URL_FILE = path.join(BF_DIR, 'chatd-url.json');
 const BF_PLUGINS_DIR = path.join(BF_DIR, 'plugins');
 const CLIENT_MODE_SINGLE = 'single-active';
 const CLIENT_MODE_MULTI = 'multi-client';
@@ -356,6 +357,42 @@ class RelayServer {
       return;
     }
 
+    if (url.pathname === '/chatd-url' && req.method === 'GET') {
+      if (!this._requireExtensionOrigin(req, res)) return;
+      try {
+        const body = fs.readFileSync(CHATD_URL_FILE, 'utf8');
+        const parsed = JSON.parse(body);
+        if (!Number.isInteger(parsed?.port) || typeof parsed?.token !== 'string') {
+          throw new Error('invalid shape');
+        }
+
+        let healthy = false;
+        try {
+          const healthRes = await fetch(`http://127.0.0.1:${parsed.port}/health`, {
+            signal: AbortSignal.timeout(500),
+          });
+          healthy = healthRes.ok;
+        } catch {
+          healthy = false;
+        }
+        if (!healthy) {
+          res.statusCode = 404;
+          res.end(JSON.stringify({ error: 'chatd not running' }));
+          return;
+        }
+        res.end(JSON.stringify(parsed));
+      } catch (err) {
+        if (err && err.code === 'ENOENT') {
+          res.statusCode = 404;
+          res.end(JSON.stringify({ error: 'chatd not running' }));
+          return;
+        }
+        res.statusCode = 500;
+        res.end(JSON.stringify({ error: 'invalid chatd-url metadata' }));
+      }
+      return;
+    }
+
     if (url.pathname === '/logs/status' && req.method === 'GET') {
       if (!this._requireExtensionOrigin(req, res)) return;
       res.end(JSON.stringify(this._logsStatus()));
@@ -538,7 +575,10 @@ class RelayServer {
 
   _requireExtensionOrigin(req, res) {
     const origin = this._extensionOriginFromReq(req);
-    if (!origin) {
+    const requestedExtensionId = String(req?.headers?.['x-browserforce-extension-id'] || '').trim();
+    const extensionIdPattern = /^[a-p]{32}$/;
+
+    if (!origin && !extensionIdPattern.test(requestedExtensionId)) {
       res.statusCode = 403;
       res.end(JSON.stringify({ error: 'Forbidden — extension origin required' }));
       return false;
@@ -550,11 +590,23 @@ class RelayServer {
       res.end(JSON.stringify({ error: 'Extension not connected' }));
       return false;
     }
-    if (trustedOrigin && origin !== trustedOrigin) {
+
+    if (origin) {
+      if (origin !== trustedOrigin) {
+        res.statusCode = 403;
+        res.end(JSON.stringify({ error: 'Forbidden — extension origin mismatch' }));
+        return false;
+      }
+      return true;
+    }
+
+    const trustedExtensionId = String(trustedOrigin).replace('chrome-extension://', '');
+    if (requestedExtensionId !== trustedExtensionId) {
       res.statusCode = 403;
       res.end(JSON.stringify({ error: 'Forbidden — extension origin mismatch' }));
       return false;
     }
+
     return true;
   }
 
diff --git a/relay/test/relay-server.test.js b/relay/test/relay-server.test.js
index c225871..88a1819 100644
--- a/relay/test/relay-server.test.js
+++ b/relay/test/relay-server.test.js
@@ -201,6 +201,103 @@ describe('HTTP Endpoints', () => {
   });
 });
 
+describe('Chatd URL Endpoint', () => {
+  let relay;
+  let port;
+  const chatdUrlPath = path.join(BF_DIR, 'chatd-url.json');
+
+  before(async () => {
+    port = getRandomPort();
+    relay = new RelayServer(port);
+    relay.start({ writeCdpUrl: false });
+    await sleep(200);
+    fs.rmSync(chatdUrlPath, { force: true });
+  });
+
+  after(() => {
+    relay.stop();
+    fs.rmSync(chatdUrlPath, { force: true });
+  });
+
+  it('GET /chatd-url returns 404 when chatd not running', async () => {
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') ext.send(JSON.stringify({ method: 'pong' }));
+    });
+
+    const { status, body } = await httpGetWithHeaders(`http://127.0.0.1:${port}/chatd-url`, {
+      Origin: 'chrome-extension://test',
+    });
+    assert.equal(status, 404);
+    assert.match(body.error, /chatd not running/);
+
+    ext.close();
+    await sleep(50);
+  });
+
+  it('GET /chatd-url returns 404 when metadata is stale', async () => {
+    fs.writeFileSync(chatdUrlPath, JSON.stringify({ port: 65534, token: 'stale' }));
+
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://test' },
+    });
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') ext.send(JSON.stringify({ method: 'pong' }));
+    });
+
+    const { status, body } = await httpGetWithHeaders(`http://127.0.0.1:${port}/chatd-url`, {
+      Origin: 'chrome-extension://test',
+    });
+    assert.equal(status, 404);
+    assert.match(body.error, /chatd not running/);
+
+    ext.close();
+    await sleep(50);
+  });
+
+  it('GET /chatd-url accepts extension id header when Origin is absent', async () => {
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://abcdefghijklmnopabcdefghijklmnop' },
+    });
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') ext.send(JSON.stringify({ method: 'pong' }));
+    });
+
+    const { status, body } = await httpGetWithHeaders(`http://127.0.0.1:${port}/chatd-url`, {
+      'x-browserforce-extension-id': 'abcdefghijklmnopabcdefghijklmnop',
+    });
+    assert.equal(status, 404);
+    assert.match(body.error, /chatd not running/);
+
+    ext.close();
+    await sleep(50);
+  });
+
+  it('GET /chatd-url rejects mismatched extension id header', async () => {
+    const ext = await connectWs(`ws://127.0.0.1:${port}/extension`, {
+      headers: { Origin: 'chrome-extension://abcdefghijklmnopabcdefghijklmnop' },
+    });
+    ext.on('message', (data) => {
+      const msg = JSON.parse(data.toString());
+      if (msg.method === 'ping') ext.send(JSON.stringify({ method: 'pong' }));
+    });
+
+    const { status, body } = await httpGetWithHeaders(`http://127.0.0.1:${port}/chatd-url`, {
+      'x-browserforce-extension-id': 'ponmlkjihgfedcbaponmlkjihgfedcba',
+    });
+    assert.equal(status, 403);
+    assert.match(body.error, /origin mismatch/);
+
+    ext.close();
+    await sleep(50);
+  });
+});
+
 // ─── Logs Viewer Endpoints ───────────────────────────────────────────────────
 
 describe('Logs Viewer Endpoints', () => {

From c722fff4e5c852344ec31189c324c1e20d45c8e6 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 12:23:50 +0530
Subject: [PATCH 113/192] cli(agent): add daemon start/status/stop with
 lockfile and port fallback

---
 agent/src/lockfile.js        |  68 +++++++++++++++++++++
 agent/src/port-resolver.js   |  27 +++++++++
 bin.js                       | 111 ++++++++++++++++++++++++++++++++++-
 test/agent/cli-agent.test.js |  45 ++++++++++++++
 4 files changed, 250 insertions(+), 1 deletion(-)
 create mode 100644 agent/src/lockfile.js
 create mode 100644 agent/src/port-resolver.js
 create mode 100644 test/agent/cli-agent.test.js

diff --git a/agent/src/lockfile.js b/agent/src/lockfile.js
new file mode 100644
index 0000000..7661163
--- /dev/null
+++ b/agent/src/lockfile.js
@@ -0,0 +1,68 @@
+import { promises as fs } from 'node:fs';
+import { homedir } from 'node:os';
+import { dirname, join } from 'node:path';
+
+export const DEFAULT_CHATD_LOCK_PATH = join(homedir(), '.browserforce', 'chatd-lock.json');
+
+function resolveLockPath(lockPath) {
+  return lockPath || DEFAULT_CHATD_LOCK_PATH;
+}
+
+export async function isLockAlive({ lock } = {}) {
+  if (!lock || !Number.isInteger(lock.pid) || lock.pid <= 0) return false;
+  try {
+    process.kill(lock.pid, 0);
+    return true;
+  } catch {
+    return false;
+  }
+}
+
+export async function writeLock({ pid, port, token, lockPath } = {}) {
+  if (!Number.isInteger(pid) || pid <= 0) throw new Error('writeLock requires a positive integer pid');
+  if (!Number.isInteger(port) || port <= 0) throw new Error('writeLock requires a positive integer port');
+  if (!token || typeof token !== 'string') throw new Error('writeLock requires token');
+
+  const finalPath = resolveLockPath(lockPath);
+  await fs.mkdir(dirname(finalPath), { recursive: true });
+  const payload = { pid, port, token };
+  await fs.writeFile(finalPath, `${JSON.stringify(payload)}\n`, { mode: 0o600 });
+  return payload;
+}
+
+export async function readLock({ lockPath } = {}) {
+  const finalPath = resolveLockPath(lockPath);
+  let raw;
+  try {
+    raw = await fs.readFile(finalPath, 'utf8');
+  } catch (error) {
+    if (error && error.code === 'ENOENT') return null;
+    throw error;
+  }
+
+  let data;
+  try {
+    data = JSON.parse(raw);
+  } catch {
+    return null;
+  }
+
+  if (!Number.isInteger(data.pid) || !Number.isInteger(data.port) || typeof data.token !== 'string') {
+    return null;
+  }
+
+  if (!await isLockAlive({ lock: data })) {
+    return null;
+  }
+
+  return data;
+}
+
+export async function clearLock({ lockPath } = {}) {
+  const finalPath = resolveLockPath(lockPath);
+  try {
+    await fs.unlink(finalPath);
+  } catch (error) {
+    if (error && error.code !== 'ENOENT') throw error;
+  }
+}
diff --git a/agent/src/port-resolver.js b/agent/src/port-resolver.js
new file mode 100644
index 0000000..26f1fc8
--- /dev/null
+++ b/agent/src/port-resolver.js
@@ -0,0 +1,27 @@
+import net from 'node:net';
+
+function isIntegerPort(value) {
+  return Number.isInteger(value) && value > 0 && value <= 65535;
+}
+
+async function isPortFree(port) {
+  return new Promise((resolve) => {
+    const server = net.createServer();
+    server.once('error', () => resolve(false));
+    server.listen(port, '127.0.0.1', () => {
+      server.close(() => resolve(true));
+    });
+  });
+}
+
+export async function pickChatdPort({ envPort, rangeStart = 19280, rangeEnd = 19320 } = {}) {
+  if (isIntegerPort(envPort) && await isPortFree(envPort)) {
+    return envPort;
+  }
+
+  for (let port = rangeStart; port <= rangeEnd; port += 1) {
+    if (await isPortFree(port)) return port;
+  }
+
+  throw new Error(`No free port in range ${rangeStart}-${rangeEnd}`);
+}
diff --git a/bin.js b/bin.js
index b08da13..773980c 100644
--- a/bin.js
+++ b/bin.js
@@ -595,6 +595,114 @@ async function cmdSetup() {
   }
 }
 
+async function fetchChatdHealth(port, attempts = 20, delayMs = 100) {
+  for (let i = 0; i < attempts; i += 1) {
+    try {
+      const health = await httpGet(`http://127.0.0.1:${port}/health`);
+      if (health && health.ok) return health;
+    } catch {
+      // Keep polling until timeout.
+    }
+    await new Promise((resolve) => setTimeout(resolve, delayMs));
+  }
+  return null;
+}
+
+async function cmdAgent() {
+  const sub = positionals[1];
+  const { pickChatdPort } = await import('./agent/src/port-resolver.js');
+  const { writeLock, readLock, clearLock, isLockAlive } = await import('./agent/src/lockfile.js');
+  const { randomBytes } = await import('node:crypto');
+  const { spawn } = await import('node:child_process');
+  const { promises: fsp } = await import('node:fs');
+  const { homedir } = await import('node:os');
+  const { join } = await import('node:path');
+
+  const lockPath = process.env.BF_CHATD_LOCK_PATH || join(homedir(), '.browserforce', 'chatd-lock.json');
+  const chatdUrlPath = process.env.BF_CHATD_URL_PATH || join(homedir(), '.browserforce', 'chatd-url.json');
+
+  if (sub === 'start') {
+    const current = await readLock({ lockPath });
+    if (current && await isLockAlive({ lock: current })) {
+      output({ started: false, running: true, pid: current.pid, port: current.port }, values.json);
+      return;
+    }
+
+    const envPort = Number(process.env.BF_CHATD_PORT || 0);
+    const port = await pickChatdPort({ envPort });
+    const token = randomBytes(32).toString('base64url');
+
+    const child = spawn(
+      process.execPath,
+      [fileURLToPath(new URL('./agent/src/chatd.js', import.meta.url))],
+      {
+        detached: true,
+        stdio: 'ignore',
+        env: { ...process.env, BF_CHATD_PORT: String(port), BF_CHATD_TOKEN: token },
+      },
+    );
+    child.unref();
+
+    await writeLock({ pid: child.pid, port, token, lockPath });
+    const health = await fetchChatdHealth(port, 30, 100);
+    output({ started: true, pid: child.pid, port, ready: !!health }, values.json);
+    return;
+  }
+
+  if (sub === 'status') {
+    const lock = await readLock({ lockPath });
+    if (!lock) {
+      output({ running: false }, values.json);
+      return;
+    }
+
+    const alive = await isLockAlive({ lock });
+    if (!alive) {
+      await clearLock({ lockPath });
+      output({ running: false, stale: true }, values.json);
+      return;
+    }
+
+    const health = await fetchChatdHealth(lock.port, 10, 100);
+    output({
+      running: !!health,
+      pid: lock.pid,
+      port: lock.port,
+      health,
+    }, values.json);
+    return;
+  }
+
+  if (sub === 'stop') {
+    const clearChatdUrlFile = async () => {
+      try { await fsp.unlink(chatdUrlPath); } catch (error) { if (error?.code !== 'ENOENT') throw error; }
+    };
+    const lock = await readLock({ lockPath });
+    if (!lock) {
+      await clearLock({ lockPath });
+      await clearChatdUrlFile();
+      output({ stopped: true, running: false }, values.json);
+      return;
+    }
+
+    const alive = await isLockAlive({ lock });
+    if (alive) {
+      try {
+        process.kill(lock.pid, 'SIGTERM');
+      } catch {
+        // ignore kill race
+      }
+    }
+    await clearLock({ lockPath });
+    await clearChatdUrlFile();
+    output({ stopped: true, pid: lock.pid, port: lock.port }, values.json);
+    return;
+  }
+
+  console.error('Usage: browserforce agent start|status|stop');
+  process.exit(1);
+}
+
 function cmdHelp() {
   console.log(`
   BrowserForce — Give AI agents your real Chrome browser
@@ -610,6 +718,7 @@ function cmdHelp() {
     browserforce plugin list        List installed plugins
     browserforce plugin install <n> Install a plugin from the registry
     browserforce plugin remove <n>  Remove an installed plugin
+    browserforce agent <subcmd>     Start/status/stop local BrowserForce Agent daemon
     browserforce setup openclaw     Configure OpenClaw + optional autostart
     browserforce update             Update to the latest version
     browserforce install-extension  Copy extension to ~/.browserforce/extension/
@@ -645,7 +754,7 @@ const commands = {
   serve: cmdServe, mcp: cmdMcp, status: cmdStatus, tabs: cmdTabs,
   screenshot: cmdScreenshot, snapshot: cmdSnapshot, navigate: cmdNavigate,
   execute: cmdExecute, plugin: cmdPlugin, update: cmdUpdate,
-  'install-extension': cmdInstallExtension, setup: cmdSetup, help: cmdHelp,
+  'install-extension': cmdInstallExtension, setup: cmdSetup, agent: cmdAgent, help: cmdHelp,
 };
 
 const handler = commands[command];
diff --git a/test/agent/cli-agent.test.js b/test/agent/cli-agent.test.js
new file mode 100644
index 0000000..d686545
--- /dev/null
+++ b/test/agent/cli-agent.test.js
@@ -0,0 +1,45 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import { execFile } from 'node:child_process';
+import { promisify } from 'node:util';
+import { mkdtempSync, rmSync } from 'node:fs';
+import { join } from 'node:path';
+import { tmpdir } from 'node:os';
+
+const exec = promisify(execFile);
+
+function cli(args, env) {
+  return exec('node', ['bin.js', ...args], {
+    cwd: process.cwd(),
+    env: { ...process.env, ...env },
+  });
+}
+
+test('browserforce agent start allocates a non-conflicting port', async () => {
+  const home = mkdtempSync(join(tmpdir(), 'bf-agent-cli-home-'));
+  try {
+    const { stdout } = await cli(['agent', 'start', '--json'], { HOME: home });
+    const body = JSON.parse(stdout);
+    assert.equal(body.started, true);
+    assert.ok(Number.isInteger(body.port));
+    assert.ok(Number.isInteger(body.pid));
+
+    await cli(['agent', 'stop', '--json'], { HOME: home });
+  } finally {
+    rmSync(home, { recursive: true, force: true });
+  }
+});
+
+test('browserforce agent status prints chatd health', async () => {
+  const home = mkdtempSync(join(tmpdir(), 'bf-agent-cli-home-'));
+  try {
+    await cli(['agent', 'start', '--json'], { HOME: home });
+    const { stdout } = await cli(['agent', 'status', '--json'], { HOME: home });
+    const body = JSON.parse(stdout);
+    assert.equal(body.running, true);
+    assert.equal(body.health.ok, true);
+    await cli(['agent', 'stop', '--json'], { HOME: home });
+  } finally {
+    rmSync(home, { recursive: true, force: true });
+  }
+});

From 222a347de0c0dc3869dd460916b0245ebb6c8f09 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 12:24:02 +0530
Subject: [PATCH 114/192] extension: wire sidepanel entry and popup agent
 launcher UX

---
 extension/agent-panel.html              | 47 +++++++++++++++++++++++++
 extension/manifest.json                 |  6 +++-
 extension/popup.css                     | 13 +++++++
 extension/popup.html                    |  1 +
 extension/popup.js                      | 46 ++++++++++++++++++++++--
 test/agent/agent-panel-contract.test.js | 21 +++++++++++
 test/agent/extension-manifest.test.js   | 10 ++++++
 test/agent/popup-contract.test.js       |  9 +++++
 8 files changed, 149 insertions(+), 4 deletions(-)
 create mode 100644 extension/agent-panel.html
 create mode 100644 test/agent/agent-panel-contract.test.js
 create mode 100644 test/agent/extension-manifest.test.js
 create mode 100644 test/agent/popup-contract.test.js

diff --git a/extension/agent-panel.html b/extension/agent-panel.html
new file mode 100644
index 0000000..b53408e
--- /dev/null
+++ b/extension/agent-panel.html
@@ -0,0 +1,47 @@
+<!DOCTYPE html>
+<html lang="en">
+<head>
+  <meta charset="UTF-8">
+  <meta name="viewport" content="width=device-width, initial-scale=1.0">
+  <title>BrowserForce Agent</title>
+  <link rel="stylesheet" href="agent-panel.css">
+</head>
+<body>
+  <main class="agent-shell">
+    <header class="agent-header">
+      <div class="header-row">
+        <button id="bf-model-trigger" type="button" class="selector-pill" aria-expanded="false">Model</button>
+        <button id="bf-session-trigger" type="button" class="selector-pill" aria-expanded="false">Session</button>
+        <button id="bf-new-session" type="button" class="icon-btn" aria-label="New Session" title="New Session">+</button>
+      </div>
+      <p id="bf-agent-status" class="status">
+        <span id="bf-agent-status-icon" class="status-icon" aria-hidden="true">●</span>
+        <span id="bf-agent-status-text">Starting...</span>
+      </p>
+    </header>
+
+    <section class="chat">
+      <div id="bf-transcript" class="transcript"></div>
+      <form id="bf-chat-form" class="composer">
+        <textarea id="bf-chat-input" placeholder="Send a message to BrowserForce Agent"></textarea>
+        <div class="composer-actions">
+          <button id="bf-stop-run" type="button" class="secondary">Stop</button>
+          <button type="submit">Send</button>
+        </div>
+      </form>
+    </section>
+
+    <div id="bf-popover-backdrop" class="menu-backdrop hidden"></div>
+
+    <section id="bf-model-panel" class="popover-panel hidden">
+      <ul id="bf-model-list" class="popover-list"></ul>
+    </section>
+
+    <section id="bf-session-panel" class="popover-panel hidden">
+      <ul id="bf-switch-session-list" class="popover-list"></ul>
+    </section>
+  </main>
+
+  <script type="module" src="agent-panel.js"></script>
+</body>
+</html>
diff --git a/extension/manifest.json b/extension/manifest.json
index 6240fc6..2730d36 100644
--- a/extension/manifest.json
+++ b/extension/manifest.json
@@ -8,7 +8,8 @@
     "tabs",
     "tabGroups",
     "storage",
-    "alarms"
+    "alarms",
+    "sidePanel"
   ],
   "host_permissions": [
     "http://127.0.0.1/*",
@@ -30,5 +31,8 @@
       "16": "icons/icon16.png",
       "48": "icons/icon48.png"
     }
+  },
+  "side_panel": {
+    "default_path": "agent-panel.html"
   }
 }
diff --git a/extension/popup.css b/extension/popup.css
index d2e3b28..f26a11b 100644
--- a/extension/popup.css
+++ b/extension/popup.css
@@ -257,6 +257,19 @@ button:active { background: #388e3c; }
 .attach-btn:hover { background: #eee; border-color: #aaa; }
 .attach-btn:active { background: #e0e0e0; }
 
+.agent-btn {
+  width: 100%;
+  margin-top: 8px;
+  margin-bottom: 8px;
+  padding: 10px;
+  background: #2f7bf6;
+  color: #fff;
+  border-radius: 6px;
+}
+
+.agent-btn:hover { background: #1d66d9; }
+.agent-btn:active { background: #1757b8; }
+
 .logs-btn {
   width: 100%;
   margin-top: 8px;
diff --git a/extension/popup.html b/extension/popup.html
index 3deedb5..703efd6 100644
--- a/extension/popup.html
+++ b/extension/popup.html
@@ -41,6 +41,7 @@ <h1>BrowserForce</h1>
       </section>
 
       <button id="bf-attach-tab" class="attach-btn">+ Attach Current Tab</button>
+      <button id="bf-open-agent" class="agent-btn">Open BrowserForce Agent</button>
 
       <button id="bf-open-logs" class="logs-btn">View Full Logs</button>
     </div>
diff --git a/extension/popup.js b/extension/popup.js
index 8ba9ff1..76d9905 100644
--- a/extension/popup.js
+++ b/extension/popup.js
@@ -21,6 +21,7 @@ const tabCountEl = document.getElementById('bf-tab-count');
 const tabsListEl = document.getElementById('bf-tabs-list');
 const autoTimerEl = document.getElementById('bf-auto-timer');
 const attachBtn = document.getElementById('bf-attach-tab');
+const openAgentBtn = document.getElementById('bf-open-agent');
 const openLogsBtn = document.getElementById('bf-open-logs');
 const modeSelect = document.getElementById('bf-mode');
 const executionModeSelect = document.getElementById('bf-execution-mode');
@@ -67,12 +68,41 @@ chrome.storage.local.get(SETTINGS_KEYS, (s) => {
 
 // --- Save Handlers ---
 
+function setSaveUrlFeedback(label, disabled) {
+  saveUrlBtn.textContent = label;
+  saveUrlBtn.disabled = !!disabled;
+}
+
 saveUrlBtn.addEventListener('click', () => {
   const url = relayUrlInput.value.trim();
   if (!url) return;
-  chrome.storage.local.set({ relayUrl: url }, () => {
-    saveUrlBtn.textContent = 'Saved';
-    setTimeout(() => { saveUrlBtn.textContent = 'Save'; }, 1200);
+  setSaveUrlFeedback('Connecting...', true);
+  setStatus('connecting', 'connecting');
+
+  chrome.runtime.sendMessage({ type: 'updateRelayUrl', relayUrl: url }, (response) => {
+    if (chrome.runtime.lastError || !response) {
+      setSaveUrlFeedback('Connection failed', false);
+      setStatus('disconnected', 'connection failed');
+      setTimeout(() => setSaveUrlFeedback('Save', false), 1800);
+      return;
+    }
+
+    if (response.error) {
+      setSaveUrlFeedback('Connection failed', false);
+      setStatus('disconnected', response.error);
+      setTimeout(() => {
+        setSaveUrlFeedback('Save', false);
+        refreshStatus();
+      }, 1800);
+      return;
+    }
+
+    setSaveUrlFeedback('Connected', false);
+    setStatus(response.connectionState || 'connected', response.connectionState || 'connected');
+    setTimeout(() => {
+      setSaveUrlFeedback('Save', false);
+      refreshStatus();
+    }, 1200);
   });
 });
 
@@ -165,6 +195,16 @@ attachBtn.addEventListener('click', () => {
   });
 });
 
+openAgentBtn.addEventListener('click', async () => {
+  try {
+    const [tab] = await chrome.tabs.query({ active: true, currentWindow: true });
+    await chrome.sidePanel.open({ windowId: tab?.windowId });
+  } catch {
+    openAgentBtn.textContent = 'Failed to open';
+    setTimeout(() => { openAgentBtn.textContent = 'Open BrowserForce Agent'; }, 1500);
+  }
+});
+
 openLogsBtn.addEventListener('click', () => {
   chrome.runtime.openOptionsPage();
 });
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
new file mode 100644
index 0000000..3739bc3
--- /dev/null
+++ b/test/agent/agent-panel-contract.test.js
@@ -0,0 +1,21 @@
+import fs from 'node:fs';
+import test from 'node:test';
+import assert from 'node:assert/strict';
+
+const html = fs.readFileSync('extension/agent-panel.html', 'utf8');
+
+test('agent panel has inline model and session selectors with popovers', () => {
+  assert.match(html, /id="bf-model-trigger"/);
+  assert.match(html, /id="bf-session-trigger"/);
+  assert.match(html, /id="bf-new-session"/);
+  assert.match(html, /aria-label="New Session"/);
+  assert.match(html, /id="bf-model-panel"/);
+  assert.match(html, /id="bf-session-panel"/);
+  assert.match(html, /id="bf-model-list"/);
+  assert.match(html, /id="bf-switch-session-list"/);
+});
+
+test('agent panel no longer renders title or persistent session sidebar', () => {
+  assert.doesNotMatch(html, /<h1/);
+  assert.doesNotMatch(html, /<aside class="sessions">/);
+});
diff --git a/test/agent/extension-manifest.test.js b/test/agent/extension-manifest.test.js
new file mode 100644
index 0000000..9061d2c
--- /dev/null
+++ b/test/agent/extension-manifest.test.js
@@ -0,0 +1,10 @@
+import fs from 'node:fs';
+import test from 'node:test';
+import assert from 'node:assert/strict';
+
+const manifest = JSON.parse(fs.readFileSync('extension/manifest.json', 'utf8'));
+
+test('manifest includes sidePanel permission and default_path', () => {
+  assert.ok(manifest.permissions.includes('sidePanel'));
+  assert.equal(manifest.side_panel.default_path, 'agent-panel.html');
+});
diff --git a/test/agent/popup-contract.test.js b/test/agent/popup-contract.test.js
new file mode 100644
index 0000000..3995a17
--- /dev/null
+++ b/test/agent/popup-contract.test.js
@@ -0,0 +1,9 @@
+import fs from 'node:fs';
+import test from 'node:test';
+import assert from 'node:assert/strict';
+
+const html = fs.readFileSync('extension/popup.html', 'utf8');
+
+test('popup includes Open BrowserForce Agent button', () => {
+  assert.match(html, /Open BrowserForce Agent/);
+});

From dac4c3eea09820ac3fd3020e796f2501e82a314a Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 12:24:30 +0530
Subject: [PATCH 115/192] test(sidepanel): cover session selection and message
 hydration reducers

---
 test/agent/session-ui-state.test.js | 36 +++++++++++++++++++++++++++++
 1 file changed, 36 insertions(+)
 create mode 100644 test/agent/session-ui-state.test.js

diff --git a/test/agent/session-ui-state.test.js b/test/agent/session-ui-state.test.js
new file mode 100644
index 0000000..d0ebd3f
--- /dev/null
+++ b/test/agent/session-ui-state.test.js
@@ -0,0 +1,36 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import { reduceState } from '../../extension/agent-panel-state.js';
+
+test('selectSession replaces active transcript with selected session messages', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {
+      s1: [{ role: 'user', text: 'one' }],
+      s2: [{ role: 'assistant', text: 'two' }],
+    },
+  };
+
+  const next = reduceState(state, { type: 'session.selected', sessionId: 's2' });
+  assert.equal(next.activeSessionId, 's2');
+  assert.equal(next.messagesBySession.s2[0].text, 'two');
+});
+
+test('messages.loaded hydrates transcript for the selected session', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {},
+  };
+
+  const next = reduceState(state, {
+    type: 'messages.loaded',
+    sessionId: 's1',
+    messages: [{ role: 'assistant', text: 'hello' }],
+  });
+
+  assert.equal(next.messagesBySession.s1[0].text, 'hello');
+});

From 0bad0450a06cb5c866d9b3c35fb43f4cc9611e84 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 12:24:46 +0530
Subject: [PATCH 116/192] chore: include agent package artifacts and document
 sidepanel daemon workflow

---
 README.md    | 27 +++++++++++++++++++++++++++
 package.json |  5 +++--
 2 files changed, 30 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index b870760..1d9890f 100644
--- a/README.md
+++ b/README.md
@@ -374,6 +374,9 @@ browserforce -e "<code>"        # Run Playwright JavaScript (one-shot)
 browserforce plugin list        # List installed plugins
 browserforce plugin install <n> # Install a plugin from the registry
 browserforce plugin remove <n>  # Remove an installed plugin
+browserforce agent start        # Start local BrowserForce Agent daemon (chatd)
+browserforce agent status       # Show daemon PID/port + /health
+browserforce agent stop         # Stop daemon and clear lockfile
 browserforce setup openclaw [--dry-run] [--json] [--no-autostart] # Configure OpenClaw + optional autostart
 browserforce update             # Update to the latest version
 browserforce install-extension  # Copy extension to ~/.browserforce/extension/
@@ -383,6 +386,30 @@ Setup flags: `--dry-run` (preview), `--no-autostart` (skip OS login daemon/servi
 
 Each `-e` command is one-shot — state does not persist between calls. For persistent state, use the MCP server.
 
+### BrowserForce Agent Side Panel
+
+BrowserForce now includes a side-panel chat UI in the Chrome extension for resumable local sessions.
+
+- Open popup -> `Open BrowserForce Agent` to open the side panel.
+- Use the session list to switch between chats; transcripts hydrate per selected `sessionId`.
+- Session identity is explicit and persisted; there is no fixed/hardcoded chat session ID.
+- Streaming uses `fetch` + `ReadableStream` for SSE, not `EventSource`, so the panel can send `Authorization: Bearer ...` headers.
+
+Daemon lifecycle:
+
+```bash
+browserforce agent start
+browserforce agent status
+browserforce agent stop
+```
+
+Port/auth bootstrap:
+
+- `agent start` picks a loopback port. If `BF_CHATD_PORT` is set and free, it is used.
+- If that port is unavailable, BrowserForce falls back to the first free port in `19280-19320`.
+- The daemon writes `~/.browserforce/chatd-url.json` (`{ port, token }`, mode `0600`).
+- Side-panel JS reads relay URL from extension storage, calls relay `GET /chatd-url` (extension-origin gated), then connects directly to chatd with Bearer auth.
+
 
 ## Deep Dive Sections
 
diff --git a/package.json b/package.json
index 1af8494..1efaefd 100644
--- a/package.json
+++ b/package.json
@@ -35,7 +35,8 @@
     "relay/package.json",
     "mcp/src/",
     "mcp/package.json",
-    "skills/"
+    "skills/",
+    "agent/"
   ],
   "dependencies": {
     "@modelcontextprotocol/sdk": "^1.12.1",
@@ -49,7 +50,7 @@
     "relay:dev": "lsof -ti tcp:19222 | xargs kill -9 2>/dev/null; sleep 0.3; node --watch relay/src/index.js",
     "mcp": "node mcp/src/index.js",
     "postinstall": "node scripts/postinstall-openclaw.mjs",
-    "test": "node --test relay/test/relay-server.test.js && node --test mcp/test/mcp-tools.test.js && node --test mcp/test/plugin-loader.test.js && node --test mcp/test/plugin-installer.test.js && node --test mcp/test/exec-engine-plugins.test.js && node --test mcp/test/mcp-plugin-integration.test.js && node --test test/cli.test.js && node --test test/postinstall.test.js",
+    "test": "node --test relay/test/relay-server.test.js && node --test mcp/test/mcp-tools.test.js && node --test mcp/test/plugin-loader.test.js && node --test mcp/test/plugin-installer.test.js && node --test mcp/test/exec-engine-plugins.test.js && node --test mcp/test/mcp-plugin-integration.test.js && node --test test/agent/port-resolver.test.js && node --test test/agent/session-store.test.js && node --test test/agent/codex-runner.test.js && node --test test/agent/chatd-api.test.js && node --test test/agent/extension-manifest.test.js && node --test test/agent/popup-contract.test.js && node --test test/agent/relay-url-reconnect-contract.test.js && node --test test/agent/agent-panel-contract.test.js && node --test test/agent/agent-panel-send-contract.test.js && node --test test/agent/session-ui-state.test.js && node --test test/agent/sse-events.test.js && node --test test/agent/auth.test.js && node --test test/agent/agent-panel-runtime.test.js && node --test test/agent/cli-agent.test.js && node --test test/cli.test.js && node --test test/postinstall.test.js",
     "test:relay": "node --test relay/test/relay-server.test.js",
     "test:mcp": "node --test mcp/test/mcp-tools.test.js && node --test mcp/test/plugin-loader.test.js && node --test mcp/test/plugin-installer.test.js && node --test mcp/test/exec-engine-plugins.test.js && node --test mcp/test/mcp-plugin-integration.test.js"
   }

From 33aa805b6edc786b3078dd3f9f5c1b344494e450 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 13:02:43 +0530
Subject: [PATCH 117/192] fix(sidepanel): avoid premature finalization and
 submit on Enter

---
 agent/src/codex-runner.js                    | 4 ++--
 extension/agent-panel.js                     | 7 +++++++
 test/agent/agent-panel-send-contract.test.js | 7 +++++++
 test/agent/codex-runner.test.js              | 6 +++---
 4 files changed, 19 insertions(+), 5 deletions(-)

diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
index 96d6088..073f550 100644
--- a/agent/src/codex-runner.js
+++ b/agent/src/codex-runner.js
@@ -44,10 +44,10 @@ export function normalizeCodexLine({ runId, sessionId, line }) {
     const itemType = parsed.item?.type || '';
     if (itemType === 'agent_message') {
       return envelope({
-        event: 'chat.final',
+        event: 'chat.delta',
         runId,
         sessionId,
-        payload: { text: String(parsed.item?.text || '') },
+        payload: { delta: String(parsed.item?.text || '') },
       });
     }
     if (itemType === 'reasoning') {
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index b6c6d34..c48fe65 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -567,6 +567,13 @@ chatFormEl.addEventListener('submit', async (event) => {
   }
 });
 
+chatInputEl.addEventListener('keydown', (event) => {
+  if (event.key !== 'Enter' || event.shiftKey) return;
+  if (event.isComposing) return;
+  event.preventDefault();
+  chatFormEl.requestSubmit();
+});
+
 newSessionBtn.addEventListener('click', () => {
   createSession()
     .then(() => setPopover('none'))
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 48b172f..faf3ddd 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -23,3 +23,10 @@ test('sidepanel auto-attaches current tab and sends browserContext with runs', (
   assert.match(js, /const browserContext = await getActiveTabContext\(\);/);
   assert.match(js, /JSON\.stringify\(\{\s*sessionId,\s*message:\s*text,\s*browserContext\s*\}\)/);
 });
+
+test('enter key submits composer and shift+enter keeps newline', () => {
+  assert.match(js, /chatInputEl\.addEventListener\('keydown'/);
+  assert.match(js, /if\s*\(\s*event\.key\s*!==\s*'Enter'\s*\|\|\s*event\.shiftKey\s*\)\s*return;/);
+  assert.match(js, /event\.preventDefault\(\);/);
+  assert.match(js, /chatFormEl\.requestSubmit\(\);/);
+});
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
index d0ce6ad..6319aa9 100644
--- a/test/agent/codex-runner.test.js
+++ b/test/agent/codex-runner.test.js
@@ -24,14 +24,14 @@ test('maps final line to chat.final event', () => {
   assert.equal(evt.payload.text, 'done');
 });
 
-test('maps codex item.completed agent_message to chat.final', () => {
+test('maps codex item.completed agent_message to chat.delta (not premature final)', () => {
   const evt = normalizeCodexLine({
     runId: 'r1',
     sessionId: 's1',
     line: '{"type":"item.completed","item":{"type":"agent_message","text":"hello"}}',
   });
-  assert.equal(evt.event, 'chat.final');
-  assert.equal(evt.payload.text, 'hello');
+  assert.equal(evt.event, 'chat.delta');
+  assert.equal(evt.payload.delta, 'hello');
 });
 
 test('buildCodexExecArgs includes --model when session model is set', () => {

From 53370b64d29586e3c31202171b164c46d9664e0c Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 13:04:17 +0530
Subject: [PATCH 118/192] style(extension): apply warm neutral palette across
 panel and popup

---
 extension/agent-panel.css |  98 ++++++++++++++++---------
 extension/options.css     |  80 +++++++++++++-------
 extension/popup.css       | 149 +++++++++++++++++++++++---------------
 3 files changed, 207 insertions(+), 120 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 095ccb9..26f23e2 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -1,14 +1,28 @@
 :root {
   color-scheme: light;
-  --panel-bg: linear-gradient(180deg, #f8f9fc 0%, #eef2f8 100%);
-  --card-bg: #ffffff;
-  --line: #d7ddea;
-  --text: #1f2430;
-  --muted: #5f6878;
-  --accent: #2f7bf6;
-  --menu-bg: rgba(24, 28, 36, 0.94);
-  --menu-line: rgba(255, 255, 255, 0.12);
-  --menu-text: #f4f6fb;
+  --bf-crail: #C15F3C;
+  --bf-cloudy: #B1ADA1;
+  --bf-pampas: #F4F3EE;
+  --bf-white: #FFFFFF;
+
+  --panel-bg: linear-gradient(180deg, var(--bf-pampas) 0%, #ECE6DB 100%);
+  --card-bg: var(--bf-white);
+  --line: #D8D3C9;
+  --line-soft: #E9E4DA;
+  --text: #3D3028;
+  --muted: #756F63;
+  --text-subtle: #8C857A;
+  --accent: var(--bf-crail);
+  --accent-hover: #B05535;
+  --accent-press: #9F4D30;
+  --accent-soft: #E9D3CB;
+  --accent-soft-text: #7A3D27;
+  --status-ok: var(--bf-crail);
+  --status-error: #8A3D24;
+  --status-error-strong: #B25334;
+  --menu-bg: rgba(51, 41, 34, 0.94);
+  --menu-line: rgba(255, 255, 255, 0.16);
+  --menu-text: var(--bf-pampas);
 }
 
 * {
@@ -33,7 +47,7 @@ body {
 .agent-header {
   padding: 10px 12px 8px;
   border-bottom: 1px solid var(--line);
-  background: linear-gradient(180deg, #fff, #f9fbff);
+  background: linear-gradient(180deg, var(--bf-white), var(--bf-pampas));
 }
 
 .header-row {
@@ -47,8 +61,8 @@ body {
   min-height: 30px;
   border: 1px solid var(--line);
   border-radius: 999px;
-  background: #fff;
-  color: #2b3242;
+  background: var(--card-bg);
+  color: var(--text);
   font-size: 12px;
   text-align: left;
   padding: 0 10px;
@@ -62,8 +76,8 @@ body {
   min-height: 30px;
   border-radius: 8px;
   border: 1px solid var(--line);
-  background: #fff;
-  color: #2d3342;
+  background: var(--card-bg);
+  color: var(--text);
   font-size: 20px;
   line-height: 1;
   padding: 0;
@@ -80,15 +94,15 @@ body {
 
 .status-icon {
   font-size: 10px;
-  color: #22a06b;
+  color: var(--status-ok);
 }
 
 .status.error {
-  color: #8f2836;
+  color: var(--status-error);
 }
 
 .status.error .status-icon {
-  color: #d14357;
+  color: var(--status-error-strong);
 }
 
 .chat {
@@ -124,19 +138,19 @@ body {
 }
 
 .message.user {
-  background: #10131a;
-  color: #f3f6fc;
-  border: 1px solid #0c1018;
+  background: var(--accent-press);
+  color: var(--bf-white);
+  border: 1px solid var(--accent-hover);
   border-radius: 16px;
   padding: 10px 14px;
   max-width: min(85%, 520px);
-  box-shadow: 0 4px 12px rgba(0, 0, 0, 0.18);
+  box-shadow: 0 4px 12px rgba(97, 53, 37, 0.28);
 }
 
 .message.assistant {
   background: transparent;
   border: 0;
-  color: #1f2430;
+  color: var(--text);
   padding: 0;
   max-width: 100%;
 }
@@ -151,20 +165,20 @@ body {
 .run-steps-trigger {
   all: unset;
   cursor: pointer;
-  color: #646f84;
+  color: var(--text-subtle);
   font-size: 13px;
   font-weight: 500;
 }
 
 .run-steps-trigger:hover {
-  color: #2f7bf6;
+  color: var(--accent);
 }
 
 .run-steps-list {
   list-style: none;
   margin: 0;
   padding: 0 0 0 10px;
-  border-left: 1px solid #d8dfed;
+  border-left: 1px solid var(--line-soft);
   display: grid;
   gap: 8px;
 }
@@ -173,7 +187,7 @@ body {
   display: flex;
   align-items: flex-start;
   gap: 8px;
-  color: #4e586d;
+  color: var(--muted);
 }
 
 .run-step-label {
@@ -185,7 +199,7 @@ body {
   height: 16px;
   margin-top: 2px;
   flex: 0 0 16px;
-  color: #97a2b8;
+  color: var(--text-subtle);
   position: relative;
 }
 
@@ -307,11 +321,11 @@ body {
 }
 
 .run-step.done .run-step-icon {
-  color: #1f9d63;
+  color: var(--status-ok);
 }
 
 .run-step.failed .run-step-icon {
-  color: #d14357;
+  color: var(--status-error-strong);
 }
 
 .composer {
@@ -319,7 +333,7 @@ body {
   padding: 10px;
   display: grid;
   gap: 8px;
-  background: #fff;
+  background: var(--card-bg);
 }
 
 .composer textarea {
@@ -331,6 +345,8 @@ body {
   border: 1px solid var(--line);
   border-radius: 8px;
   font: inherit;
+  color: var(--text);
+  background: var(--card-bg);
 }
 
 .composer-actions {
@@ -343,20 +359,32 @@ button {
   border: 0;
   border-radius: 8px;
   background: var(--accent);
-  color: #fff;
+  color: var(--bf-white);
   padding: 8px 12px;
   cursor: pointer;
 }
 
+button:hover {
+  background: var(--accent-hover);
+}
+
+button:active {
+  background: var(--accent-press);
+}
+
 button.secondary {
-  background: #e7edf8;
-  color: #1c3f7e;
+  background: var(--accent-soft);
+  color: var(--accent-soft-text);
+}
+
+button.secondary:hover {
+  background: #E2C8BE;
 }
 
 .menu-backdrop {
   position: absolute;
   inset: 0;
-  background: rgba(0, 0, 0, 0.15);
+  background: rgba(0, 0, 0, 0.14);
   z-index: 20;
 }
 
@@ -368,7 +396,7 @@ button.secondary {
   border-radius: 14px;
   background: var(--menu-bg);
   border: 1px solid var(--menu-line);
-  box-shadow: 0 18px 36px rgba(0, 0, 0, 0.28);
+  box-shadow: 0 18px 36px rgba(66, 49, 39, 0.32);
   backdrop-filter: blur(14px);
   z-index: 21;
   max-height: min(360px, calc(100vh - 70px));
diff --git a/extension/options.css b/extension/options.css
index b0a3721..1adfc46 100644
--- a/extension/options.css
+++ b/extension/options.css
@@ -1,3 +1,27 @@
+:root {
+  --bf-crail: #C15F3C;
+  --bf-cloudy: #B1ADA1;
+  --bf-pampas: #F4F3EE;
+  --bf-white: #FFFFFF;
+
+  --bf-page-bg: var(--bf-pampas);
+  --bf-surface: var(--bf-white);
+  --bf-text: #3D3028;
+  --bf-text-muted: #756F63;
+  --bf-text-subtle: #8C857A;
+  --bf-border: #D8D3C9;
+  --bf-border-soft: #E9E4DA;
+  --bf-accent: var(--bf-crail);
+  --bf-accent-hover: #B05535;
+  --bf-ghost-bg-hover: #ECE6DB;
+  --bf-row-hover: #EEE8DD;
+  --bf-row-active: #E6DFD3;
+  --bf-table-head-bg: #F0EBE1;
+  --bf-error-bg: #F4E5E0;
+  --bf-error-border: #E3B9AA;
+  --bf-error-text: #8A3D24;
+}
+
 * {
   box-sizing: border-box;
 }
@@ -5,8 +29,8 @@
 body {
   margin: 0;
   font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, sans-serif;
-  background: #f5f7fb;
-  color: #1c2333;
+  background: var(--bf-page-bg);
+  color: var(--bf-text);
 }
 
 .layout {
@@ -33,13 +57,13 @@ h2 {
   font-size: 14px;
   text-transform: uppercase;
   letter-spacing: 0.04em;
-  color: #4c5770;
+  color: var(--bf-text-muted);
 }
 
 .subtitle {
   margin: 6px 0 0;
   font-size: 13px;
-  color: #5f6d8a;
+  color: var(--bf-text-subtle);
 }
 
 .controls {
@@ -48,10 +72,10 @@ h2 {
 }
 
 button {
-  border: 1px solid #2f6dff;
+  border: 1px solid var(--bf-accent);
   border-radius: 8px;
-  background: #2f6dff;
-  color: #fff;
+  background: var(--bf-accent);
+  color: var(--bf-white);
   height: 36px;
   padding: 0 14px;
   font-size: 13px;
@@ -59,17 +83,17 @@ button {
 }
 
 button:hover {
-  background: #1f5cff;
+  background: var(--bf-accent-hover);
 }
 
 button.ghost {
-  border-color: #c5ccda;
-  background: #fff;
-  color: #27314a;
+  border-color: var(--bf-border);
+  background: var(--bf-surface);
+  color: var(--bf-text);
 }
 
 button.ghost:hover {
-  background: #f2f4f9;
+  background: var(--bf-ghost-bg-hover);
 }
 
 .cards {
@@ -80,10 +104,10 @@ button.ghost:hover {
 }
 
 .card {
-  border: 1px solid #d7ddea;
+  border: 1px solid var(--bf-border);
   border-radius: 10px;
   padding: 12px;
-  background: #fff;
+  background: var(--bf-surface);
   min-height: 90px;
 }
 
@@ -108,18 +132,18 @@ button.ghost:hover {
   display: flex;
   gap: 6px;
   padding: 10px 12px;
-  border: 1px solid #d7ddea;
+  border: 1px solid var(--bf-border);
   border-radius: 10px;
-  background: #fff;
+  background: var(--bf-surface);
   margin-bottom: 12px;
   font-size: 13px;
 }
 
 .error {
   margin: 0 0 12px;
-  border: 1px solid #f2bcc1;
-  background: #fff5f6;
-  color: #8a1d2f;
+  border: 1px solid var(--bf-error-border);
+  background: var(--bf-error-bg);
+  color: var(--bf-error-text);
   padding: 10px 12px;
   border-radius: 8px;
   font-size: 13px;
@@ -127,9 +151,9 @@ button.ghost:hover {
 
 .logs-panel,
 .details-panel {
-  border: 1px solid #d7ddea;
+  border: 1px solid var(--bf-border);
   border-radius: 10px;
-  background: #fff;
+  background: var(--bf-surface);
   margin-bottom: 12px;
 }
 
@@ -138,7 +162,7 @@ button.ghost:hover {
   align-items: center;
   justify-content: space-between;
   padding: 12px;
-  border-bottom: 1px solid #e6ebf5;
+  border-bottom: 1px solid var(--bf-border-soft);
 }
 
 .logs-header h2 {
@@ -160,16 +184,16 @@ td {
   font-size: 12px;
   text-align: left;
   padding: 8px 10px;
-  border-bottom: 1px solid #edf1f8;
+  border-bottom: 1px solid var(--bf-border-soft);
   vertical-align: top;
 }
 
 th {
   position: sticky;
   top: 0;
-  background: #f8faff;
+  background: var(--bf-table-head-bg);
   z-index: 1;
-  color: #4c5770;
+  color: var(--bf-text-muted);
   font-weight: 600;
 }
 
@@ -178,15 +202,15 @@ tr.clickable {
 }
 
 tr.clickable:hover {
-  background: #f7f9ff;
+  background: var(--bf-row-hover);
 }
 
 tr.active {
-  background: #eef3ff;
+  background: var(--bf-row-active);
 }
 
 .empty {
-  color: #74809b;
+  color: var(--bf-text-subtle);
   text-align: center;
 }
 
diff --git a/extension/popup.css b/extension/popup.css
index f26a11b..62bcc57 100644
--- a/extension/popup.css
+++ b/extension/popup.css
@@ -1,3 +1,32 @@
+:root {
+  --bf-crail: #C15F3C;
+  --bf-cloudy: #B1ADA1;
+  --bf-pampas: #F4F3EE;
+  --bf-white: #FFFFFF;
+
+  --bf-bg: var(--bf-white);
+  --bf-surface: var(--bf-white);
+  --bf-surface-soft: var(--bf-pampas);
+  --bf-text: #3D3028;
+  --bf-text-muted: #756F63;
+  --bf-text-subtle: #8C857A;
+  --bf-border: #D8D3C9;
+  --bf-border-soft: #E9E4DA;
+  --bf-border-strong: #C7C0B3;
+  --bf-accent: var(--bf-crail);
+  --bf-accent-hover: #B05535;
+  --bf-accent-press: #9F4D30;
+  --bf-status-connected: var(--bf-crail);
+  --bf-status-connecting: var(--bf-cloudy);
+  --bf-status-disconnected: #CFCBBF;
+  --bf-danger-bg: #F4E5E0;
+  --bf-danger-fg: #8A3D24;
+  --bf-danger-bg-press: #EED7D0;
+  --bf-surface-soft-hover: #EDE7DC;
+  --bf-surface-soft-press: #E4DED3;
+  --bf-surface-press: #ECE6DB;
+}
+
 * {
   margin: 0;
   padding: 0;
@@ -7,8 +36,8 @@
 body {
   font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, sans-serif;
   font-size: 13px;
-  color: #1a1a1a;
-  background: #fff;
+  color: var(--bf-text);
+  background: var(--bf-bg);
 }
 
 .bf-popup {
@@ -17,7 +46,7 @@ body {
 }
 
 .bf-popup.auto-mode {
-  border: 2px dotted #d32f2f;
+  border: 2px dotted var(--bf-accent);
   border-radius: 10px;
 }
 
@@ -51,20 +80,21 @@ h1 {
   width: 8px;
   height: 8px;
   border-radius: 50%;
-  background: #9e9e9e;
+  background: var(--bf-status-disconnected);
 }
 
-.status.connected .dot { background: #4caf50; }
-.status.connecting .dot { background: #ff9800; animation: pulse 1s infinite; }
-.status.disconnected .dot { background: #9e9e9e; }
+.status.connected .dot { background: var(--bf-status-connected); }
+.status.connecting .dot { background: var(--bf-status-connecting); animation: pulse 1s infinite; }
+.status.disconnected .dot { background: var(--bf-status-disconnected); }
 
 .mcp-count {
   font-size: 11px;
   font-weight: 600;
-  color: #4a4a4a;
-  background: #f1f1f1;
+  color: var(--bf-text);
+  background: var(--bf-surface-soft);
   border-radius: 10px;
   padding: 2px 8px;
+  border: 1px solid var(--bf-border-soft);
 }
 
 @keyframes pulse {
@@ -75,7 +105,7 @@ h1 {
 /* Tab Navigation */
 .tab-nav {
   display: flex;
-  border-bottom: 2px solid #eee;
+  border-bottom: 2px solid var(--bf-border-soft);
   margin-bottom: 14px;
   gap: 0;
 }
@@ -87,16 +117,16 @@ h1 {
   background: none;
   font-size: 12px;
   font-weight: 600;
-  color: #999;
+  color: var(--bf-text-subtle);
   cursor: pointer;
   border-bottom: 2px solid transparent;
   margin-bottom: -2px;
   border-radius: 0;
 }
 
-.tab-btn:hover { color: #666; background: none; }
+.tab-btn:hover { color: var(--bf-text-muted); background: none; }
 .tab-btn:active { background: none; }
-.tab-btn.active { color: #4caf50; border-bottom-color: #4caf50; }
+.tab-btn.active { color: var(--bf-accent); border-bottom-color: var(--bf-accent); }
 
 .tab-panel { display: none; }
 .tab-panel.active { display: block; }
@@ -114,16 +144,17 @@ h1 {
   font-weight: 600;
   text-transform: uppercase;
   letter-spacing: 0.04em;
-  color: #666;
+  color: var(--bf-text-muted);
   margin-bottom: 6px;
 }
 
 .badge {
-  background: #e0e0e0;
-  color: #555;
+  background: var(--bf-surface-soft);
+  color: var(--bf-text-muted);
   font-size: 10px;
   padding: 1px 6px;
   border-radius: 10px;
+  border: 1px solid var(--bf-border-soft);
 }
 
 .input-row {
@@ -134,48 +165,50 @@ h1 {
 input[type="text"] {
   flex: 1;
   padding: 6px 10px;
-  border: 1px solid #ddd;
+  border: 1px solid var(--bf-border);
   border-radius: 6px;
   font-size: 12px;
   outline: none;
+  color: var(--bf-text);
+  background: var(--bf-surface);
 }
 
 input[type="text"]:focus {
-  border-color: #4caf50;
+  border-color: var(--bf-accent);
 }
 
 button {
   padding: 6px 14px;
   border: none;
   border-radius: 6px;
-  background: #4caf50;
-  color: #fff;
+  background: var(--bf-accent);
+  color: var(--bf-white);
   font-size: 12px;
   font-weight: 500;
   cursor: pointer;
 }
 
-button:hover { background: #43a047; }
-button:active { background: #388e3c; }
+button:hover { background: var(--bf-accent-hover); }
+button:active { background: var(--bf-accent-press); }
 
 /* Tabs list */
 .tabs-list {
   max-height: 200px;
   overflow-y: auto;
-  border: 1px solid #eee;
+  border: 1px solid var(--bf-border-soft);
   border-radius: 6px;
 }
 
 .tabs-list .empty {
   padding: 12px;
   text-align: center;
-  color: #999;
+  color: var(--bf-text-subtle);
   font-size: 12px;
 }
 
 .tab-item {
   padding: 8px 10px;
-  border-bottom: 1px solid #f5f5f5;
+  border-bottom: 1px solid var(--bf-border-soft);
   overflow: hidden;
 }
 
@@ -204,7 +237,7 @@ button:active { background: #388e3c; }
   border: none;
   border-radius: 4px;
   background: transparent;
-  color: #999;
+  color: var(--bf-text-subtle);
   font-size: 14px;
   line-height: 20px;
   text-align: center;
@@ -212,17 +245,17 @@ button:active { background: #388e3c; }
 }
 
 .detach-btn:hover {
-  background: #fee;
-  color: #d32f2f;
+  background: var(--bf-danger-bg);
+  color: var(--bf-danger-fg);
 }
 
 .detach-btn:active {
-  background: #fdd;
+  background: var(--bf-danger-bg-press);
 }
 
 .tab-item .tab-url {
   font-size: 11px;
-  color: #888;
+  color: var(--bf-text-subtle);
   white-space: nowrap;
   overflow: hidden;
   text-overflow: ellipsis;
@@ -234,7 +267,7 @@ button:active { background: #388e3c; }
   font-size: 11px;
   font-weight: 500;
   font-variant-numeric: tabular-nums;
-  color: #999;
+  color: var(--bf-text-subtle);
   text-transform: none;
   letter-spacing: normal;
 }
@@ -245,54 +278,54 @@ button:active { background: #388e3c; }
 .attach-btn {
   width: 100%;
   padding: 10px;
-  background: #f5f5f5;
-  color: #333;
+  background: var(--bf-surface-soft);
+  color: var(--bf-text);
   font-size: 12px;
   font-weight: 500;
-  border: 1px dashed #ccc;
+  border: 1px dashed var(--bf-border-strong);
   border-radius: 6px;
   cursor: pointer;
 }
 
-.attach-btn:hover { background: #eee; border-color: #aaa; }
-.attach-btn:active { background: #e0e0e0; }
+.attach-btn:hover { background: var(--bf-surface-soft-hover); border-color: var(--bf-cloudy); }
+.attach-btn:active { background: var(--bf-surface-soft-press); }
 
 .agent-btn {
   width: 100%;
   margin-top: 8px;
   margin-bottom: 8px;
   padding: 10px;
-  background: #2f7bf6;
-  color: #fff;
+  background: var(--bf-accent);
+  color: var(--bf-white);
   border-radius: 6px;
 }
 
-.agent-btn:hover { background: #1d66d9; }
-.agent-btn:active { background: #1757b8; }
+.agent-btn:hover { background: var(--bf-accent-hover); }
+.agent-btn:active { background: var(--bf-accent-press); }
 
 .logs-btn {
   width: 100%;
   margin-top: 8px;
   padding: 9px;
-  background: #fff;
-  color: #333;
+  background: var(--bf-surface);
+  color: var(--bf-text);
   font-size: 12px;
   font-weight: 500;
-  border: 1px solid #ddd;
+  border: 1px solid var(--bf-border);
   border-radius: 6px;
 }
 
 .logs-btn:hover {
-  background: #f7f7f7;
+  background: var(--bf-surface-soft);
 }
 
 .logs-btn:active {
-  background: #efefef;
+  background: var(--bf-surface-press);
 }
 
 /* Settings groups */
 .settings-group {
-  border: 1px solid #eee;
+  border: 1px solid var(--bf-border-soft);
   border-radius: 6px;
   padding: 8px 10px;
 }
@@ -305,14 +338,14 @@ button:active { background: #388e3c; }
 }
 
 .setting-row + .setting-row {
-  border-top: 1px solid #f5f5f5;
+  border-top: 1px solid var(--bf-border-soft);
   padding-top: 8px;
   margin-top: 4px;
 }
 
 .setting-label {
   font-size: 12px;
-  color: #333;
+  color: var(--bf-text);
 }
 
 /* Checkbox rows */
@@ -326,28 +359,28 @@ button:active { background: #388e3c; }
   font-weight: normal;
   text-transform: none;
   letter-spacing: normal;
-  color: #333;
+  color: var(--bf-text);
 }
 
 .checkbox-row + .checkbox-row {
-  border-top: 1px solid #f5f5f5;
+  border-top: 1px solid var(--bf-border-soft);
 }
 
 .checkbox-row input[type="checkbox"] {
   width: 14px;
   height: 14px;
-  accent-color: #4caf50;
+  accent-color: var(--bf-accent);
   cursor: pointer;
 }
 
 /* Select + textarea */
 select {
   padding: 4px 8px;
-  border: 1px solid #ddd;
+  border: 1px solid var(--bf-border);
   border-radius: 4px;
   font-size: 12px;
-  background: #fff;
-  color: #333;
+  background: var(--bf-surface);
+  color: var(--bf-text);
   cursor: pointer;
   outline: none;
 }
@@ -359,21 +392,23 @@ select.full-width {
 }
 
 select:focus {
-  border-color: #4caf50;
+  border-color: var(--bf-accent);
 }
 
 textarea {
   width: 100%;
   padding: 8px 10px;
-  border: 1px solid #ddd;
+  border: 1px solid var(--bf-border);
   border-radius: 6px;
   font-size: 12px;
   font-family: inherit;
   resize: vertical;
   outline: none;
   min-height: 60px;
+  color: var(--bf-text);
+  background: var(--bf-surface);
 }
 
 textarea:focus {
-  border-color: #4caf50;
+  border-color: var(--bf-accent);
 }

From 585d879a8b8fc16d29480ce8cb2c07808ad6de0c Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 13:11:48 +0530
Subject: [PATCH 119/192] Update extension background and styles

---
 AGENTS.md               | 12 ++++++++++++
 extension/background.js | 14 ++++++++++----
 2 files changed, 22 insertions(+), 4 deletions(-)

diff --git a/AGENTS.md b/AGENTS.md
index 9ba081d..fedbba4 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -169,6 +169,16 @@ In `single-active`, contention returns HTTP `409 Conflict` for additional `/cdp`
 
 MCP handles `409`/busy connect errors by entering standby and polling `GET /client-slot` with short jittered intervals (~200-400ms), then reconnecting when `busy: false` (up to a 30s connect timeout).
 
+### BrowserForce Agent Session Identity (No Fixed ID)
+
+For side-panel chat UX, **never hardcode or assume a fixed `sessionId`**.
+
+- Sessions are user-selectable conversation threads (ChatGPT/Atlas style).
+- The UI must list prior sessions and let the user resume any session.
+- New chats must create a new generated session ID (UUID/ULID), then persist metadata + transcript.
+- Streaming channels (`/events`) must be scoped by explicit selected `sessionId`.
+- Do not infer continuity from "current Codex turn/session" alone; BrowserForce Agent keeps its own session store.
+
 ## Security Rules
 
 - Relay binds to `127.0.0.1` ONLY. Never `0.0.0.0`.
@@ -247,3 +257,5 @@ Run with: `node --test relay/test/relay-server.test.js` and `node --test mcp/tes
 5. **Relay port collision**: Default port 19222. If tests fail with EADDRINUSE, kill stale processes: `lsof -ti:19222 | xargs kill -9`.
 
 6. **Test writeCdpUrl**: Never call `relay.start()` in tests without `{ writeCdpUrl: false }` — it overwrites the production cdp-url file.
+
+7. **No fixed chat session IDs**: BrowserForce Agent chat must always use explicit user-selected/generated session IDs and persisted session history. Never bind side-panel chat to a single hardcoded ID.
diff --git a/extension/background.js b/extension/background.js
index adf8d50..162364c 100644
--- a/extension/background.js
+++ b/extension/background.js
@@ -5,6 +5,12 @@ const RELAY_URL_DEFAULT = 'ws://127.0.0.1:19222/extension';
 const RECONNECT_DELAY_MS = 3000;
 const CDP_VERSION = '1.3';
 const RELAY_HTTP_DEFAULT = 'http://127.0.0.1:19222';
+const TAB_GROUP_COLOR = 'orange';
+const BADGE_COLORS = {
+  connected: '#C15F3C',
+  connecting: '#B1ADA1',
+  disconnected: '#B1ADA1',
+};
 
 // ─── State ───────────────────────────────────────────────────────────────────
 
@@ -732,7 +738,7 @@ async function syncTabGroup() {
 
     // Always ensure group title/color are correct
     if (groupId !== undefined) {
-      await chrome.tabGroups.update(groupId, { title: 'browserforce', color: 'cyan' });
+      await chrome.tabGroups.update(groupId, { title: 'browserforce', color: TAB_GROUP_COLOR });
     }
   } catch (e) {
     console.warn('[bf] syncTabGroup error:', e.message);
@@ -756,13 +762,13 @@ function updateBadge() {
 
   if (connectionState === 'connected') {
     chrome.action.setBadgeText({ text: count > 0 ? String(count) : 'ON' });
-    chrome.action.setBadgeBackgroundColor({ color: '#4CAF50' });
+    chrome.action.setBadgeBackgroundColor({ color: BADGE_COLORS.connected });
   } else if (connectionState === 'connecting') {
     chrome.action.setBadgeText({ text: '...' });
-    chrome.action.setBadgeBackgroundColor({ color: '#FF9800' });
+    chrome.action.setBadgeBackgroundColor({ color: BADGE_COLORS.connecting });
   } else {
     chrome.action.setBadgeText({ text: '' });
-    chrome.action.setBadgeBackgroundColor({ color: '#9E9E9E' });
+    chrome.action.setBadgeBackgroundColor({ color: BADGE_COLORS.disconnected });
   }
 }
 

From f0fb5477d6febc408eb4a779b0bbc525c02fec86 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 14:00:54 +0530
Subject: [PATCH 120/192] Simplify AGENTS local include setup

---
 .gitignore | 3 ++-
 AGENTS.md  | 9 ++-------
 2 files changed, 4 insertions(+), 8 deletions(-)

diff --git a/.gitignore b/.gitignore
index 74c7dfb..943e95b 100644
--- a/.gitignore
+++ b/.gitignore
@@ -8,4 +8,5 @@ node_modules/
 pnpm-debug.log*
 .worktrees/
 docs/plans/*
-.superset
\ No newline at end of file
+.superset
+AGENTS.local.md
diff --git a/AGENTS.md b/AGENTS.md
index fedbba4..5b44d22 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -1,13 +1,8 @@
 # BrowserForce — Agent Guidelines
 
-## Playwriter Reference
+## Local Private Overrides
 
-**Before writing any new code, always check how [playwriter](../playwriter) solves the same problem.** Playwriter is the reference implementation for a browser extension + CDP relay + MCP server stack. It lives at `~/Documents/projects/playwriter`.
-
-Rules:
-- **Don't reinvent what playwriter already solved.** Read the relevant playwriter source file first.
-- **Only add code for new requirements or problems playwriter hasn't already solved.**
-- Reference files: `playwriter/src/cdp-relay.ts`, `playwriter/src/executor.ts`, `playwriter/src/mcp.ts`, `playwriter/src/relay-client.ts`
+@AGENTS.local.md
 
 ## Project Overview
 

From 0b655965354609619b990ef3195e07b125e647ab Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 14:16:59 +0530
Subject: [PATCH 121/192] feat(sidepanel): show tab attach banner and
 auto-refresh on tab switch

---
 agent/src/chatd.js                           |   2 +
 extension/agent-panel.css                    |  30 ++++
 extension/agent-panel.html                   |   4 +
 extension/agent-panel.js                     | 138 ++++++++++++++++++-
 test/agent/agent-panel-contract.test.js      |   3 +
 test/agent/agent-panel-send-contract.test.js |   3 +
 test/agent/chatd-api.test.js                 |   3 +-
 7 files changed, 175 insertions(+), 8 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index cb52d94..cd1164f 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -257,6 +257,8 @@ function buildRunPrompt({ message, browserContext }) {
   if (browserContext.tabId != null) lines.push(`- Active tab id: ${browserContext.tabId}`);
   if (browserContext.title) lines.push(`- Active tab title: ${browserContext.title}`);
   if (browserContext.url) lines.push(`- Active tab URL: ${browserContext.url}`);
+  lines.push('Inspect the active page and answer directly when the user asks about what is on this tab.');
+  lines.push('Do not ask for permission to inspect the active page.');
   lines.push('Assume the user is referring to this active tab unless they explicitly say otherwise.');
   lines.push('If the request is ambiguous or you are not sure, ask the user a clarifying question before acting.');
   lines.push('');
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 26f23e2..231c2f7 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -105,6 +105,36 @@ body {
   color: var(--status-error-strong);
 }
 
+.tab-attach {
+  margin-top: 8px;
+  min-height: 28px;
+  border: 1px solid var(--line);
+  border-radius: 10px;
+  padding: 6px 8px;
+  display: flex;
+  align-items: center;
+  justify-content: space-between;
+  gap: 8px;
+  background: var(--card-bg);
+  color: var(--muted);
+  font-size: 12px;
+}
+
+.tab-attach-btn {
+  border: 1px solid var(--line);
+  background: var(--accent-soft);
+  color: var(--accent-soft-text);
+  border-radius: 8px;
+  padding: 4px 8px;
+  font-size: 12px;
+  line-height: 1.2;
+}
+
+.tab-attach-btn:disabled {
+  opacity: 0.6;
+  cursor: default;
+}
+
 .chat {
   min-height: 0;
   display: grid;
diff --git a/extension/agent-panel.html b/extension/agent-panel.html
index b53408e..70b90ae 100644
--- a/extension/agent-panel.html
+++ b/extension/agent-panel.html
@@ -18,6 +18,10 @@
         <span id="bf-agent-status-icon" class="status-icon" aria-hidden="true">●</span>
         <span id="bf-agent-status-text">Starting...</span>
       </p>
+      <div id="bf-tab-attach-banner" class="tab-attach hidden" role="status" aria-live="polite">
+        <span id="bf-tab-attach-text">Current tab is not connected</span>
+        <button id="bf-attach-current-tab" type="button" class="tab-attach-btn">Attach current tab</button>
+      </div>
     </header>
 
     <section class="chat">
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index c48fe65..69bca4e 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -35,6 +35,11 @@ const chatFormEl = document.getElementById('bf-chat-form');
 const chatInputEl = document.getElementById('bf-chat-input');
 const stopRunBtn = document.getElementById('bf-stop-run');
 const sendBtn = chatFormEl.querySelector('button[type="submit"]');
+const tabAttachBannerEl = document.getElementById('bf-tab-attach-banner');
+const tabAttachTextEl = document.getElementById('bf-tab-attach-text');
+const attachCurrentTabBtn = document.getElementById('bf-attach-current-tab');
+let tabAttachRefreshTimer = null;
+let tabAttachRefreshToken = 0;
 
 function setStatus(kind, text) {
   statusTextEl.textContent = text;
@@ -48,6 +53,20 @@ function setComposerEnabled(enabled) {
   sendBtn.disabled = !enabled;
 }
 
+function setTabAttachBannerState({
+  hidden = true,
+  text = 'Current tab is not connected',
+  canAttach = false,
+  busy = false,
+} = {}) {
+  if (!tabAttachBannerEl || !tabAttachTextEl || !attachCurrentTabBtn) return;
+  tabAttachBannerEl.classList.toggle('hidden', !!hidden);
+  if (hidden) return;
+  tabAttachTextEl.textContent = text;
+  attachCurrentTabBtn.disabled = busy || !canAttach;
+  attachCurrentTabBtn.textContent = busy ? 'Attaching...' : 'Attach current tab';
+}
+
 function dispatch(action) {
   state.value = reduceState(state.value, action);
   render();
@@ -288,8 +307,99 @@ async function ensureCurrentTabAttached() {
     if (response?.error && !isIgnoredAttachError(response.error)) {
       console.warn('[bf-agent] attachCurrentTab failed:', response.error);
     }
+    return response || null;
   } catch {
     // best-effort only
+    return null;
+  }
+}
+
+function isTabAttachableUrl(url) {
+  const value = String(url || '').trim();
+  if (!value) return false;
+  return !(
+    value.startsWith('chrome://')
+    || value.startsWith('chrome-extension://')
+    || value.startsWith('edge://')
+    || value.startsWith('devtools://')
+  );
+}
+
+async function getCurrentTabAttachmentState() {
+  if (!chrome?.tabs?.query) return { hidden: true };
+  let tab = null;
+  try {
+    [tab] = await chrome.tabs.query({ active: true, currentWindow: true });
+  } catch {
+    return { hidden: true };
+  }
+  if (!tab || typeof tab.id !== 'number') return { hidden: true };
+  if (!isTabAttachableUrl(tab.url)) {
+    return {
+      hidden: false,
+      text: 'This tab cannot be attached',
+      canAttach: false,
+    };
+  }
+
+  try {
+    const status = await runtimeMessage({ type: 'getStatus' });
+    if (status?.connectionState && status.connectionState !== 'connected') {
+      return {
+        hidden: false,
+        text: 'Relay disconnected',
+        canAttach: false,
+      };
+    }
+
+    const attachedTabs = Array.isArray(status?.tabs) ? status.tabs : [];
+    const attached = attachedTabs.some((item) => Number(item?.tabId) === tab.id);
+    if (attached) return { hidden: true };
+    return {
+      hidden: false,
+      text: 'Current tab is not connected',
+      canAttach: true,
+    };
+  } catch {
+    return {
+      hidden: false,
+      text: 'Unable to check tab connection',
+      canAttach: false,
+    };
+  }
+}
+
+async function refreshTabAttachBanner() {
+  const token = ++tabAttachRefreshToken;
+  const next = await getCurrentTabAttachmentState();
+  if (token !== tabAttachRefreshToken) return;
+  setTabAttachBannerState(next);
+}
+
+function scheduleTabAttachRefresh(delayMs = 0) {
+  if (tabAttachRefreshTimer) clearTimeout(tabAttachRefreshTimer);
+  tabAttachRefreshTimer = setTimeout(() => {
+    refreshTabAttachBanner().catch(() => {});
+  }, delayMs);
+}
+
+function bindTabAttachWatchers() {
+  if (chrome?.tabs?.onActivated?.addListener) {
+    chrome.tabs.onActivated.addListener(() => {
+      scheduleTabAttachRefresh(40);
+    });
+  }
+  if (chrome?.tabs?.onUpdated?.addListener) {
+    chrome.tabs.onUpdated.addListener((_tabId, changeInfo, tab) => {
+      if (!tab?.active) return;
+      if (!('status' in changeInfo) && !('url' in changeInfo) && !('title' in changeInfo)) return;
+      scheduleTabAttachRefresh(80);
+    });
+  }
+  if (chrome?.windows?.onFocusChanged?.addListener) {
+    chrome.windows.onFocusChanged.addListener(() => {
+      scheduleTabAttachRefresh(80);
+    });
   }
 }
 
@@ -300,13 +410,7 @@ async function getActiveTabContext() {
     if (!tab || typeof tab.id !== 'number') return null;
     const title = String(tab.title || '').trim().slice(0, 180);
     const url = String(tab.url || '').trim();
-    if (
-      !url
-      || url.startsWith('chrome://')
-      || url.startsWith('chrome-extension://')
-      || url.startsWith('edge://')
-      || url.startsWith('devtools://')
-    ) {
+    if (!isTabAttachableUrl(url)) {
       return { tabId: tab.id, title, url: null };
     }
     return { tabId: tab.id, title, url: url.slice(0, 500) };
@@ -528,6 +632,7 @@ async function sendMessage(text) {
   dispatch({ type: 'messages.loaded', sessionId, messages: [...existing, { role: 'user', text }] });
 
   await ensureCurrentTabAttached();
+  scheduleTabAttachRefresh(0);
   const browserContext = await getActiveTabContext();
 
   const res = await api('/v1/runs', {
@@ -574,6 +679,22 @@ chatInputEl.addEventListener('keydown', (event) => {
   chatFormEl.requestSubmit();
 });
 
+if (attachCurrentTabBtn) {
+  attachCurrentTabBtn.addEventListener('click', async () => {
+    setTabAttachBannerState({
+      hidden: false,
+      text: tabAttachTextEl?.textContent || 'Current tab is not connected',
+      canAttach: false,
+      busy: true,
+    });
+    const response = await ensureCurrentTabAttached();
+    if (response?.error && !isIgnoredAttachError(response.error)) {
+      setStatus('error', response.error || 'Unable to attach current tab');
+    }
+    scheduleTabAttachRefresh(0);
+  });
+}
+
 newSessionBtn.addEventListener('click', () => {
   createSession()
     .then(() => setPopover('none'))
@@ -601,6 +722,7 @@ popoverBackdropEl.addEventListener('click', () => {
     setStatus('info', 'Connecting...');
     await loadAuth();
     await ensureCurrentTabAttached();
+    bindTabAttachWatchers();
     try {
       await loadModelPresets();
     } catch {
@@ -613,9 +735,11 @@ popoverBackdropEl.addEventListener('click', () => {
       await selectSession(state.value.activeSessionId);
     }
     setComposerEnabled(true);
+    scheduleTabAttachRefresh(0);
     setStatus('ready', 'Ready');
   } catch {
     setComposerEnabled(false);
+    setTabAttachBannerState({ hidden: true });
     setStatus('error', 'Daemon unavailable');
   }
 })();
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index 3739bc3..9b63e6a 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -13,6 +13,9 @@ test('agent panel has inline model and session selectors with popovers', () => {
   assert.match(html, /id="bf-session-panel"/);
   assert.match(html, /id="bf-model-list"/);
   assert.match(html, /id="bf-switch-session-list"/);
+  assert.match(html, /id="bf-tab-attach-banner"/);
+  assert.match(html, /id="bf-tab-attach-text"/);
+  assert.match(html, /id="bf-attach-current-tab"/);
 });
 
 test('agent panel no longer renders title or persistent session sidebar', () => {
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index faf3ddd..7a0d415 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -19,6 +19,9 @@ test('submit handler preserves draft on send failure', () => {
 test('sidepanel auto-attaches current tab and sends browserContext with runs', () => {
   assert.match(js, /async function ensureCurrentTabAttached\(\)/);
   assert.match(js, /runtimeMessage\(\{\s*type:\s*'attachCurrentTab'\s*\}\)/);
+  assert.match(js, /runtimeMessage\(\{\s*type:\s*'getStatus'\s*\}\)/);
+  assert.match(js, /chrome\.tabs\.onActivated\.addListener/);
+  assert.match(js, /attachCurrentTabBtn\.addEventListener\('click'/);
   assert.match(js, /await ensureCurrentTabAttached\(\);/);
   assert.match(js, /const browserContext = await getActiveTabContext\(\);/);
   assert.match(js, /JSON\.stringify\(\{\s*sessionId,\s*message:\s*text,\s*browserContext\s*\}\)/);
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 704d6d0..7237f2a 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -298,7 +298,8 @@ test('POST /v1/runs includes active tab context in runExecutor prompt', async ()
     const prompt = seenRuns.at(-1)?.message || '';
     assert.match(prompt, /Active tab title: Pricing/);
     assert.match(prompt, /Active tab URL: https:\/\/example\.com\/pricing/);
-    assert.match(prompt, /If the request is ambiguous/i);
+    assert.match(prompt, /inspect the active page and answer directly/i);
+    assert.match(prompt, /do not ask for permission to inspect/i);
     assert.match(prompt, /User request:\s*summarize this page/i);
   } finally {
     await daemon.stop();

From 820ae31b155a14050808b9f845b9bcc033f8f9d5 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 15:10:45 +0530
Subject: [PATCH 122/192] Redesign agent sidepanel UI to new style system

---
 extension/agent-panel.css  | 743 ++++++++++++++++++++++++++-----------
 extension/agent-panel.html |  67 +++-
 extension/agent-panel.js   | 186 ++++++++--
 3 files changed, 739 insertions(+), 257 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 231c2f7..fac41d9 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -1,108 +1,181 @@
 :root {
   color-scheme: light;
-  --bf-crail: #C15F3C;
-  --bf-cloudy: #B1ADA1;
-  --bf-pampas: #F4F3EE;
-  --bf-white: #FFFFFF;
-
-  --panel-bg: linear-gradient(180deg, var(--bf-pampas) 0%, #ECE6DB 100%);
-  --card-bg: var(--bf-white);
-  --line: #D8D3C9;
-  --line-soft: #E9E4DA;
-  --text: #3D3028;
-  --muted: #756F63;
-  --text-subtle: #8C857A;
-  --accent: var(--bf-crail);
-  --accent-hover: #B05535;
-  --accent-press: #9F4D30;
-  --accent-soft: #E9D3CB;
-  --accent-soft-text: #7A3D27;
-  --status-ok: var(--bf-crail);
-  --status-error: #8A3D24;
-  --status-error-strong: #B25334;
-  --menu-bg: rgba(51, 41, 34, 0.94);
-  --menu-line: rgba(255, 255, 255, 0.16);
-  --menu-text: var(--bf-pampas);
+  --crail: #C15F3C;
+  --crail-dark: #A34E30;
+  --crail-press: #8F4228;
+  --crail-soft: #F0DDD6;
+  --pampas: #F4F3EE;
+  --sand: #EAE6DE;
+  --linen: #F9F7F4;
+  --header: #1E2926;
+  --header-border: #2D3B35;
+  --text: #2E2419;
+  --text-muted: #6B6358;
+  --text-subtle: #9B9189;
+  --line: #DDD8CF;
+  --line-soft: #EDE9E2;
+  --ok: #3D8A5E;
+  --error: #B25334;
+  --menu-bg: rgba(26, 36, 33, 0.97);
+  --menu-line: rgba(255, 255, 255, 0.1);
 }
 
 * {
   box-sizing: border-box;
+  margin: 0;
+  padding: 0;
 }
 
 body {
-  margin: 0;
-  font-family: ui-sans-serif, -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif;
+  font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', system-ui, sans-serif;
+  background: var(--pampas);
+  min-height: 100vh;
   color: var(--text);
-  background: var(--panel-bg);
 }
 
 .agent-shell {
   height: 100vh;
-  min-height: 0;
-  display: grid;
-  grid-template-rows: auto 1fr;
+  display: flex;
+  flex-direction: column;
+  overflow: hidden;
   position: relative;
+  background: var(--pampas);
 }
 
 .agent-header {
-  padding: 10px 12px 8px;
-  border-bottom: 1px solid var(--line);
-  background: linear-gradient(180deg, var(--bf-white), var(--bf-pampas));
+  flex-shrink: 0;
+  background: var(--header);
+  border-bottom: 1px solid var(--header-border);
+}
+
+.title-bar {
+  display: flex;
+  align-items: center;
+  justify-content: space-between;
+  padding: 12px 14px 10px;
+}
+
+.brand {
+  display: flex;
+  align-items: center;
+  gap: 8px;
+}
+
+.brand-icon {
+  width: 24px;
+  height: 24px;
+  border-radius: 6px;
+  background: var(--crail);
+  display: flex;
+  align-items: center;
+  justify-content: center;
+  color: #fff;
+  font-weight: 700;
+  font-size: 12px;
+  flex-shrink: 0;
+}
+
+.brand-name {
+  font-size: 13.5px;
+  font-weight: 600;
+  color: #fff;
+  letter-spacing: -0.02em;
 }
 
-.header-row {
+.controls {
   display: flex;
   align-items: center;
   gap: 8px;
+  padding: 0 14px 12px;
 }
 
-.selector-pill {
+.pill-btn {
   flex: 1;
-  min-height: 30px;
-  border: 1px solid var(--line);
+  min-width: 0;
+  height: 32px;
+  display: flex;
+  align-items: center;
+  justify-content: space-between;
+  gap: 6px;
+  padding: 0 12px;
   border-radius: 999px;
-  background: var(--card-bg);
-  color: var(--text);
+  background: rgba(255, 255, 255, 0.1);
+  border: 1px solid rgba(255, 255, 255, 0.14);
+  color: rgba(255, 255, 255, 0.78);
   font-size: 12px;
-  text-align: left;
-  padding: 0 10px;
-  white-space: nowrap;
+  cursor: pointer;
+  transition: background 0.15s, color 0.15s;
+}
+
+.pill-btn:hover {
+  background: rgba(255, 255, 255, 0.15);
+  color: #fff;
+}
+
+.pill-btn span {
   overflow: hidden;
   text-overflow: ellipsis;
+  white-space: nowrap;
 }
 
-.icon-btn {
-  min-width: 32px;
-  min-height: 30px;
-  border-radius: 8px;
-  border: 1px solid var(--line);
-  background: var(--card-bg);
-  color: var(--text);
-  font-size: 20px;
-  line-height: 1;
-  padding: 0;
+.pill-btn svg {
+  width: 12px;
+  height: 12px;
+  opacity: 0.55;
+  flex-shrink: 0;
 }
 
-.status {
-  margin: 8px 0 0;
-  font-size: 12px;
-  color: var(--muted);
+.round-btn {
+  width: 32px;
+  height: 32px;
+  border-radius: 999px;
+  background: rgba(255, 255, 255, 0.1);
+  border: 1px solid rgba(255, 255, 255, 0.14);
+  cursor: pointer;
   display: flex;
   align-items: center;
-  gap: 6px;
+  justify-content: center;
+  color: rgba(255, 255, 255, 0.78);
+  transition: background 0.15s, color 0.15s;
+  flex-shrink: 0;
 }
 
-.status-icon {
-  font-size: 10px;
-  color: var(--status-ok);
+.round-btn:hover {
+  background: rgba(255, 255, 255, 0.16);
+  color: #fff;
+}
+
+.round-btn svg {
+  width: 14px;
+  height: 14px;
+}
+
+.status-circle {
+  width: 32px;
+  height: 32px;
+  flex-shrink: 0;
+  border-radius: 999px;
+  background: rgba(255, 255, 255, 0.1);
+  border: 1px solid rgba(255, 255, 255, 0.14);
+  display: flex;
+  align-items: center;
+  justify-content: center;
+}
+
+.status-dot {
+  width: 8px;
+  height: 8px;
+  border-radius: 50%;
+  background: var(--ok);
 }
 
-.status.error {
-  color: var(--status-error);
+.status-circle.error .status-dot {
+  background: var(--error);
 }
 
-.status.error .status-icon {
-  color: var(--status-error-strong);
+.status-circle.thinking .status-dot {
+  background: #FBBF24;
+  animation: pulse 1.2s ease-in-out infinite;
 }
 
 .tab-attach {
@@ -137,98 +210,202 @@ body {
 
 .chat {
   min-height: 0;
-  display: grid;
-  grid-template-rows: 1fr auto;
+  flex: 1;
+  display: flex;
+  flex-direction: column;
 }
 
 .transcript {
-  overflow: auto;
-  padding: 14px;
+  flex: 1;
+  min-height: 0;
+  overflow-y: auto;
+  padding: 16px;
+  display: flex;
+  flex-direction: column;
+  gap: 18px;
+  scroll-behavior: smooth;
+}
+
+.transcript::-webkit-scrollbar {
+  width: 4px;
+}
+
+.transcript::-webkit-scrollbar-thumb {
+  background: var(--line);
+  border-radius: 4px;
 }
 
-.message-row {
+.empty-state {
   display: flex;
   flex-direction: column;
-  margin-bottom: 10px;
-  gap: 6px;
+  align-items: center;
+  justify-content: center;
+  flex: 1;
+  gap: 12px;
+  text-align: center;
+  padding: 40px 20px;
+}
+
+.empty-icon {
+  width: 40px;
+  height: 40px;
+  border-radius: 12px;
+  background: var(--crail);
+  display: flex;
+  align-items: center;
+  justify-content: center;
+  color: #fff;
+  font-weight: 700;
+  font-size: 18px;
+}
+
+.empty-title {
+  font-size: 14px;
+  font-weight: 500;
+  color: var(--text);
+}
+
+.empty-sub {
+  font-size: 12px;
+  color: var(--text-subtle);
+  line-height: 1.5;
+}
+
+.message {
+  display: flex;
+  flex-direction: column;
+  gap: 4px;
 }
 
-.message-row.user {
+.message.user {
   align-items: flex-end;
 }
 
-.message-row.assistant {
+.message.assistant {
   align-items: flex-start;
 }
 
-.message {
-  font-size: 13px;
+.msg-meta {
+  display: flex;
+  align-items: center;
+  gap: 6px;
+  padding: 0 4px;
+}
+
+.msg-author {
+  font-size: 10px;
+  font-weight: 600;
+  color: var(--text-subtle);
+  text-transform: uppercase;
+  letter-spacing: 0.05em;
+}
+
+.msg-content-wrap {
+  max-width: 96%;
+}
+
+.bubble-user {
+  background: var(--crail);
+  color: #fff;
+  font-size: 13.5px;
+  line-height: 1.55;
+  border-radius: 16px;
+  border-top-right-radius: 4px;
+  padding: 10px 16px;
+  max-width: 82%;
+  box-shadow: 0 2px 8px rgba(193, 95, 60, 0.25);
   white-space: pre-wrap;
-  line-height: 1.45;
 }
 
-.message.user {
-  background: var(--accent-press);
-  color: var(--bf-white);
-  border: 1px solid var(--accent-hover);
+.bubble-assistant {
+  background: #fff;
+  border: 1px solid var(--line-soft);
   border-radius: 16px;
-  padding: 10px 14px;
-  max-width: min(85%, 520px);
-  box-shadow: 0 4px 12px rgba(97, 53, 37, 0.28);
+  border-top-left-radius: 4px;
+  padding: 12px 16px;
+  box-shadow: 0 1px 4px rgba(0, 0, 0, 0.06);
 }
 
-.message.assistant {
-  background: transparent;
-  border: 0;
+.bubble-assistant p {
+  font-size: 13.5px;
+  line-height: 1.6;
   color: var(--text);
-  padding: 0;
-  max-width: 100%;
+  white-space: pre-wrap;
+}
+
+.bubble-assistant code {
+  font-family: 'SF Mono', 'Fira Code', 'Cascadia Code', monospace;
+  font-size: 11.5px;
+  background: var(--sand);
+  color: var(--crail-dark);
+  padding: 2px 6px;
+  border-radius: 5px;
+  border: 1px solid var(--line);
 }
 
 .run-steps-summary {
-  display: inline-flex;
-  flex-direction: column;
-  align-items: flex-start;
-  gap: 8px;
+  margin-bottom: 6px;
 }
 
-.run-steps-trigger {
-  all: unset;
+.steps-toggle {
+  display: inline-flex;
+  align-items: center;
+  gap: 6px;
+  background: transparent;
+  border: 0;
   cursor: pointer;
+  font-size: 12px;
   color: var(--text-subtle);
-  font-size: 13px;
-  font-weight: 500;
+  padding: 0;
+  transition: color 0.15s;
 }
 
-.run-steps-trigger:hover {
-  color: var(--accent);
+.steps-toggle:hover {
+  color: var(--text-muted);
 }
 
-.run-steps-list {
+.steps-toggle svg {
+  width: 12px;
+  height: 12px;
+  transition: transform 0.2s;
+}
+
+.steps-toggle.open svg {
+  transform: rotate(90deg);
+}
+
+.steps-list {
   list-style: none;
-  margin: 0;
-  padding: 0 0 0 10px;
-  border-left: 1px solid var(--line-soft);
-  display: grid;
+  display: none;
+  margin: 8px 0 8px 2px;
+  padding-left: 12px;
+  border-left: 1.5px solid var(--line);
+}
+
+.steps-list.open {
+  display: flex;
+  flex-direction: column;
   gap: 8px;
 }
 
-.run-step {
+.step-item {
   display: flex;
   align-items: flex-start;
   gap: 8px;
-  color: var(--muted);
 }
 
-.run-step-label {
+.step-label {
+  font-size: 11.5px;
+  color: var(--text-muted);
+  line-height: 1.4;
   white-space: pre-wrap;
 }
 
 .run-step-icon {
-  width: 16px;
-  height: 16px;
-  margin-top: 2px;
-  flex: 0 0 16px;
+  width: 13px;
+  height: 13px;
+  flex-shrink: 0;
+  margin-top: 1px;
   color: var(--text-subtle);
   position: relative;
 }
@@ -240,63 +417,63 @@ body {
 }
 
 .run-step-icon.icon-reasoning::before {
-  top: 4px;
-  left: 4px;
-  width: 8px;
-  height: 8px;
+  top: 2px;
+  left: 2px;
+  width: 9px;
+  height: 9px;
   border-radius: 999px;
   background: currentColor;
 }
 
 .run-step-icon.icon-tool::before {
-  top: 2px;
-  left: 2px;
-  width: 12px;
-  height: 12px;
+  top: 1px;
+  left: 1px;
+  width: 10px;
+  height: 10px;
   border: 1.5px solid currentColor;
   border-radius: 3px;
 }
 
 .run-step-icon.icon-view::before {
-  top: 5px;
-  left: 1px;
-  width: 14px;
-  height: 8px;
+  top: 4px;
+  left: 0;
+  width: 12px;
+  height: 6px;
   border: 1.5px solid currentColor;
-  border-radius: 10px;
+  border-radius: 8px;
 }
 
 .run-step-icon.icon-view::after {
-  top: 7px;
-  left: 6px;
-  width: 4px;
-  height: 4px;
+  top: 6px;
+  left: 5px;
+  width: 2px;
+  height: 2px;
   border: 1.5px solid currentColor;
   border-radius: 999px;
 }
 
 .run-step-icon.icon-camera::before {
-  top: 5px;
-  left: 1px;
-  width: 14px;
-  height: 9px;
+  top: 3px;
+  left: 0;
+  width: 12px;
+  height: 7px;
   border: 1.5px solid currentColor;
-  border-radius: 3px;
+  border-radius: 2px;
 }
 
 .run-step-icon.icon-camera::after {
   top: 1px;
   left: 4px;
-  width: 6px;
-  height: 4px;
+  width: 4px;
+  height: 2px;
   border: 1.5px solid currentColor;
   border-bottom: 0;
   border-radius: 2px 2px 0 0;
 }
 
 .run-step-icon.icon-plan::before {
-  top: 3px;
-  left: 3px;
+  top: 2px;
+  left: 2px;
   width: 2px;
   height: 2px;
   border-radius: 999px;
@@ -305,9 +482,9 @@ body {
 }
 
 .run-step-icon.icon-plan::after {
-  top: 3px;
-  left: 7px;
-  width: 7px;
+  top: 2px;
+  left: 6px;
+  width: 5px;
   height: 2px;
   border-radius: 2px;
   background: currentColor;
@@ -315,18 +492,18 @@ body {
 }
 
 .run-step-icon.icon-done::before {
-  top: 1px;
-  left: 1px;
-  width: 12px;
-  height: 12px;
+  top: 0;
+  left: 0;
+  width: 11px;
+  height: 11px;
   border: 1.5px solid currentColor;
   border-radius: 999px;
 }
 
 .run-step-icon.icon-done::after {
-  top: 6px;
-  left: 5px;
-  width: 6px;
+  top: 5px;
+  left: 3px;
+  width: 5px;
   height: 3px;
   border-left: 1.5px solid currentColor;
   border-bottom: 1.5px solid currentColor;
@@ -334,139 +511,281 @@ body {
 }
 
 .run-step-icon.icon-failed::before {
-  top: 1px;
-  left: 1px;
-  width: 12px;
-  height: 12px;
+  top: 0;
+  left: 0;
+  width: 11px;
+  height: 11px;
   border: 1.5px solid currentColor;
   border-radius: 999px;
 }
 
 .run-step-icon.icon-failed::after {
-  top: 7px;
-  left: 4px;
-  width: 8px;
+  top: 6px;
+  left: 2px;
+  width: 7px;
   height: 1.5px;
   background: currentColor;
 }
 
-.run-step.done .run-step-icon {
-  color: var(--status-ok);
+.step-item.done .run-step-icon {
+  color: var(--ok);
 }
 
-.run-step.failed .run-step-icon {
-  color: var(--status-error-strong);
+.step-item.failed .run-step-icon {
+  color: var(--error);
 }
 
-.composer {
-  border-top: 1px solid var(--line);
-  padding: 10px;
-  display: grid;
+.thinking-bubble {
+  background: #fff;
+  border: 1px solid var(--line-soft);
+  border-radius: 16px;
+  border-top-left-radius: 4px;
+  padding: 12px 16px;
+  box-shadow: 0 1px 4px rgba(0, 0, 0, 0.06);
+  display: flex;
+  align-items: center;
   gap: 8px;
-  background: var(--card-bg);
 }
 
-.composer textarea {
-  width: 100%;
-  min-height: 72px;
-  max-height: 160px;
-  resize: vertical;
-  padding: 10px;
+.thinking-bubble span {
+  font-size: 13px;
+  color: var(--text-muted);
+}
+
+.spinner {
+  width: 14px;
+  height: 14px;
+  border: 2px solid var(--crail-soft);
+  border-top-color: var(--crail);
+  border-radius: 50%;
+  flex-shrink: 0;
+  animation: spin 0.7s linear infinite;
+}
+
+.composer-wrap {
+  flex-shrink: 0;
+  background: #fff;
+  border-top: 1px solid var(--line);
+  padding: 10px 12px;
+}
+
+.composer-box {
+  display: flex;
+  align-items: flex-end;
+  gap: 6px;
   border: 1px solid var(--line);
-  border-radius: 8px;
-  font: inherit;
+  border-radius: 12px;
+  background: var(--linen);
+  padding: 8px 8px 8px 12px;
+  transition: border-color 0.15s, box-shadow 0.15s;
+}
+
+.composer-box:focus-within {
+  border-color: var(--crail);
+  box-shadow: 0 0 0 3px rgba(193, 95, 60, 0.12);
+}
+
+.composer-textarea {
+  flex: 1;
+  resize: none;
+  background: transparent;
+  border: 0;
+  outline: none;
+  font-size: 13.5px;
+  font-family: inherit;
   color: var(--text);
-  background: var(--card-bg);
+  line-height: 1.55;
+  min-height: 22px;
+  max-height: 160px;
+  overflow-y: auto;
+}
+
+.composer-textarea::placeholder {
+  color: var(--text-subtle);
 }
 
 .composer-actions {
   display: flex;
-  justify-content: flex-end;
-  gap: 8px;
+  align-items: center;
+  gap: 4px;
+  flex-shrink: 0;
 }
 
-button {
-  border: 0;
+.btn-stop,
+.btn-send {
+  width: 32px;
+  height: 32px;
   border-radius: 8px;
-  background: var(--accent);
-  color: var(--bf-white);
-  padding: 8px 12px;
+  border: 0;
+  display: flex;
+  align-items: center;
+  justify-content: center;
+  transition: background 0.15s, opacity 0.15s, color 0.15s;
+}
+
+.btn-stop {
+  background: transparent;
+  cursor: not-allowed;
+  color: var(--text-subtle);
+  opacity: 0.3;
+}
+
+.btn-stop.active {
   cursor: pointer;
+  opacity: 1;
+  color: #EF4444;
 }
 
-button:hover {
-  background: var(--accent-hover);
+.btn-stop.active:hover {
+  background: #FEF2F2;
 }
 
-button:active {
-  background: var(--accent-press);
+.btn-stop svg {
+  width: 15px;
+  height: 15px;
 }
 
-button.secondary {
-  background: var(--accent-soft);
-  color: var(--accent-soft-text);
+.btn-send {
+  background: var(--crail);
+  color: #fff;
+  cursor: pointer;
+  box-shadow: 0 2px 6px rgba(193, 95, 60, 0.3);
 }
 
-button.secondary:hover {
-  background: #E2C8BE;
+.btn-send:hover:not(:disabled) {
+  background: var(--crail-dark);
 }
 
-.menu-backdrop {
+.btn-send:active:not(:disabled) {
+  background: var(--crail-press);
+}
+
+.btn-send:disabled {
+  opacity: 0.35;
+  cursor: not-allowed;
+  box-shadow: none;
+}
+
+.btn-send svg {
+  width: 13px;
+  height: 13px;
+}
+
+.popover-backdrop {
   position: absolute;
   inset: 0;
-  background: rgba(0, 0, 0, 0.14);
   z-index: 20;
+  display: block;
 }
 
 .popover-panel {
   position: absolute;
-  top: 52px;
+  top: 56px;
   left: 12px;
   right: 12px;
-  border-radius: 14px;
+  z-index: 30;
+  border-radius: 16px;
   background: var(--menu-bg);
-  border: 1px solid var(--menu-line);
-  box-shadow: 0 18px 36px rgba(66, 49, 39, 0.32);
-  backdrop-filter: blur(14px);
-  z-index: 21;
+  border: 1px solid rgba(255, 255, 255, 0.1);
+  box-shadow: 0 20px 60px rgba(0, 0, 0, 0.5);
+  backdrop-filter: blur(16px);
+  overflow: hidden;
   max-height: min(360px, calc(100vh - 70px));
-  overflow: auto;
+}
+
+.popover-label {
+  font-size: 10px;
+  font-weight: 600;
+  color: rgba(255, 255, 255, 0.35);
+  text-transform: uppercase;
+  letter-spacing: 0.08em;
+  padding: 10px 14px 4px;
 }
 
 .popover-list {
   list-style: none;
-  margin: 0;
-  padding: 8px;
+  padding: 4px 8px 8px;
+  max-height: 260px;
+  overflow-y: auto;
+}
+
+.popover-list::-webkit-scrollbar {
+  width: 4px;
+}
+
+.popover-list::-webkit-scrollbar-thumb {
+  background: rgba(255, 255, 255, 0.15);
+  border-radius: 4px;
 }
 
-.popover-list button {
+.popover-item {
+  display: flex;
+  align-items: center;
+  justify-content: space-between;
   width: 100%;
-  text-align: left;
+  padding: 9px 12px;
+  border-radius: 10px;
   border: 0;
   background: transparent;
-  color: var(--menu-text);
-  border-radius: 10px;
-  padding: 10px 12px;
-  margin: 1px 0;
-  font-size: 14px;
+  color: rgba(255, 255, 255, 0.72);
+  font-size: 13px;
+  cursor: pointer;
+  text-align: left;
+  transition: background 0.12s, color 0.12s;
 }
 
-.popover-list button.active,
-.popover-list button:hover {
-  background: rgba(255, 255, 255, 0.11);
+.popover-item:hover {
+  background: rgba(255, 255, 255, 0.1);
+  color: #fff;
 }
 
-.popover-list .hint {
-  font-size: 12px;
-  color: rgba(255, 255, 255, 0.66);
-  margin-top: 4px;
+.popover-item.active {
+  background: rgba(255, 255, 255, 0.14);
+  color: #fff;
+  font-weight: 500;
 }
 
-.popover-list .empty-item {
+.popover-item.active::after {
+  content: '✓';
+  color: var(--crail);
+  font-weight: 700;
+}
+
+.popover-item.custom-item {
+  color: rgba(255, 255, 255, 0.45);
+}
+
+.empty-item {
   color: rgba(255, 255, 255, 0.75);
   padding: 10px 12px;
+  font-size: 13px;
 }
 
 .hidden {
   display: none;
 }
+
+.sr-only {
+  position: absolute;
+  width: 1px;
+  height: 1px;
+  overflow: hidden;
+  clip: rect(0, 0, 0, 0);
+  white-space: nowrap;
+}
+
+@keyframes pulse {
+  0%,
+  100% {
+    opacity: 1;
+  }
+  50% {
+    opacity: 0.4;
+  }
+}
+
+@keyframes spin {
+  to {
+    transform: rotate(360deg);
+  }
+}
diff --git a/extension/agent-panel.html b/extension/agent-panel.html
index 70b90ae..55dbcf8 100644
--- a/extension/agent-panel.html
+++ b/extension/agent-panel.html
@@ -9,15 +9,39 @@
 <body>
   <main class="agent-shell">
     <header class="agent-header">
-      <div class="header-row">
-        <button id="bf-model-trigger" type="button" class="selector-pill" aria-expanded="false">Model</button>
-        <button id="bf-session-trigger" type="button" class="selector-pill" aria-expanded="false">Session</button>
-        <button id="bf-new-session" type="button" class="icon-btn" aria-label="New Session" title="New Session">+</button>
+      <div class="title-bar">
+        <div class="brand">
+          <div class="brand-icon" aria-hidden="true">B</div>
+          <span class="brand-name">BrowserForce</span>
+        </div>
+      </div>
+
+      <div class="controls">
+        <button id="bf-model-trigger" type="button" class="pill-btn" aria-expanded="false" aria-haspopup="listbox">
+          <span id="bf-model-label">Model: Default</span>
+          <svg viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" aria-hidden="true">
+            <path stroke-linecap="round" stroke-linejoin="round" d="M19 9l-7 7-7-7"></path>
+          </svg>
+        </button>
+
+        <button id="bf-session-trigger" type="button" class="pill-btn" aria-expanded="false" aria-haspopup="listbox">
+          <span id="bf-session-label">Session</span>
+          <svg viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" aria-hidden="true">
+            <path stroke-linecap="round" stroke-linejoin="round" d="M19 9l-7 7-7-7"></path>
+          </svg>
+        </button>
+
+        <button id="bf-new-session" type="button" class="round-btn" aria-label="New Session" title="New Session">
+          <svg viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" aria-hidden="true">
+            <path stroke-linecap="round" stroke-linejoin="round" d="M12 4v16m8-8H4"></path>
+          </svg>
+        </button>
+
+        <div id="bf-agent-status" class="status-circle" title="Starting...">
+          <span id="bf-agent-status-icon" class="status-dot" aria-hidden="true"></span>
+          <span id="bf-agent-status-text" class="sr-only">Starting...</span>
+        </div>
       </div>
-      <p id="bf-agent-status" class="status">
-        <span id="bf-agent-status-icon" class="status-icon" aria-hidden="true">●</span>
-        <span id="bf-agent-status-text">Starting...</span>
-      </p>
       <div id="bf-tab-attach-banner" class="tab-attach hidden" role="status" aria-live="polite">
         <span id="bf-tab-attach-text">Current tab is not connected</span>
         <button id="bf-attach-current-tab" type="button" class="tab-attach-btn">Attach current tab</button>
@@ -26,22 +50,35 @@
 
     <section class="chat">
       <div id="bf-transcript" class="transcript"></div>
-      <form id="bf-chat-form" class="composer">
-        <textarea id="bf-chat-input" placeholder="Send a message to BrowserForce Agent"></textarea>
+      <form id="bf-chat-form" class="composer-wrap">
+        <div class="composer-box">
+          <textarea id="bf-chat-input" class="composer-textarea" rows="1" placeholder="Message BrowserForce Agent"></textarea>
         <div class="composer-actions">
-          <button id="bf-stop-run" type="button" class="secondary">Stop</button>
-          <button type="submit">Send</button>
+            <button id="bf-stop-run" type="button" class="btn-stop" aria-label="Stop" title="Stop" disabled>
+              <svg viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" aria-hidden="true">
+                <circle cx="12" cy="12" r="10"></circle>
+                <rect x="9" y="9" width="6" height="6" rx="1" fill="currentColor" stroke="none"></rect>
+              </svg>
+            </button>
+            <button id="bf-send-btn" type="submit" class="btn-send" aria-label="Send" title="Send" disabled>
+              <svg viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2.2" aria-hidden="true">
+                <path stroke-linecap="round" stroke-linejoin="round" d="M12 19V5m0 0l-6 6m6-6l6 6"></path>
+              </svg>
+            </button>
+          </div>
         </div>
       </form>
     </section>
 
-    <div id="bf-popover-backdrop" class="menu-backdrop hidden"></div>
+    <div id="bf-popover-backdrop" class="popover-backdrop hidden"></div>
 
-    <section id="bf-model-panel" class="popover-panel hidden">
+    <section id="bf-model-panel" class="popover-panel hidden" role="listbox" aria-label="Available models">
+      <p class="popover-label">Available Models</p>
       <ul id="bf-model-list" class="popover-list"></ul>
     </section>
 
-    <section id="bf-session-panel" class="popover-panel hidden">
+    <section id="bf-session-panel" class="popover-panel hidden" role="listbox" aria-label="Sessions">
+      <p class="popover-label">Sessions</p>
       <ul id="bf-switch-session-list" class="popover-list"></ul>
     </section>
   </main>
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 69bca4e..07681f1 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -17,13 +17,19 @@ const state = {
   eventLoopToken: 0,
   sessionSelectionToken: 0,
   popover: 'none',
+  status: {
+    kind: 'info',
+    text: 'Starting...',
+  },
 };
 
 const statusEl = document.getElementById('bf-agent-status');
 const statusIconEl = document.getElementById('bf-agent-status-icon');
 const statusTextEl = document.getElementById('bf-agent-status-text');
 const modelTriggerBtn = document.getElementById('bf-model-trigger');
+const modelLabelEl = document.getElementById('bf-model-label');
 const sessionTriggerBtn = document.getElementById('bf-session-trigger');
+const sessionLabelEl = document.getElementById('bf-session-label');
 const newSessionBtn = document.getElementById('bf-new-session');
 const popoverBackdropEl = document.getElementById('bf-popover-backdrop');
 const modelPanelEl = document.getElementById('bf-model-panel');
@@ -41,16 +47,67 @@ const attachCurrentTabBtn = document.getElementById('bf-attach-current-tab');
 let tabAttachRefreshTimer = null;
 let tabAttachRefreshToken = 0;
 
+function getActiveSession() {
+  return state.value.sessions.find((item) => item.sessionId === state.value.activeSessionId) || null;
+}
+
+function getActiveMessages() {
+  return state.value.messagesBySession[state.value.activeSessionId] || [];
+}
+
+function getActiveRun() {
+  const sessionId = state.value.activeSessionId;
+  if (!sessionId) return null;
+  const runId = getSessionRunId(state.currentRunBySession, sessionId);
+  if (!runId) return null;
+  return state.value.runs[runId] || null;
+}
+
+function isActiveRunInProgress() {
+  const run = getActiveRun();
+  return !!(run && !run.done);
+}
+
+function autoResizeInput() {
+  chatInputEl.style.height = 'auto';
+  chatInputEl.style.height = `${Math.min(chatInputEl.scrollHeight, 160)}px`;
+}
+
+function syncComposerState() {
+  const enabled = !chatInputEl.disabled;
+  const hasText = chatInputEl.value.trim().length > 0;
+  const runInProgress = isActiveRunInProgress();
+
+  stopRunBtn.disabled = !enabled || !runInProgress;
+  stopRunBtn.classList.toggle('active', enabled && runInProgress);
+  sendBtn.disabled = !enabled || runInProgress || !hasText;
+}
+
+function syncStatusIndicator() {
+  const runInProgress = isActiveRunInProgress();
+  const hasError = state.status.kind === 'error';
+  const text = hasError
+    ? state.status.text
+    : runInProgress
+      ? 'Thinking...'
+      : state.status.text;
+
+  statusEl.classList.toggle('error', hasError);
+  statusEl.classList.toggle('thinking', runInProgress && !hasError);
+  statusEl.title = text || 'Ready';
+  statusTextEl.textContent = text || '';
+  statusIconEl.textContent = '';
+}
+
 function setStatus(kind, text) {
-  statusTextEl.textContent = text;
-  statusEl.classList.toggle('error', kind === 'error');
-  statusIconEl.textContent = kind === 'error' ? '!' : '●';
+  state.status = { kind, text };
+  syncStatusIndicator();
 }
 
 function setComposerEnabled(enabled) {
   chatInputEl.disabled = !enabled;
-  stopRunBtn.disabled = !enabled;
-  sendBtn.disabled = !enabled;
+  autoResizeInput();
+  syncComposerState();
 }
 
 function setTabAttachBannerState({
@@ -83,22 +140,26 @@ function dispatchEvent(evt) {
   render();
 }
 
-function getActiveSession() {
-  return state.value.sessions.find((item) => item.sessionId === state.value.activeSessionId) || null;
-}
-
-function getActiveMessages() {
-  return state.value.messagesBySession[state.value.activeSessionId] || [];
-}
-
 function formatModelLabel(model) {
   return model && String(model).trim() ? model : 'Default';
 }
 
 function renderSelectors() {
   const activeSession = getActiveSession();
-  modelTriggerBtn.textContent = `Model: ${formatModelLabel(activeSession?.model)}`;
-  sessionTriggerBtn.textContent = activeSession?.title || 'Session';
+  const modelLabel = `Model: ${formatModelLabel(activeSession?.model)}`;
+  const sessionLabel = activeSession?.title || 'Session';
+
+  if (modelLabelEl) {
+    modelLabelEl.textContent = modelLabel;
+  } else {
+    modelTriggerBtn.textContent = modelLabel;
+  }
+
+  if (sessionLabelEl) {
+    sessionLabelEl.textContent = sessionLabel;
+  } else {
+    sessionTriggerBtn.textContent = sessionLabel;
+  }
 }
 
 function renderModelList() {
@@ -107,9 +168,9 @@ function renderModelList() {
 
   const rows = state.modelPresets.map((preset) => {
     const active = (preset.value || null) === activeModel ? 'active' : '';
-    return `<li><button type="button" data-model="${escapeHtml(preset.value || '')}" class="${active}">${escapeHtml(preset.label)}</button></li>`;
+    return `<li><button type="button" data-model="${escapeHtml(preset.value || '')}" class="popover-item ${active}"><span>${escapeHtml(preset.label)}</span></button></li>`;
   });
-  rows.push('<li><button type="button" data-model-custom="1">Custom...</button></li>');
+  rows.push('<li><button type="button" data-model-custom="1" class="popover-item custom-item"><span>Custom...</span></button></li>');
 
   modelListEl.innerHTML = rows.join('');
 
@@ -156,7 +217,7 @@ function renderSessions() {
       const active = session.sessionId === state.value.activeSessionId ? 'active' : '';
       const title = session.title || session.sessionId;
       const suffix = (titleCounts.get(title) || 0) > 1 ? ` · ${session.sessionId.slice(0, 8)}` : '';
-      return `<li><button type="button" data-session-id="${session.sessionId}" class="${active}">${escapeHtml(`${title}${suffix}`)}</button></li>`;
+      return `<li><button type="button" data-session-id="${session.sessionId}" class="popover-item ${active}"><span>${escapeHtml(`${title}${suffix}`)}</span></button></li>`;
     })
     .join('');
 
@@ -185,22 +246,31 @@ function renderRunSteps(runId, run) {
   if (!runId || !run || !Array.isArray(run.steps) || run.steps.length === 0) return '';
   const count = run.steps.length;
   const expanded = isRunStepsExpanded(runId);
-  const summary = `<button type="button" class="run-steps-trigger" data-run-steps-toggle="${escapeHtml(runId)}">${count} step${count === 1 ? '' : 's'}</button>`;
-  if (!expanded) {
-    return `<div class="run-steps-summary">${summary}</div>`;
-  }
 
   const items = run.steps
     .map((step) => {
-      const kind = step?.kind || 'reasoning';
       const status = step?.status || 'running';
       const label = step?.label || 'Step';
       const icon = classifyRunStepIcon(step);
-      return `<li class="run-step ${escapeHtml(kind)} ${escapeHtml(status)}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="run-step-label">${escapeHtml(label)}</span></li>`;
+      return `<li class="step-item ${escapeHtml(status)}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="step-label">${escapeHtml(label)}</span></li>`;
     })
     .join('');
 
-  return `<div class="run-steps-summary expanded">${summary}<ol class="run-steps-list">${items}</ol></div>`;
+  return `
+    <div class="run-steps-summary">
+      <button type="button" class="steps-toggle ${expanded ? 'open' : ''}" data-run-steps-toggle="${escapeHtml(runId)}">
+        <svg fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2" aria-hidden="true">
+          <path stroke-linecap="round" stroke-linejoin="round" d="M9 5l7 7-7 7"></path>
+        </svg>
+        <strong>${count} step${count === 1 ? '' : 's'}</strong>
+      </button>
+      <ol class="steps-list ${expanded ? 'open' : ''}">${items}</ol>
+    </div>
+  `;
+}
+
+function renderContent(value) {
+  return escapeHtml(value).replace(/`([^`]+)`/g, '<code>$1</code>');
 }
 
 function bindTranscriptHandlers() {
@@ -220,21 +290,65 @@ function renderTranscript() {
   const chunks = messages.map((msg) => {
     const role = msg.role || 'assistant';
     if (role === 'user') {
-      return `<article class="message-row user"><div class="message user">${escapeHtml(msg.text || '')}</div></article>`;
+      return `
+        <article class="message user">
+          <div class="msg-meta"><span class="msg-author">You</span></div>
+          <div class="bubble-user">${escapeHtml(msg.text || '')}</div>
+        </article>
+      `;
     }
 
     const messageRun = msg.runId ? state.value.runs[msg.runId] : null;
-    return `<article class="message-row assistant">${renderRunSteps(msg.runId, messageRun)}<div class="message assistant">${escapeHtml(msg.text || '')}</div></article>`;
+    return `
+      <article class="message assistant">
+        <div class="msg-meta"><span class="msg-author">BrowserForce</span></div>
+        <div class="msg-content-wrap">
+          ${renderRunSteps(msg.runId, messageRun)}
+          <div class="bubble-assistant"><p>${renderContent(msg.text || '')}</p></div>
+        </div>
+      </article>
+    `;
   });
 
   if (run && !run.done) {
-    const liveText = run.text ? `<div class="message assistant">${escapeHtml(run.text || '')}</div>` : '';
-    chunks.push(`<article class="message-row assistant">${renderRunSteps(sessionRunId, run)}${liveText}</article>`);
+    if (run.text && run.text.trim()) {
+      chunks.push(`
+        <article class="message assistant">
+          <div class="msg-meta"><span class="msg-author">BrowserForce</span></div>
+          <div class="msg-content-wrap">
+            ${renderRunSteps(sessionRunId, run)}
+            <div class="bubble-assistant"><p>${renderContent(run.text)}</p></div>
+          </div>
+        </article>
+      `);
+    } else {
+      chunks.push(`
+        <article class="message assistant">
+          <div class="msg-meta"><span class="msg-author">BrowserForce</span></div>
+          <div class="thinking-bubble"><div class="spinner"></div><span>Thinking...</span></div>
+        </article>
+      `);
+    }
+  }
+
+  if (!chunks.length) {
+    transcriptEl.innerHTML = `
+      <div class="empty-state">
+        <div class="empty-icon">B</div>
+        <div>
+          <p class="empty-title">Start a conversation</p>
+          <p class="empty-sub">Ask BrowserForce to inspect your active tab or run a browser task.</p>
+        </div>
+      </div>
+    `;
+  } else {
+    transcriptEl.innerHTML = chunks.join('');
   }
 
-  transcriptEl.innerHTML = chunks.join('') || '<article class="message-row assistant"><div class="message assistant">No messages yet.</div></article>';
   bindTranscriptHandlers();
   transcriptEl.scrollTop = transcriptEl.scrollHeight;
+  syncStatusIndicator();
+  syncComposerState();
 }
 
 function setPopover(popover) {
@@ -648,6 +762,7 @@ async function sendMessage(text) {
   if (body.runId) {
     state.currentRunBySession = assignSessionRunId(state.currentRunBySession, sessionId, body.runId);
   }
+  render();
 }
 
 async function stopRun() {
@@ -666,8 +781,11 @@ chatFormEl.addEventListener('submit', async (event) => {
   try {
     await sendMessage(text);
     chatInputEl.value = '';
+    autoResizeInput();
+    syncComposerState();
   } catch (error) {
     chatInputEl.value = text;
+    syncComposerState();
     setStatus('error', error?.message || 'Failed to send message');
   }
 });
@@ -676,6 +794,7 @@ chatInputEl.addEventListener('keydown', (event) => {
   if (event.key !== 'Enter' || event.shiftKey) return;
   if (event.isComposing) return;
   event.preventDefault();
+  if (sendBtn.disabled) return;
   chatFormEl.requestSubmit();
 });
 
@@ -695,6 +814,11 @@ if (attachCurrentTabBtn) {
   });
 }
 
+chatInputEl.addEventListener('input', () => {
+  autoResizeInput();
+  syncComposerState();
+});
+
 newSessionBtn.addEventListener('click', () => {
   createSession()
     .then(() => setPopover('none'))
@@ -719,6 +843,7 @@ popoverBackdropEl.addEventListener('click', () => {
 
 (async function init() {
   try {
+    setComposerEnabled(false);
     setStatus('info', 'Connecting...');
     await loadAuth();
     await ensureCurrentTabAttached();
@@ -737,6 +862,7 @@ popoverBackdropEl.addEventListener('click', () => {
     setComposerEnabled(true);
     scheduleTabAttachRefresh(0);
     setStatus('ready', 'Ready');
+    render();
   } catch {
     setComposerEnabled(false);
     setTabAttachBannerState({ hidden: true });

From d7fc71b308f199dda82de6bd1f24080b13d8061b Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 15:25:34 +0530
Subject: [PATCH 123/192] Improve sidepanel session UX and prevent message
 overflow

---
 extension/agent-panel.css                    | 149 +++++++++++----
 extension/agent-panel.html                   |   7 -
 extension/agent-panel.js                     | 189 ++++++++++++++++++-
 test/agent/agent-panel-contract.test.js      |  10 +
 test/agent/agent-panel-send-contract.test.js |  23 +++
 5 files changed, 326 insertions(+), 52 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index fac41d9..ffac37e 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -48,45 +48,11 @@ body {
   border-bottom: 1px solid var(--header-border);
 }
 
-.title-bar {
-  display: flex;
-  align-items: center;
-  justify-content: space-between;
-  padding: 12px 14px 10px;
-}
-
-.brand {
-  display: flex;
-  align-items: center;
-  gap: 8px;
-}
-
-.brand-icon {
-  width: 24px;
-  height: 24px;
-  border-radius: 6px;
-  background: var(--crail);
-  display: flex;
-  align-items: center;
-  justify-content: center;
-  color: #fff;
-  font-weight: 700;
-  font-size: 12px;
-  flex-shrink: 0;
-}
-
-.brand-name {
-  font-size: 13.5px;
-  font-weight: 600;
-  color: #fff;
-  letter-spacing: -0.02em;
-}
-
 .controls {
   display: flex;
   align-items: center;
   gap: 8px;
-  padding: 0 14px 12px;
+  padding: 12px 14px;
 }
 
 .pill-btn {
@@ -219,6 +185,7 @@ body {
   flex: 1;
   min-height: 0;
   overflow-y: auto;
+  overflow-x: hidden;
   padding: 16px;
   display: flex;
   flex-direction: column;
@@ -302,6 +269,7 @@ body {
 
 .msg-content-wrap {
   max-width: 96%;
+  min-width: 0;
 }
 
 .bubble-user {
@@ -315,6 +283,8 @@ body {
   max-width: 82%;
   box-shadow: 0 2px 8px rgba(193, 95, 60, 0.25);
   white-space: pre-wrap;
+  overflow-wrap: anywhere;
+  word-break: break-word;
 }
 
 .bubble-assistant {
@@ -324,6 +294,8 @@ body {
   border-top-left-radius: 4px;
   padding: 12px 16px;
   box-shadow: 0 1px 4px rgba(0, 0, 0, 0.06);
+  max-width: 100%;
+  overflow-wrap: anywhere;
 }
 
 .bubble-assistant p {
@@ -331,6 +303,8 @@ body {
   line-height: 1.6;
   color: var(--text);
   white-space: pre-wrap;
+  overflow-wrap: anywhere;
+  word-break: break-word;
 }
 
 .bubble-assistant code {
@@ -341,6 +315,9 @@ body {
   padding: 2px 6px;
   border-radius: 5px;
   border: 1px solid var(--line);
+  white-space: pre-wrap;
+  overflow-wrap: anywhere;
+  word-break: break-word;
 }
 
 .run-steps-summary {
@@ -755,6 +732,108 @@ body {
   color: rgba(255, 255, 255, 0.45);
 }
 
+.session-row {
+  display: flex;
+  align-items: center;
+  gap: 6px;
+  margin: 1px 0;
+}
+
+.session-item {
+  flex: 1;
+  min-width: 0;
+  display: flex;
+  flex-direction: column;
+  align-items: flex-start;
+  gap: 2px;
+}
+
+.session-item.active::after {
+  content: none;
+}
+
+.session-main {
+  width: 100%;
+  overflow: hidden;
+  text-overflow: ellipsis;
+  white-space: nowrap;
+}
+
+.session-meta {
+  font-size: 10px;
+  color: rgba(255, 255, 255, 0.52);
+}
+
+.session-edit-btn {
+  width: 28px;
+  height: 28px;
+  flex-shrink: 0;
+  border-radius: 8px;
+  border: 0;
+  background: transparent;
+  color: rgba(255, 255, 255, 0.6);
+  display: flex;
+  align-items: center;
+  justify-content: center;
+  cursor: pointer;
+  transition: background 0.12s, color 0.12s;
+}
+
+.session-edit-btn:hover {
+  background: rgba(255, 255, 255, 0.1);
+  color: #fff;
+}
+
+.session-edit-btn svg {
+  width: 13px;
+  height: 13px;
+}
+
+.session-edit-form {
+  width: 100%;
+  display: flex;
+  align-items: center;
+  gap: 6px;
+  padding: 6px 6px;
+}
+
+.session-edit-form input {
+  flex: 1;
+  min-width: 0;
+  height: 30px;
+  border-radius: 8px;
+  border: 1px solid rgba(255, 255, 255, 0.24);
+  background: rgba(255, 255, 255, 0.08);
+  color: #fff;
+  padding: 0 10px;
+  font-size: 12px;
+  outline: none;
+}
+
+.session-edit-form input::placeholder {
+  color: rgba(255, 255, 255, 0.45);
+}
+
+.session-edit-save,
+.session-edit-cancel {
+  height: 30px;
+  border-radius: 8px;
+  border: 0;
+  padding: 0 10px;
+  font-size: 11px;
+  cursor: pointer;
+}
+
+.session-edit-save {
+  background: var(--crail);
+  color: #fff;
+}
+
+.session-edit-cancel {
+  background: rgba(255, 255, 255, 0.14);
+  color: rgba(255, 255, 255, 0.86);
+}
+
 .empty-item {
   color: rgba(255, 255, 255, 0.75);
   padding: 10px 12px;
diff --git a/extension/agent-panel.html b/extension/agent-panel.html
index 55dbcf8..9e20c0f 100644
--- a/extension/agent-panel.html
+++ b/extension/agent-panel.html
@@ -9,13 +9,6 @@
 <body>
   <main class="agent-shell">
     <header class="agent-header">
-      <div class="title-bar">
-        <div class="brand">
-          <div class="brand-icon" aria-hidden="true">B</div>
-          <span class="brand-name">BrowserForce</span>
-        </div>
-      </div>
-
       <div class="controls">
         <button id="bf-model-trigger" type="button" class="pill-btn" aria-expanded="false" aria-haspopup="listbox">
           <span id="bf-model-label">Model: Default</span>
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 07681f1..55c6d76 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -12,6 +12,8 @@ const state = {
   auth: null,
   modelPresets: [{ value: null, label: 'Default' }],
   currentRunBySession: {},
+  editingSessionId: null,
+  sessionTitleDrafts: {},
   expandedRunSteps: {},
   eventController: null,
   eventLoopToken: 0,
@@ -144,10 +146,48 @@ function formatModelLabel(model) {
   return model && String(model).trim() ? model : 'Default';
 }
 
+function isDefaultSessionTitle(title) {
+  const lowered = String(title || '').trim().toLowerCase();
+  return !lowered || lowered === 'new session' || lowered === 'new chat';
+}
+
+function formatShortSessionId(sessionId) {
+  const raw = String(sessionId || '').trim();
+  if (!raw) return 'unknown';
+  return raw.slice(0, 8);
+}
+
+function formatSessionDisplayName(session) {
+  if (!session) return 'Session';
+  const title = String(session.title || '').trim();
+  if (!isDefaultSessionTitle(title)) return title;
+  return session.sessionId || 'Session';
+}
+
+function formatSessionLabel(session) {
+  if (!session) return 'Session';
+  const title = String(session.title || '').trim();
+  if (!isDefaultSessionTitle(title)) return title;
+  return formatShortSessionId(session.sessionId);
+}
+
+function formatSessionTimestamp(session) {
+  const raw = session?.updatedAt || session?.createdAt;
+  if (!raw) return 'Unknown time';
+  const date = new Date(raw);
+  if (Number.isNaN(date.getTime())) return 'Unknown time';
+  return date.toLocaleString(undefined, {
+    month: 'short',
+    day: 'numeric',
+    hour: 'numeric',
+    minute: '2-digit',
+  });
+}
+
 function renderSelectors() {
   const activeSession = getActiveSession();
   const modelLabel = `Model: ${formatModelLabel(activeSession?.model)}`;
-  const sessionLabel = activeSession?.title || 'Session';
+  const sessionLabel = formatSessionLabel(activeSession);
 
   if (modelLabelEl) {
     modelLabelEl.textContent = modelLabel;
@@ -206,18 +246,49 @@ function renderSessions() {
     return;
   }
 
-  const titleCounts = new Map();
-  for (const session of sessions) {
-    const title = (session.title || '').trim() || session.sessionId;
-    titleCounts.set(title, (titleCounts.get(title) || 0) + 1);
-  }
-
   switchSessionListEl.innerHTML = sessions
     .map((session) => {
       const active = session.sessionId === state.value.activeSessionId ? 'active' : '';
-      const title = session.title || session.sessionId;
-      const suffix = (titleCounts.get(title) || 0) > 1 ? ` · ${session.sessionId.slice(0, 8)}` : '';
-      return `<li><button type="button" data-session-id="${session.sessionId}" class="popover-item ${active}"><span>${escapeHtml(`${title}${suffix}`)}</span></button></li>`;
+      const displayName = formatSessionDisplayName(session);
+      const timestamp = formatSessionTimestamp(session);
+      const shortId = formatShortSessionId(session.sessionId);
+      const editing = session.sessionId === state.editingSessionId;
+      const draftTitle = Object.prototype.hasOwnProperty.call(state.sessionTitleDrafts, session.sessionId)
+        ? state.sessionTitleDrafts[session.sessionId]
+        : (isDefaultSessionTitle(session.title) ? '' : String(session.title || '').trim());
+
+      if (editing) {
+        return `
+          <li class="session-row editing">
+            <form class="session-edit-form" data-session-edit-form="${escapeHtml(session.sessionId)}">
+              <input
+                type="text"
+                data-session-edit-input="${escapeHtml(session.sessionId)}"
+                value="${escapeHtml(draftTitle)}"
+                placeholder="Session name"
+                maxlength="180"
+              >
+              <button type="submit" class="session-edit-save">Save</button>
+              <button type="button" class="session-edit-cancel" data-session-edit-cancel="${escapeHtml(session.sessionId)}">Cancel</button>
+            </form>
+          </li>
+        `;
+      }
+
+      return `
+        <li class="session-row">
+          <button type="button" data-session-id="${escapeHtml(session.sessionId)}" class="popover-item session-item ${active}">
+            <span class="session-main">${escapeHtml(displayName)}</span>
+            <span class="session-meta">${escapeHtml(`${shortId} · ${timestamp}`)}</span>
+          </button>
+          <button type="button" class="session-edit-btn" data-session-edit-btn="${escapeHtml(session.sessionId)}" aria-label="Rename session" title="Rename session">
+            <svg viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" aria-hidden="true">
+              <path stroke-linecap="round" stroke-linejoin="round" d="M12 20h9"></path>
+              <path stroke-linecap="round" stroke-linejoin="round" d="M16.5 3.5a2.121 2.121 0 113 3L7 19l-4 1 1-4 12.5-12.5z"></path>
+            </svg>
+          </button>
+        </li>
+      `;
     })
     .join('');
 
@@ -227,6 +298,48 @@ function renderSessions() {
       setPopover('none');
     });
   });
+
+  switchSessionListEl.querySelectorAll('button[data-session-edit-btn]').forEach((button) => {
+    button.addEventListener('click', () => {
+      beginSessionEdit(button.getAttribute('data-session-edit-btn') || '');
+    });
+  });
+
+  switchSessionListEl.querySelectorAll('form[data-session-edit-form]').forEach((form) => {
+    form.addEventListener('submit', async (event) => {
+      event.preventDefault();
+      const sessionId = form.getAttribute('data-session-edit-form') || '';
+      const input = form.querySelector('input[data-session-edit-input]');
+      const title = input?.value || '';
+      try {
+        await updateSessionTitle(sessionId, title);
+      } catch (error) {
+        setStatus('error', error?.message || 'Unable to rename session');
+      }
+    });
+  });
+
+  switchSessionListEl.querySelectorAll('button[data-session-edit-cancel]').forEach((button) => {
+    button.addEventListener('click', () => {
+      cancelSessionEdit(button.getAttribute('data-session-edit-cancel') || '');
+    });
+  });
+
+  switchSessionListEl.querySelectorAll('input[data-session-edit-input]').forEach((input) => {
+    input.addEventListener('input', () => {
+      const sessionId = input.getAttribute('data-session-edit-input') || '';
+      state.sessionTitleDrafts = {
+        ...(state.sessionTitleDrafts || {}),
+        [sessionId]: input.value,
+      };
+    });
+    input.addEventListener('keydown', (event) => {
+      if (event.key !== 'Escape') return;
+      event.preventDefault();
+      const sessionId = input.getAttribute('data-session-edit-input') || '';
+      cancelSessionEdit(sessionId);
+    });
+  });
 }
 
 function isRunStepsExpanded(runId) {
@@ -655,6 +768,62 @@ async function createSession() {
   await selectSession(created.sessionId);
 }
 
+function beginSessionEdit(sessionId) {
+  if (!sessionId) return;
+  const session = state.value.sessions.find((item) => item.sessionId === sessionId);
+  if (!session) return;
+
+  const current = isDefaultSessionTitle(session.title) ? '' : String(session.title || '').trim();
+  state.editingSessionId = sessionId;
+  state.sessionTitleDrafts = {
+    ...(state.sessionTitleDrafts || {}),
+    [sessionId]: current,
+  };
+  renderSessions();
+
+  window.requestAnimationFrame(() => {
+    const input = switchSessionListEl.querySelector(`input[data-session-edit-input="${sessionId}"]`);
+    if (!input) return;
+    input.focus();
+    input.select();
+  });
+}
+
+function cancelSessionEdit(sessionId) {
+  if (!sessionId) return;
+  state.editingSessionId = null;
+  const nextDrafts = { ...(state.sessionTitleDrafts || {}) };
+  delete nextDrafts[sessionId];
+  state.sessionTitleDrafts = nextDrafts;
+  renderSessions();
+}
+
+async function updateSessionTitle(sessionId, rawTitle) {
+  const title = String(rawTitle || '').trim();
+  if (!sessionId) return;
+  if (!title) {
+    throw new Error('Session name cannot be empty');
+  }
+
+  const res = await api(`/v1/sessions/${encodeURIComponent(sessionId)}`, {
+    method: 'PATCH',
+    body: JSON.stringify({ title: title }),
+  });
+  if (!res.ok) {
+    const body = await res.json().catch(() => ({}));
+    throw new Error(body.error || 'Unable to rename session');
+  }
+
+  state.editingSessionId = null;
+  const nextDrafts = { ...(state.sessionTitleDrafts || {}) };
+  delete nextDrafts[sessionId];
+  state.sessionTitleDrafts = nextDrafts;
+
+  const activeSessionId = state.value.activeSessionId || sessionId;
+  await loadSessions(activeSessionId);
+  setStatus('ready', 'Ready');
+}
+
 async function updateActiveSessionModel(model) {
   const sessionId = state.value.activeSessionId;
   if (!sessionId) return;
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index 9b63e6a..cfadc94 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -3,6 +3,7 @@ import test from 'node:test';
 import assert from 'node:assert/strict';
 
 const html = fs.readFileSync('extension/agent-panel.html', 'utf8');
+const css = fs.readFileSync('extension/agent-panel.css', 'utf8');
 
 test('agent panel has inline model and session selectors with popovers', () => {
   assert.match(html, /id="bf-model-trigger"/);
@@ -22,3 +23,12 @@ test('agent panel no longer renders title or persistent session sidebar', () =>
   assert.doesNotMatch(html, /<h1/);
   assert.doesNotMatch(html, /<aside class="sessions">/);
 });
+
+test('agent panel does not render a BrowserForce heading bar', () => {
+  assert.doesNotMatch(html, /class="brand-name"/);
+});
+
+test('agent panel keeps horizontal overflow contained in transcript cards', () => {
+  assert.match(css, /\.transcript[\s\S]*overflow-x:\s*hidden/);
+  assert.match(css, /\.bubble-assistant code[\s\S]*overflow-wrap:\s*anywhere/);
+});
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 7a0d415..d108862 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -33,3 +33,26 @@ test('enter key submits composer and shift+enter keeps newline', () => {
   assert.match(js, /event\.preventDefault\(\);/);
   assert.match(js, /chatFormEl\.requestSubmit\(\);/);
 });
+
+test('session labels fall back to session id when title is default', () => {
+  assert.match(js, /function isDefaultSessionTitle\(title\)/);
+  assert.match(js, /new session/);
+  assert.match(js, /new chat/);
+  assert.match(js, /function formatSessionDisplayName\(session\)/);
+  assert.match(js, /session\.sessionId/);
+});
+
+test('session popover supports inline rename and saves via session patch endpoint', () => {
+  assert.match(js, /data-session-edit-btn/);
+  assert.match(js, /data-session-edit-form/);
+  assert.match(js, /async function updateSessionTitle/);
+  assert.match(js, /\/v1\/sessions\/\$\{encodeURIComponent\(sessionId\)\}/);
+  assert.match(js, /method:\s*'PATCH'/);
+  assert.match(js, /JSON\.stringify\(\{\s*title\s*:\s*title/);
+});
+
+test('session popover renders per-session timestamp metadata', () => {
+  assert.match(js, /function formatSessionTimestamp/);
+  assert.match(js, /updatedAt|createdAt/);
+  assert.match(js, /toLocaleString/);
+});

From e9998340810265065fdb00f919eb0a0e03851afd Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 15:43:49 +0530
Subject: [PATCH 124/192] Persist run steps for session transcript rehydration

---
 agent/src/chatd.js                  | 167 +++++++++++++++++++++++++++-
 agent/src/session-store.js          |  42 ++++++-
 extension/agent-panel-state.js      |  99 ++++++++++++++++-
 test/agent/chatd-api.test.js        |  66 +++++++++++
 test/agent/session-store.test.js    |  16 +++
 test/agent/session-ui-state.test.js |  25 +++++
 6 files changed, 411 insertions(+), 4 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index cd1164f..80a3838 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -248,6 +248,156 @@ function normalizeBrowserContext(raw) {
   return { tabId, title, url };
 }
 
+function firstString(values) {
+  for (const value of values) {
+    if (typeof value === 'string' && value.trim()) return value.trim();
+  }
+  return '';
+}
+
+function trimStepLabel(label) {
+  const text = String(label || '').trim();
+  if (!text) return '';
+  return text.length > 160 ? `${text.slice(0, 157)}...` : text;
+}
+
+function pushRunStep(run, step) {
+  if (!run) return;
+  const steps = Array.isArray(run.steps) ? run.steps : [];
+  const normalized = {
+    kind: String(step?.kind || '').trim() || 'reasoning',
+    status: String(step?.status || '').trim() || 'running',
+    label: trimStepLabel(step?.label),
+  };
+  if (!normalized.label) return;
+  const last = steps[steps.length - 1];
+  if (last && last.label === normalized.label && last.kind === normalized.kind && last.status === normalized.status) {
+    return;
+  }
+  steps.push(normalized);
+  if (steps.length > 100) steps.shift();
+  run.steps = steps;
+}
+
+function stepLabelForToolEvent(evt) {
+  const payload = evt?.payload || {};
+  if (evt.event === 'tool.started') {
+    return firstString([
+      payload.title,
+      payload.name,
+      payload.tool,
+      payload.toolName,
+      payload.command,
+    ]) || 'Tool call started';
+  }
+  if (evt.event === 'tool.final') {
+    return firstString([
+      payload.title,
+      payload.name,
+      payload.tool,
+      payload.toolName,
+      payload.command,
+    ]) || 'Tool call completed';
+  }
+  if (evt.event === 'tool.delta') {
+    return firstString([
+      payload.text,
+      payload.message,
+      payload.delta,
+      payload.command,
+      payload.name,
+      payload.tool,
+      payload.toolName,
+      payload.type === 'reasoning' ? 'Reasoning' : '',
+    ]) || 'Working...';
+  }
+  return '';
+}
+
+function humanizeToken(value) {
+  const normalized = String(value || '')
+    .trim()
+    .replace(/[_./-]+/g, ' ')
+    .replace(/\s+/g, ' ');
+  if (!normalized) return '';
+  return normalized.charAt(0).toUpperCase() + normalized.slice(1);
+}
+
+function stepStatusForRunEvent(evt) {
+  const payload = evt?.payload || {};
+  const type = String(payload.type || '').toLowerCase();
+  if (/error|failed|aborted/.test(type)) return 'failed';
+  if (/completed|final|done|finished|succeeded|success|end/.test(type)) return 'done';
+  return 'running';
+}
+
+function stepKindForRunEvent(evt) {
+  const payload = evt?.payload || {};
+  const itemType = String(payload?.item?.type || '').toLowerCase();
+  const eventType = String(payload?.type || '').toLowerCase();
+  if (/reason/.test(itemType) || /reason/.test(eventType)) return 'reasoning';
+  return 'tool';
+}
+
+function stepLabelForRunEvent(evt) {
+  const payload = evt?.payload || {};
+  const item = payload?.item && typeof payload.item === 'object' ? payload.item : {};
+  return firstString([
+    payload.title,
+    payload.message,
+    payload.text,
+    payload.status,
+    item.summary,
+    item.text,
+    item.message,
+    item.title,
+    item.name,
+    item.tool,
+    item.command,
+    item.type ? humanizeToken(item.type) : '',
+    payload.type ? humanizeToken(payload.type) : '',
+  ]) || 'Working...';
+}
+
+function trackRunStep(run, evt) {
+  if (!run || !evt?.event) return;
+
+  if (evt.event === 'tool.started' || evt.event === 'tool.delta' || evt.event === 'tool.final') {
+    pushRunStep(run, {
+      kind: evt.event === 'tool.delta' ? 'reasoning' : 'tool',
+      status: evt.event === 'tool.final' ? 'done' : 'running',
+      label: stepLabelForToolEvent(evt),
+    });
+    return;
+  }
+
+  if (evt.event === 'run.event') {
+    pushRunStep(run, {
+      kind: stepKindForRunEvent(evt),
+      status: stepStatusForRunEvent(evt),
+      label: stepLabelForRunEvent(evt),
+    });
+    return;
+  }
+
+  if (evt.event === 'run.error') {
+    pushRunStep(run, {
+      kind: 'status',
+      status: 'failed',
+      label: `Failed: ${evt.payload?.error || 'Unknown error'}`,
+    });
+    return;
+  }
+
+  if (evt.event === 'run.aborted') {
+    pushRunStep(run, {
+      kind: 'status',
+      status: 'aborted',
+      label: 'Stopped',
+    });
+  }
+}
+
 function buildRunPrompt({ message, browserContext }) {
   if (!browserContext) return message;
 
@@ -260,7 +410,10 @@ function buildRunPrompt({ message, browserContext }) {
   lines.push('Inspect the active page and answer directly when the user asks about what is on this tab.');
   lines.push('Do not ask for permission to inspect the active page.');
   lines.push('Assume the user is referring to this active tab unless they explicitly say otherwise.');
-  lines.push('If the request is ambiguous or you are not sure, ask the user a clarifying question before acting.');
+  lines.push('When the user asks what you can see, asks about this page/tab, or requests a summary of the current page, inspect the active page and answer directly.');
+  lines.push('Use BrowserForce browser tools to read the current page content before replying in these cases.');
+  lines.push('Do not ask for permission to inspect, and do not say you only have tab metadata.');
+  lines.push('If the request is still ambiguous after inspecting, ask one focused clarifying question.');
   lines.push('');
   lines.push(`User request: ${message}`);
   return lines.join('\n');
@@ -358,7 +511,14 @@ export async function startChatd(opts = {}) {
     if (!run || run.status !== 'running' || run.finalSent) return;
     run.finalSent = true;
     run.status = 'done';
-    await appendMessage({ sessionId: run.sessionId, role: 'assistant', text: finalText, storageRoot });
+    await appendMessage({
+      sessionId: run.sessionId,
+      role: 'assistant',
+      text: finalText,
+      runId: run.runId,
+      steps: run.steps,
+      storageRoot,
+    });
     broadcast(buildEvent({ event: 'chat.final', runId: run.runId, sessionId: run.sessionId, payload: { text: finalText } }));
     runs.delete(run.runId);
   }
@@ -555,6 +715,7 @@ export async function startChatd(opts = {}) {
           status: 'running',
           abort: null,
           assistantBuffer: '',
+          steps: [],
           finalSent: false,
           queue: Promise.resolve(),
         };
@@ -593,6 +754,7 @@ export async function startChatd(opts = {}) {
                 }
 
                 if (evt.event === 'run.error') {
+                  trackRunStep(active, evt);
                   failRun(active, evt.payload?.error || 'Run failed');
                   return;
                 }
@@ -601,6 +763,7 @@ export async function startChatd(opts = {}) {
                   return;
                 }
 
+                trackRunStep(active, evt);
                 broadcast(buildEvent({ event: evt.event, runId, sessionId, payload: evt.payload }));
               });
             },
diff --git a/agent/src/session-store.js b/agent/src/session-store.js
index 8713a93..c061015 100644
--- a/agent/src/session-store.js
+++ b/agent/src/session-store.js
@@ -6,6 +6,7 @@ import { randomUUID } from 'node:crypto';
 const DEFAULT_STORAGE_ROOT = join(homedir(), '.browserforce', 'agent', 'sessions');
 const INDEX_FILE = 'index.json';
 const SESSION_ID_RE = /^[A-Za-z0-9_-]{1,128}$/;
+const RUN_ID_RE = /^[A-Za-z0-9_-]{1,256}$/;
 const MODEL_ID_RE = /^[A-Za-z0-9._:/-]{1,128}$/;
 const indexWriteQueues = new Map();
 
@@ -35,6 +36,37 @@ function assertValidSessionId(sessionId, fnName) {
   }
 }
 
+function normalizeRunId(runId) {
+  if (runId == null) return null;
+  const normalized = String(runId).trim();
+  if (!normalized) return null;
+  if (!RUN_ID_RE.test(normalized)) {
+    throw new Error('appendMessage requires a safe runId');
+  }
+  return normalized;
+}
+
+function normalizeStep(step) {
+  if (!step || typeof step !== 'object') return null;
+  const label = String(step.label || '').trim();
+  if (!label) return null;
+  const kind = String(step.kind || '').trim() || 'reasoning';
+  const status = String(step.status || '').trim() || 'running';
+  return {
+    kind,
+    status,
+    label: label.length > 160 ? `${label.slice(0, 157)}...` : label,
+  };
+}
+
+function normalizeSteps(steps) {
+  if (!Array.isArray(steps)) return [];
+  return steps
+    .map(normalizeStep)
+    .filter(Boolean)
+    .slice(-100);
+}
+
 async function ensureStorageRoot(storageRoot) {
   await fs.mkdir(storageRoot, { recursive: true });
 }
@@ -169,7 +201,7 @@ export async function updateSession({ sessionId, patch = {}, storageRoot } = {})
   });
 }
 
-export async function appendMessage({ sessionId, role, text, storageRoot } = {}) {
+export async function appendMessage({ sessionId, role, text, runId, steps, storageRoot } = {}) {
   assertValidSessionId(sessionId, 'appendMessage');
   if (!role) throw new Error('appendMessage requires role');
   if (typeof text !== 'string') throw new Error('appendMessage requires text');
@@ -185,6 +217,14 @@ export async function appendMessage({ sessionId, role, text, storageRoot } = {})
     text,
     createdAt: now,
   };
+  const safeRunId = normalizeRunId(runId);
+  if (safeRunId) {
+    entry.runId = safeRunId;
+  }
+  const normalizedSteps = normalizeSteps(steps);
+  if (normalizedSteps.length > 0) {
+    entry.steps = normalizedSteps;
+  }
 
   const logPath = messageLogPath(root, sessionId);
   await fs.appendFile(logPath, `${JSON.stringify(entry)}\n`, 'utf8');
diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index eab2998..9b665a8 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -70,6 +70,51 @@ function stepLabelForToolEvent(evt) {
   return '';
 }
 
+function humanizeToken(value) {
+  const normalized = String(value || '')
+    .trim()
+    .replace(/[_./-]+/g, ' ')
+    .replace(/\s+/g, ' ');
+  if (!normalized) return '';
+  return normalized.charAt(0).toUpperCase() + normalized.slice(1);
+}
+
+function stepStatusForRunEvent(evt) {
+  const payload = evt?.payload || {};
+  const type = String(payload.type || '').toLowerCase();
+  if (/error|failed|aborted/.test(type)) return 'failed';
+  if (/completed|final|done|finished|succeeded|success|end/.test(type)) return 'done';
+  return 'running';
+}
+
+function stepKindForRunEvent(evt) {
+  const payload = evt?.payload || {};
+  const itemType = String(payload?.item?.type || '').toLowerCase();
+  const eventType = String(payload?.type || '').toLowerCase();
+  if (/reason/.test(itemType) || /reason/.test(eventType)) return 'reasoning';
+  return 'tool';
+}
+
+function stepLabelForRunEvent(evt) {
+  const payload = evt?.payload || {};
+  const item = payload?.item && typeof payload.item === 'object' ? payload.item : {};
+  return firstString([
+    payload.title,
+    payload.message,
+    payload.text,
+    payload.status,
+    item.summary,
+    item.text,
+    item.message,
+    item.title,
+    item.name,
+    item.tool,
+    item.command,
+    item.type ? humanizeToken(item.type) : '',
+    payload.type ? humanizeToken(payload.type) : '',
+  ]) || 'Working...';
+}
+
 function upsertRun(state, runId, patch) {
   return {
     ...state.runs,
@@ -80,6 +125,37 @@ function upsertRun(state, runId, patch) {
   };
 }
 
+function normalizeStoredStep(step) {
+  if (!step || typeof step !== 'object') return null;
+  const label = trimStepLabel(step.label);
+  if (!label) return null;
+  return {
+    kind: step.kind || 'reasoning',
+    status: step.status || 'running',
+    label,
+  };
+}
+
+function hydrateRunsFromMessages(messages, sessionId, currentRuns) {
+  const hydrated = {};
+  for (const message of messages) {
+    const runId = typeof message?.runId === 'string' ? message.runId.trim() : '';
+    if (!runId) continue;
+    const steps = Array.isArray(message?.steps)
+      ? message.steps.map(normalizeStoredStep).filter(Boolean)
+      : [];
+    hydrated[runId] = {
+      ...(currentRuns?.[runId] || { runId, text: '', done: false, steps: [] }),
+      runId,
+      sessionId,
+      text: typeof message?.text === 'string' ? message.text : (currentRuns?.[runId]?.text || ''),
+      done: true,
+      steps: steps.length > 0 ? steps : (currentRuns?.[runId]?.steps || []),
+    };
+  }
+  return hydrated;
+}
+
 export function reduceState(state = initialState, action = {}) {
   if (action.type === 'session.list.loaded') {
     const sessions = Array.isArray(action.sessions) ? action.sessions : [];
@@ -102,11 +178,17 @@ export function reduceState(state = initialState, action = {}) {
   }
 
   if (action.type === 'messages.loaded') {
+    const messages = Array.isArray(action.messages) ? action.messages : [];
+    const hydratedRuns = hydrateRunsFromMessages(messages, action.sessionId, state.runs);
     return {
       ...state,
       messagesBySession: {
         ...state.messagesBySession,
-        [action.sessionId]: Array.isArray(action.messages) ? action.messages : [],
+        [action.sessionId]: messages,
+      },
+      runs: {
+        ...state.runs,
+        ...hydratedRuns,
       },
     };
   }
@@ -220,5 +302,20 @@ export function applyEvent(state = initialState, evt = {}) {
     };
   }
 
+  if (evt.event === 'run.event') {
+    const run = state.runs[evt.runId] || { text: '', done: false, steps: [] };
+    const status = stepStatusForRunEvent(evt);
+    const kind = stepKindForRunEvent(evt);
+    const label = stepLabelForRunEvent(evt);
+    return {
+      ...state,
+      runs: upsertRun(state, evt.runId, {
+        sessionId: evt.sessionId,
+        done: false,
+        steps: pushStep(run, { kind, status, label }),
+      }),
+    };
+  }
+
   return state;
 }
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 7237f2a..9d07b96 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -211,6 +211,72 @@ test('POST /v1/runs uses injected run executor and persists assistant output', a
   }
 });
 
+test('POST /v1/runs persists run steps so reopened sessions can render them', async () => {
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    runExecutor: ({ runId, sessionId, onEvent, onExit }) => {
+      setTimeout(() => {
+        onEvent({ event: 'tool.started', runId, sessionId, payload: { tool: 'snapshot' } });
+      }, 5);
+      setTimeout(() => {
+        onEvent({
+          event: 'tool.delta',
+          runId,
+          sessionId,
+          payload: { type: 'reasoning', text: 'Inspecting active tab' },
+        });
+      }, 10);
+      setTimeout(() => {
+        onEvent({ event: 'tool.final', runId, sessionId, payload: { tool: 'snapshot' } });
+      }, 15);
+      setTimeout(() => {
+        onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'done' } });
+      }, 20);
+      setTimeout(() => onExit({ code: 0 }), 25);
+      return { abort() {} };
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Steps' }),
+    }).then((res) => res.json());
+
+    const runRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'hi' }),
+    });
+    assert.equal(runRes.status, 202);
+    const runBody = await runRes.json();
+
+    await new Promise((resolve) => setTimeout(resolve, 80));
+
+    const messagesBody = await fetch(
+      `${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}/messages`,
+      { headers: { authorization: `Bearer ${daemon.token}` } },
+    ).then((res) => res.json());
+    const assistant = (messagesBody.messages || []).at(-1);
+
+    assert.equal(assistant?.role, 'assistant');
+    assert.equal(assistant?.runId, runBody.runId);
+    assert.equal(Array.isArray(assistant?.steps), true);
+    assert.equal(assistant.steps.length >= 1, true);
+    assert.equal(assistant.steps.some((step) => /Inspecting active tab/.test(step?.label || '')), true);
+  } finally {
+    await daemon.stop();
+  }
+});
+
 test('runExecutor synchronous failure does not leak abortable run', async () => {
   const storageRoot = mkdtempSync(join(tmpdir(), 'bf-chatd-run-fail-'));
   let attemptedRunId = null;
diff --git a/test/agent/session-store.test.js b/test/agent/session-store.test.js
index 1816e8d..f2cae2b 100644
--- a/test/agent/session-store.test.js
+++ b/test/agent/session-store.test.js
@@ -34,6 +34,22 @@ test('messages are stored and loaded by sessionId', async () => {
   assert.equal(rows.at(-1).text, 'hello');
 });
 
+test('messages preserve optional run metadata used for transcript rehydration', async () => {
+  const { sessionId } = await createSession({ title: 'Runs', storageRoot });
+  await appendMessage({
+    sessionId,
+    role: 'assistant',
+    text: 'done',
+    runId: 'run_123',
+    steps: [{ kind: 'tool', status: 'done', label: 'Snapshot page' }],
+    storageRoot,
+  });
+  const rows = await readMessages({ sessionId, limit: 20, storageRoot });
+  const last = rows.at(-1);
+  assert.equal(last.runId, 'run_123');
+  assert.deepEqual(last.steps, [{ kind: 'tool', status: 'done', label: 'Snapshot page' }]);
+});
+
 test('rejects unsafe session ids', async () => {
   await assert.rejects(
     appendMessage({ sessionId: '../escape', role: 'user', text: 'x', storageRoot }),
diff --git a/test/agent/session-ui-state.test.js b/test/agent/session-ui-state.test.js
index d0ebd3f..e7701f5 100644
--- a/test/agent/session-ui-state.test.js
+++ b/test/agent/session-ui-state.test.js
@@ -34,3 +34,28 @@ test('messages.loaded hydrates transcript for the selected session', () => {
 
   assert.equal(next.messagesBySession.s1[0].text, 'hello');
 });
+
+test('messages.loaded hydrates stored run metadata for reopened sessions', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {},
+  };
+
+  const next = reduceState(state, {
+    type: 'messages.loaded',
+    sessionId: 's1',
+    messages: [{
+      role: 'assistant',
+      text: 'Done',
+      runId: 'run_1',
+      steps: [{ kind: 'tool', status: 'done', label: 'Snapshot page' }],
+    }],
+  });
+
+  assert.equal(next.runs.run_1?.done, true);
+  assert.equal(next.runs.run_1?.sessionId, 's1');
+  assert.equal(next.runs.run_1?.steps?.length, 1);
+  assert.equal(next.runs.run_1?.steps?.[0]?.label, 'Snapshot page');
+});

From 4a27ec04f6cdf47fd9fe10af1be92801a4c0d4e8 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:06:36 +0530
Subject: [PATCH 125/192] test(agent): add codex exec/resume fixtures and
 contracts

---
 scripts/capture-codex-jsonl.sh                | 41 ++++++++++++++++
 test/fixtures/codex/events/README.md          | 49 +++++++++++++++++++
 test/fixtures/codex/events/exec-sample.jsonl  |  5 ++
 .../codex/events/failed-resume-exit-code.txt  |  1 +
 .../codex/events/failed-resume-sample.jsonl   |  5 ++
 .../codex/events/failed-resume-stderr.txt     |  1 +
 .../fixtures/codex/events/resume-sample.jsonl |  4 ++
 7 files changed, 106 insertions(+)
 create mode 100755 scripts/capture-codex-jsonl.sh
 create mode 100644 test/fixtures/codex/events/README.md
 create mode 100644 test/fixtures/codex/events/exec-sample.jsonl
 create mode 100644 test/fixtures/codex/events/failed-resume-exit-code.txt
 create mode 100644 test/fixtures/codex/events/failed-resume-sample.jsonl
 create mode 100644 test/fixtures/codex/events/failed-resume-stderr.txt
 create mode 100644 test/fixtures/codex/events/resume-sample.jsonl

diff --git a/scripts/capture-codex-jsonl.sh b/scripts/capture-codex-jsonl.sh
new file mode 100755
index 0000000..fb77e7c
--- /dev/null
+++ b/scripts/capture-codex-jsonl.sh
@@ -0,0 +1,41 @@
+#!/usr/bin/env bash
+set -euo pipefail
+
+OUT_DIR="${1:-test/fixtures/codex/events}"
+PROMPT="${2:-Reply with one short sentence and stop.}"
+
+mkdir -p "$OUT_DIR"
+
+codex exec --json "$PROMPT" > "$OUT_DIR/exec-sample.jsonl"
+
+SESSION_ID="$(node - "$OUT_DIR/exec-sample.jsonl" <<'NODE'
+const fs = require('fs');
+const file = process.argv[2];
+const lines = fs.readFileSync(file, 'utf8').split(/\n+/).filter(Boolean);
+for (const line of lines) {
+  let parsed;
+  try {
+    parsed = JSON.parse(line);
+  } catch {
+    continue;
+  }
+  if (parsed && parsed.type === 'thread.started' && typeof parsed.thread_id === 'string' && parsed.thread_id.trim()) {
+    process.stdout.write(parsed.thread_id.trim());
+    process.exit(0);
+  }
+}
+process.exit(1);
+NODE
+)"
+
+codex exec resume "$SESSION_ID" --json "$PROMPT" > "$OUT_DIR/resume-sample.jsonl"
+
+INVALID_SESSION_ID="00000000-0000-0000-0000-000000000000"
+set +e
+codex exec resume "$INVALID_SESSION_ID" --json "$PROMPT" > "$OUT_DIR/failed-resume-sample.jsonl" 2> "$OUT_DIR/failed-resume-stderr.txt"
+EXIT_CODE=$?
+set -e
+
+printf '%s\n' "$EXIT_CODE" > "$OUT_DIR/failed-resume-exit-code.txt"
+
+echo "Captured fixtures in $OUT_DIR"
diff --git a/test/fixtures/codex/events/README.md b/test/fixtures/codex/events/README.md
new file mode 100644
index 0000000..6f52147
--- /dev/null
+++ b/test/fixtures/codex/events/README.md
@@ -0,0 +1,49 @@
+# Codex JSONL Fixture Contracts
+
+Captured with `scripts/capture-codex-jsonl.sh` on 2026-03-03 using `codex-cli 0.106.0`.
+
+## Provider Session ID Extraction
+
+Use only this path:
+
+- Event: `type === "thread.started"`
+- Field: `thread_id`
+
+Normalized contract:
+
+- `run.provider_session.payload.provider = "codex"`
+- `run.provider_session.payload.sessionId = <thread_id>`
+
+## Usage Telemetry Extraction
+
+Use only this path in current fixtures:
+
+- Event: `type === "turn.completed"`
+- Field object: `usage`
+- Fields: `usage.input_tokens`, `usage.cached_input_tokens`, `usage.output_tokens`
+
+Normalization contract:
+
+- `inputTokens = usage.input_tokens`
+- `cachedInputTokens = usage.cached_input_tokens`
+- `outputTokens = usage.output_tokens`
+- `totalTokens = inputTokens + outputTokens`
+- `modelContextWindow = null` (not emitted in these fixtures)
+
+## Failed Resume Signature
+
+Fixture command:
+
+- `codex exec resume "00000000-0000-0000-0000-000000000000" --json "..."`
+
+Observed behavior in this environment:
+
+- Exit code is zero (`failed-resume-exit-code.txt` contains `0`)
+- No failure JSON event is emitted
+- A normal `thread.started` event appears, but with a **new** `thread_id` (fresh session)
+- `stderr` may contain shell snapshot warnings; this text is non-deterministic and is **not** used as a retry signature
+
+Implication for retry logic:
+
+- Invalid/stale resume IDs are currently soft-fallbacked by Codex itself to a fresh thread
+- `isResumeSessionInvalidFailure(...)` should only trigger on explicit hard-failure signatures (none observed in these fixtures)
diff --git a/test/fixtures/codex/events/exec-sample.jsonl b/test/fixtures/codex/events/exec-sample.jsonl
new file mode 100644
index 0000000..ef58c7a
--- /dev/null
+++ b/test/fixtures/codex/events/exec-sample.jsonl
@@ -0,0 +1,5 @@
+{"type":"thread.started","thread_id":"019cb36d-0554-7a91-aee0-17a840d36372"}
+{"type":"turn.started"}
+{"type":"item.completed","item":{"id":"item_0","type":"reasoning","text":"**Confirming single-sentence response**"}}
+{"type":"item.completed","item":{"id":"item_1","type":"agent_message","text":"Understood, I will stop after this sentence."}}
+{"type":"turn.completed","usage":{"input_tokens":15166,"cached_input_tokens":3456,"output_tokens":289}}
diff --git a/test/fixtures/codex/events/failed-resume-exit-code.txt b/test/fixtures/codex/events/failed-resume-exit-code.txt
new file mode 100644
index 0000000..573541a
--- /dev/null
+++ b/test/fixtures/codex/events/failed-resume-exit-code.txt
@@ -0,0 +1 @@
+0
diff --git a/test/fixtures/codex/events/failed-resume-sample.jsonl b/test/fixtures/codex/events/failed-resume-sample.jsonl
new file mode 100644
index 0000000..de9989c
--- /dev/null
+++ b/test/fixtures/codex/events/failed-resume-sample.jsonl
@@ -0,0 +1,5 @@
+{"type":"thread.started","thread_id":"019cb36d-3da8-75c1-a959-d6db6519091d"}
+{"type":"turn.started"}
+{"type":"item.completed","item":{"id":"item_0","type":"reasoning","text":"**Confirming response approach**"}}
+{"type":"item.completed","item":{"id":"item_1","type":"agent_message","text":"This is the only sentence in my reply."}}
+{"type":"turn.completed","usage":{"input_tokens":15166,"cached_input_tokens":15104,"output_tokens":201}}
diff --git a/test/fixtures/codex/events/failed-resume-stderr.txt b/test/fixtures/codex/events/failed-resume-stderr.txt
new file mode 100644
index 0000000..9cc5f58
--- /dev/null
+++ b/test/fixtures/codex/events/failed-resume-stderr.txt
@@ -0,0 +1 @@
+2026-03-03T11:20:07.699206Z  WARN codex_core::shell_snapshot: Failed to delete shell snapshot at "/Users/valsaraj/.codex/shell_snapshots/019cb36d-3da8-75c1-a959-d6db6519091d.tmp-1772536806825338000": Os { code: 2, kind: NotFound, message: "No such file or directory" }
diff --git a/test/fixtures/codex/events/resume-sample.jsonl b/test/fixtures/codex/events/resume-sample.jsonl
new file mode 100644
index 0000000..23f78e3
--- /dev/null
+++ b/test/fixtures/codex/events/resume-sample.jsonl
@@ -0,0 +1,4 @@
+{"type":"thread.started","thread_id":"019cb36d-0554-7a91-aee0-17a840d36372"}
+{"type":"turn.started"}
+{"type":"item.completed","item":{"id":"item_0","type":"agent_message","text":"I will stop after this short sentence."}}
+{"type":"turn.completed","usage":{"input_tokens":30635,"cached_input_tokens":18816,"output_tokens":301}}

From d32285371e91989a7551c53cd8a5c0cb4ed78794 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:06:41 +0530
Subject: [PATCH 126/192] feat(agent): persist codex provider session and usage
 metadata

---
 agent/src/session-store.js       | 86 ++++++++++++++++++++++++++++++++
 test/agent/session-store.test.js | 28 +++++++++++
 2 files changed, 114 insertions(+)

diff --git a/agent/src/session-store.js b/agent/src/session-store.js
index c061015..6ff7759 100644
--- a/agent/src/session-store.js
+++ b/agent/src/session-store.js
@@ -10,6 +10,10 @@ const RUN_ID_RE = /^[A-Za-z0-9_-]{1,256}$/;
 const MODEL_ID_RE = /^[A-Za-z0-9._:/-]{1,128}$/;
 const indexWriteQueues = new Map();
 
+function isObject(value) {
+  return !!value && typeof value === 'object' && !Array.isArray(value);
+}
+
 function resolveStorageRoot(storageRoot) {
   return storageRoot || DEFAULT_STORAGE_ROOT;
 }
@@ -124,6 +128,83 @@ function normalizeModel(model) {
   return trimmed;
 }
 
+function normalizeUsageNumber(value, fieldName) {
+  if (value == null) return null;
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed < 0) {
+    throw new Error(`providerState.codex.latestUsage.${fieldName} must be a non-negative number`);
+  }
+  return Math.round(parsed);
+}
+
+function normalizeLatestUsage(latestUsage) {
+  if (latestUsage == null) return null;
+  if (!isObject(latestUsage)) {
+    throw new Error('providerState.codex.latestUsage must be an object');
+  }
+
+  const fields = [
+    'modelContextWindow',
+    'totalTokens',
+    'inputTokens',
+    'cachedInputTokens',
+    'outputTokens',
+    'reasoningOutputTokens',
+  ];
+
+  const normalized = {};
+  for (const field of fields) {
+    if (!Object.prototype.hasOwnProperty.call(latestUsage, field)) continue;
+    const value = normalizeUsageNumber(latestUsage[field], field);
+    if (value != null) normalized[field] = value;
+  }
+  return Object.keys(normalized).length > 0 ? normalized : null;
+}
+
+function normalizeCodexProviderState(patchCodex, currentCodex) {
+  if (patchCodex == null) return null;
+  if (!isObject(patchCodex)) {
+    throw new Error('providerState.codex must be an object');
+  }
+
+  const normalized = isObject(currentCodex) ? { ...currentCodex } : {};
+
+  if (Object.prototype.hasOwnProperty.call(patchCodex, 'sessionId')) {
+    if (patchCodex.sessionId == null || String(patchCodex.sessionId).trim() === '') {
+      delete normalized.sessionId;
+    } else {
+      const sessionId = String(patchCodex.sessionId).trim();
+      if (!isValidSessionId(sessionId)) {
+        throw new Error('providerState.codex.sessionId must be a safe session id');
+      }
+      normalized.sessionId = sessionId;
+    }
+  }
+
+  if (Object.prototype.hasOwnProperty.call(patchCodex, 'latestUsage')) {
+    const latestUsage = normalizeLatestUsage(patchCodex.latestUsage);
+    if (latestUsage == null) delete normalized.latestUsage;
+    else normalized.latestUsage = latestUsage;
+  }
+
+  return Object.keys(normalized).length > 0 ? normalized : null;
+}
+
+function normalizeProviderState(providerStatePatch, currentProviderState) {
+  if (!isObject(providerStatePatch)) {
+    throw new Error('providerState must be an object');
+  }
+  const normalized = isObject(currentProviderState) ? { ...currentProviderState } : {};
+
+  if (Object.prototype.hasOwnProperty.call(providerStatePatch, 'codex')) {
+    const codex = normalizeCodexProviderState(providerStatePatch.codex, normalized.codex);
+    if (codex == null) delete normalized.codex;
+    else normalized.codex = codex;
+  }
+
+  return Object.keys(normalized).length > 0 ? normalized : null;
+}
+
 function sortSessionsNewestFirst(a, b) {
   const aTs = Date.parse(a.updatedAt || a.createdAt || 0);
   const bTs = Date.parse(b.updatedAt || b.createdAt || 0);
@@ -194,6 +275,11 @@ export async function updateSession({ sessionId, patch = {}, storageRoot } = {})
     if (Object.prototype.hasOwnProperty.call(patch, 'model')) {
       next.model = normalizeModel(patch.model);
     }
+    if (Object.prototype.hasOwnProperty.call(patch, 'providerState')) {
+      const providerState = normalizeProviderState(patch.providerState, current.providerState);
+      if (providerState == null) delete next.providerState;
+      else next.providerState = providerState;
+    }
     next.updatedAt = now;
     sessions[idx] = next;
     await writeIndex(root, sessions);
diff --git a/test/agent/session-store.test.js b/test/agent/session-store.test.js
index f2cae2b..a134e65 100644
--- a/test/agent/session-store.test.js
+++ b/test/agent/session-store.test.js
@@ -86,6 +86,34 @@ test('updateSession persists per-session model and title', async () => {
   assert.equal(row?.model, 'gpt-5');
 });
 
+test('updateSession persists codex provider session mapping', async () => {
+  const created = await createSession({ title: 'Continuity', storageRoot });
+  const updated = await updateSession({
+    sessionId: created.sessionId,
+    patch: {
+      providerState: {
+        codex: {
+          sessionId: '019caa6f-8c63-7c81-a542-3dbcf922d065',
+          latestUsage: {
+            modelContextWindow: 258400,
+            totalTokens: 128125,
+            cachedInputTokens: 126592,
+          },
+        },
+      },
+    },
+    storageRoot,
+  });
+
+  assert.equal(updated?.providerState?.codex?.sessionId, '019caa6f-8c63-7c81-a542-3dbcf922d065');
+  assert.equal(updated?.providerState?.codex?.latestUsage?.modelContextWindow, 258400);
+
+  const rows = await listSessions({ limit: 10, storageRoot });
+  const row = rows.find((item) => item.sessionId === created.sessionId);
+  assert.equal(row?.providerState?.codex?.sessionId, '019caa6f-8c63-7c81-a542-3dbcf922d065');
+  assert.equal(row?.providerState?.codex?.latestUsage?.totalTokens, 128125);
+});
+
 test('listSessions fails fast on corrupted index metadata', async () => {
   writeFileSync(join(storageRoot, 'index.json'), '{this-is-not-json\n', 'utf8');
   await assert.rejects(

From b45be4338a28d31df5b9cac4421b1ca3e15236ad Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:06:43 +0530
Subject: [PATCH 127/192] feat(agent): support codex resume args and usage
 event normalization

---
 agent/src/codex-runner.js       | 81 +++++++++++++++++++++++++++++++--
 test/agent/codex-runner.test.js | 62 +++++++++++++++++++++++++
 2 files changed, 139 insertions(+), 4 deletions(-)

diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
index 073f550..776f039 100644
--- a/agent/src/codex-runner.js
+++ b/agent/src/codex-runner.js
@@ -20,6 +20,34 @@ function safeParse(line) {
   }
 }
 
+function toCount(value) {
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed < 0) return null;
+  return Math.round(parsed);
+}
+
+function toUsagePayload(source = {}) {
+  const inputTokens = toCount(source.input_tokens ?? source.inputTokens);
+  const cachedInputTokens = toCount(source.cached_input_tokens ?? source.cachedInputTokens);
+  const outputTokens = toCount(source.output_tokens ?? source.outputTokens);
+  const reasoningOutputTokens = toCount(source.reasoning_output_tokens ?? source.reasoningOutputTokens);
+  const explicitTotalTokens = toCount(source.total_tokens ?? source.totalTokens);
+  const modelContextWindow = toCount(source.model_context_window ?? source.modelContextWindow);
+
+  const totalTokens = explicitTotalTokens != null
+    ? explicitTotalTokens
+    : ((inputTokens != null || outputTokens != null) ? (inputTokens || 0) + (outputTokens || 0) : null);
+
+  return {
+    modelContextWindow,
+    totalTokens,
+    inputTokens,
+    cachedInputTokens,
+    outputTokens,
+    reasoningOutputTokens,
+  };
+}
+
 export function normalizeCodexLine({ runId, sessionId, line }) {
   const parsed = safeParse(line);
   if (!parsed || typeof parsed !== 'object') {
@@ -28,6 +56,43 @@ export function normalizeCodexLine({ runId, sessionId, line }) {
 
   const type = String(parsed.type || '').toLowerCase();
 
+  if (type === 'thread.started') {
+    const providerSessionId = String(parsed.thread_id || '').trim();
+    if (providerSessionId) {
+      return envelope({
+        event: 'run.provider_session',
+        runId,
+        sessionId,
+        payload: { provider: 'codex', sessionId: providerSessionId },
+      });
+    }
+  }
+
+  if (type === 'turn.completed' && parsed.usage && typeof parsed.usage === 'object') {
+    return envelope({
+      event: 'run.usage',
+      runId,
+      sessionId,
+      payload: toUsagePayload(parsed.usage),
+    });
+  }
+
+  if (type === 'token_count' && parsed.info && typeof parsed.info === 'object') {
+    const usage = parsed.info.total_token_usage && typeof parsed.info.total_token_usage === 'object'
+      ? parsed.info.total_token_usage
+      : {};
+    return envelope({
+      event: 'run.usage',
+      runId,
+      sessionId,
+      payload: toUsagePayload({
+        ...usage,
+        model_context_window: parsed.info.model_context_window,
+        reasoning_output_tokens: parsed.info.reasoning_output_tokens,
+      }),
+    });
+  }
+
   if (type === 'delta' || type === 'text_delta') {
     return envelope({ event: 'chat.delta', runId, sessionId, payload: { delta: String(parsed.text || '') } });
   }
@@ -92,9 +157,12 @@ export function normalizeCodexLine({ runId, sessionId, line }) {
   return envelope({ event: 'run.event', runId, sessionId, payload: parsed });
 }
 
-export function buildCodexExecArgs({ prompt, model, args } = {}) {
+export function buildCodexExecArgs({ prompt, model, args, resumeSessionId } = {}) {
   if (Array.isArray(args) && args.length > 0) return args;
-  const resolved = ['exec', '--json'];
+  const resumeId = typeof resumeSessionId === 'string' ? resumeSessionId.trim() : '';
+  const resolved = resumeId
+    ? ['exec', 'resume', resumeId, '--json']
+    : ['exec', '--json'];
   if (typeof model === 'string' && model.trim()) {
     resolved.push('--model', model.trim());
   }
@@ -113,9 +181,10 @@ export function startCodexRun({
   command,
   args,
   model,
+  resumeSessionId,
 } = {}) {
   const cmd = command || process.env.BF_CHATD_CODEX_COMMAND || 'codex';
-  const argv = buildCodexExecArgs({ prompt, model, args });
+  const argv = buildCodexExecArgs({ prompt, model, args, resumeSessionId });
 
   const child = spawn(cmd, argv, {
     cwd,
@@ -123,6 +192,8 @@ export function startCodexRun({
     stdio: ['ignore', 'pipe', 'pipe'],
   });
 
+  const stderrChunks = [];
+
   const stdoutLines = readline.createInterface({ input: child.stdout });
   stdoutLines.on('line', (line) => {
     try {
@@ -136,6 +207,8 @@ export function startCodexRun({
   const stderrLines = readline.createInterface({ input: child.stderr });
   stderrLines.on('line', (line) => {
     if (!line) return;
+    stderrChunks.push(String(line));
+    if (stderrChunks.length > 200) stderrChunks.shift();
     onEvent?.(envelope({
       event: 'tool.delta',
       runId,
@@ -149,7 +222,7 @@ export function startCodexRun({
   });
 
   child.on('close', (code, signal) => {
-    onExit?.({ code, signal });
+    onExit?.({ code, signal, stderr: stderrChunks.join('\n') });
   });
 
   return {
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
index 6319aa9..b6f2205 100644
--- a/test/agent/codex-runner.test.js
+++ b/test/agent/codex-runner.test.js
@@ -39,6 +39,23 @@ test('buildCodexExecArgs includes --model when session model is set', () => {
   assert.deepEqual(args, ['exec', '--json', '--model', 'gpt-5', 'hi']);
 });
 
+test('buildCodexExecArgs emits resume invocation when codex session id is provided', () => {
+  const args = buildCodexExecArgs({
+    prompt: 'hi',
+    model: 'gpt-5',
+    resumeSessionId: '019caa6f-8c63-7c81-a542-3dbcf922d065',
+  });
+  assert.deepEqual(args, [
+    'exec',
+    'resume',
+    '019caa6f-8c63-7c81-a542-3dbcf922d065',
+    '--json',
+    '--model',
+    'gpt-5',
+    'hi',
+  ]);
+});
+
 test('buildCodexExecArgs omits --model when model is empty', () => {
   const args = buildCodexExecArgs({ prompt: 'hi', model: '' });
   assert.deepEqual(args, ['exec', '--json', 'hi']);
@@ -55,3 +72,48 @@ test('maps transient codex error line to non-fatal tool event', () => {
   assert.equal(evt.payload.level, 'warning');
   assert.match(evt.payload.message, /Reconnecting/);
 });
+
+test('maps codex turn.completed usage into run.usage event', () => {
+  const line = JSON.stringify({
+    type: 'turn.completed',
+    usage: {
+      input_tokens: 1000,
+      cached_input_tokens: 700,
+      output_tokens: 120,
+    },
+  });
+  const evt = normalizeCodexLine({ runId: 'r1', sessionId: 's1', line });
+  assert.equal(evt.event, 'run.usage');
+  assert.equal(evt.payload.modelContextWindow, null);
+  assert.equal(evt.payload.totalTokens, 1120);
+  assert.equal(evt.payload.inputTokens, 1000);
+  assert.equal(evt.payload.cachedInputTokens, 700);
+  assert.equal(evt.payload.outputTokens, 120);
+});
+
+test('maps codex token_count into run.usage event', () => {
+  const line = JSON.stringify({
+    type: 'token_count',
+    info: {
+      total_token_usage: { input_tokens: 1000, cached_input_tokens: 700, output_tokens: 120, total_tokens: 1120 },
+      model_context_window: 258400,
+      reasoning_output_tokens: 14,
+    },
+  });
+  const evt = normalizeCodexLine({ runId: 'r1', sessionId: 's1', line });
+  assert.equal(evt.event, 'run.usage');
+  assert.equal(evt.payload.modelContextWindow, 258400);
+  assert.equal(evt.payload.totalTokens, 1120);
+  assert.equal(evt.payload.reasoningOutputTokens, 14);
+});
+
+test('maps codex thread.started provider session id event to run.provider_session', () => {
+  const line = JSON.stringify({
+    type: 'thread.started',
+    thread_id: '019caa6f-8c63-7c81-a542-3dbcf922d065',
+  });
+  const evt = normalizeCodexLine({ runId: 'r1', sessionId: 's1', line });
+  assert.equal(evt.event, 'run.provider_session');
+  assert.equal(evt.payload.provider, 'codex');
+  assert.equal(evt.payload.sessionId, '019caa6f-8c63-7c81-a542-3dbcf922d065');
+});

From 4d6f5f046cdbe63f72e891197a47b382db156772 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:06:49 +0530
Subject: [PATCH 128/192] feat(agent): resume codex sessions with usage
 persistence and metadata endpoint

---
 agent/src/chatd.js           | 123 +++++++++++++-
 test/agent/chatd-api.test.js | 303 +++++++++++++++++++++++++++++++++++
 2 files changed, 421 insertions(+), 5 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 80a3838..480138b 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -248,6 +248,45 @@ function normalizeBrowserContext(raw) {
   return { tabId, title, url };
 }
 
+function normalizeUsageNumber(value) {
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed < 0) return null;
+  return Math.round(parsed);
+}
+
+function normalizeUsagePayload(payload) {
+  if (!payload || typeof payload !== 'object') return null;
+  const modelContextWindow = normalizeUsageNumber(payload.modelContextWindow);
+  const totalTokens = normalizeUsageNumber(payload.totalTokens);
+  const inputTokens = normalizeUsageNumber(payload.inputTokens);
+  const cachedInputTokens = normalizeUsageNumber(payload.cachedInputTokens);
+  const outputTokens = normalizeUsageNumber(payload.outputTokens);
+  const reasoningOutputTokens = normalizeUsageNumber(payload.reasoningOutputTokens);
+
+  const normalized = {
+    modelContextWindow,
+    totalTokens,
+    inputTokens,
+    cachedInputTokens,
+    outputTokens,
+    reasoningOutputTokens,
+  };
+
+  for (const [key, value] of Object.entries(normalized)) {
+    if (value == null) delete normalized[key];
+  }
+  return Object.keys(normalized).length > 0 ? normalized : null;
+}
+
+function isResumeSessionInvalidFailure({ code, error, stderr } = {}) {
+  if (!Number.isInteger(code) || code === 0) return false;
+  const text = `${String(error || '')}\n${String(stderr || '')}`.toLowerCase();
+  return (
+    /resume|session|thread/.test(text)
+    && /not found|unknown|invalid|no such|does not exist/.test(text)
+  );
+}
+
 function firstString(values) {
   for (const value of values) {
     if (typeof value === 'string' && value.trim()) return value.trim();
@@ -454,11 +493,12 @@ async function clearChatdUrlFile({ writeChatdUrl = true, urlPath = CHATD_URL_PAT
 }
 
 function createDefaultRunExecutor({ codexCwd } = {}) {
-  return ({ runId, sessionId, message, model, onEvent, onExit, onError }) => startCodexRun({
+  return ({ runId, sessionId, message, model, resumeSessionId, onEvent, onExit, onError }) => startCodexRun({
     runId,
     sessionId,
     prompt: message,
     model,
+    resumeSessionId,
     cwd: codexCwd,
     onEvent,
     onExit,
@@ -596,6 +636,21 @@ export async function startChatd(opts = {}) {
       }
 
       const sessionMatch = url.pathname.match(/^\/v1\/sessions\/([^/]+)$/);
+      if (sessionMatch && req.method === 'GET') {
+        const decodedSessionId = safeDecodeComponent(sessionMatch[1]);
+        if (!decodedSessionId || !isValidSessionId(decodedSessionId)) {
+          json(res, 400, { error: 'Invalid sessionId' });
+          return;
+        }
+        const session = await getSession({ sessionId: decodedSessionId, storageRoot });
+        if (!session) {
+          json(res, 404, { error: 'Session not found' });
+          return;
+        }
+        json(res, 200, session);
+        return;
+      }
+
       if (sessionMatch && req.method === 'PATCH') {
         const decodedSessionId = safeDecodeComponent(sessionMatch[1]);
         if (!decodedSessionId || !isValidSessionId(decodedSessionId)) {
@@ -718,6 +773,11 @@ export async function startChatd(opts = {}) {
           steps: [],
           finalSent: false,
           queue: Promise.resolve(),
+          lastError: null,
+          resumeRetryAttempted: false,
+          resumeSessionId: isValidSessionId(session?.providerState?.codex?.sessionId || '')
+            ? session.providerState.codex.sessionId
+            : null,
         };
 
         const enqueue = (fn) => {
@@ -728,11 +788,12 @@ export async function startChatd(opts = {}) {
           await appendMessage({ sessionId, role: 'user', text: message, storageRoot });
           runs.set(runId, run);
 
-          const handle = runExecutor({
+          const startAttempt = (resumeSessionId) => runExecutor({
             runId,
             sessionId,
             message: promptMessage,
             model: session.model || null,
+            resumeSessionId,
             onEvent: (evt) => {
               enqueue(async () => {
                 const active = runs.get(runId);
@@ -753,9 +814,43 @@ export async function startChatd(opts = {}) {
                   return;
                 }
 
+                if (evt.event === 'run.provider_session') {
+                  const provider = String(evt.payload?.provider || '').trim().toLowerCase();
+                  const providerSessionId = String(evt.payload?.sessionId || '').trim();
+                  if (provider === 'codex' && isValidSessionId(providerSessionId)) {
+                    await updateSession({
+                      sessionId,
+                      patch: {
+                        providerState: { codex: { sessionId: providerSessionId } },
+                      },
+                      storageRoot,
+                    });
+                  }
+                  broadcast(buildEvent({ event: 'run.provider_session', runId, sessionId, payload: evt.payload }));
+                  return;
+                }
+
+                if (evt.event === 'run.usage') {
+                  const usage = normalizeUsagePayload(evt.payload);
+                  if (usage) {
+                    await updateSession({
+                      sessionId,
+                      patch: {
+                        providerState: { codex: { latestUsage: usage } },
+                      },
+                      storageRoot,
+                    });
+                    broadcast(buildEvent({ event: 'run.usage', runId, sessionId, payload: usage }));
+                  }
+                  return;
+                }
+
                 if (evt.event === 'run.error') {
                   trackRunStep(active, evt);
-                  failRun(active, evt.payload?.error || 'Run failed');
+                  active.lastError = evt.payload?.error || 'Run failed';
+                  if (!active.resumeSessionId || active.resumeRetryAttempted) {
+                    failRun(active, active.lastError);
+                  }
                   return;
                 }
 
@@ -767,13 +862,30 @@ export async function startChatd(opts = {}) {
                 broadcast(buildEvent({ event: evt.event, runId, sessionId, payload: evt.payload }));
               });
             },
-            onExit: ({ code, signal }) => {
+            onExit: ({ code, signal, stderr }) => {
               enqueue(async () => {
                 const active = runs.get(runId);
                 if (!active || active.status !== 'running') return;
 
                 if (signal === 'SIGTERM' || active.status === 'aborted') return;
 
+                if (
+                  active.resumeSessionId
+                  && !active.resumeRetryAttempted
+                  && isResumeSessionInvalidFailure({ code, error: active.lastError, stderr })
+                ) {
+                  active.resumeRetryAttempted = true;
+                  active.resumeSessionId = null;
+                  active.lastError = null;
+                  try {
+                    const retryHandle = startAttempt(null);
+                    active.abort = retryHandle?.abort || null;
+                  } catch (error) {
+                    failRun(active, error?.message || 'Failed to retry codex run');
+                  }
+                  return;
+                }
+
                 if (active.assistantBuffer) {
                   await finalizeRun(active, active.assistantBuffer);
                   return;
@@ -784,7 +896,7 @@ export async function startChatd(opts = {}) {
                   return;
                 }
 
-                failRun(active, `codex exited with code ${code ?? 'unknown'}`);
+                failRun(active, active.lastError || `codex exited with code ${code ?? 'unknown'}`);
               });
             },
             onError: (error) => {
@@ -795,6 +907,7 @@ export async function startChatd(opts = {}) {
             },
           });
 
+          const handle = startAttempt(run.resumeSessionId);
           run.abort = handle?.abort || null;
           broadcast(buildEvent({
             event: 'run.started',
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 9d07b96..b0804dc 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -371,3 +371,306 @@ test('POST /v1/runs includes active tab context in runExecutor prompt', async ()
     await daemon.stop();
   }
 });
+
+test('POST /v1/runs reuses codex provider session id on second turn', async () => {
+  const observed = [];
+  const providerSessionId = '019caa6f-8c63-7c81-a542-3dbcf922d065';
+
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    runExecutor: ({ runId, sessionId, resumeSessionId, onEvent, onExit }) => {
+      observed.push({ runId, sessionId, resumeSessionId: resumeSessionId || null });
+      setTimeout(() => {
+        onEvent({
+          event: 'run.provider_session',
+          runId,
+          sessionId,
+          payload: { provider: 'codex', sessionId: providerSessionId },
+        });
+      }, 5);
+      setTimeout(() => {
+        onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'ok' } });
+      }, 10);
+      setTimeout(() => onExit({ code: 0 }), 15);
+      return { abort() {} };
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Continuity' }),
+    }).then((res) => res.json());
+
+    const runOneRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'first' }),
+    });
+    assert.equal(runOneRes.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 60));
+
+    const runTwoRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'second' }),
+    });
+    assert.equal(runTwoRes.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 60));
+
+    assert.equal(observed.length >= 2, true);
+    assert.equal(observed[0].resumeSessionId, null);
+    assert.equal(observed[1].resumeSessionId, providerSessionId);
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('stale resume failures retry once as fresh run when failure signature matches', async () => {
+  const observed = [];
+  const staleProviderSessionId = '019caa6f-8c63-7c81-a542-3dbcf922d065';
+  const recoveredProviderSessionId = '019caa6f-8c63-7c81-a542-3dbcf922d999';
+
+  let callCount = 0;
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    runExecutor: ({ runId, sessionId, resumeSessionId, onEvent, onExit }) => {
+      callCount += 1;
+      observed.push({ callCount, runId, sessionId, resumeSessionId: resumeSessionId || null });
+
+      if (callCount === 1) {
+        setTimeout(() => {
+          onEvent({
+            event: 'run.provider_session',
+            runId,
+            sessionId,
+            payload: { provider: 'codex', sessionId: staleProviderSessionId },
+          });
+        }, 5);
+        setTimeout(() => onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'seeded' } }), 10);
+        setTimeout(() => onExit({ code: 0 }), 15);
+        return { abort() {} };
+      }
+
+      if (callCount === 2) {
+        setTimeout(() => onEvent({ event: 'run.error', runId, sessionId, payload: { error: 'Resume session not found' } }), 5);
+        setTimeout(() => onExit({ code: 1, stderr: 'session not found' }), 10);
+        return { abort() {} };
+      }
+
+      setTimeout(() => {
+        onEvent({
+          event: 'run.provider_session',
+          runId,
+          sessionId,
+          payload: { provider: 'codex', sessionId: recoveredProviderSessionId },
+        });
+      }, 5);
+      setTimeout(() => onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'recovered' } }), 10);
+      setTimeout(() => onExit({ code: 0 }), 15);
+      return { abort() {} };
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Retry' }),
+    }).then((res) => res.json());
+
+    const seedRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'seed' }),
+    });
+    assert.equal(seedRes.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 70));
+
+    const retryRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'retry' }),
+    });
+    assert.equal(retryRes.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 100));
+
+    assert.equal(observed.length, 3);
+    assert.equal(observed[1].resumeSessionId, staleProviderSessionId);
+    assert.equal(observed[2].resumeSessionId, null);
+
+    const sessionRes = await fetch(`${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(sessionRes.status, 200);
+    const sessionBody = await sessionRes.json();
+    assert.equal(sessionBody.providerState?.codex?.sessionId, recoveredProviderSessionId);
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('non-resume failures do not clear codex provider session mapping', async () => {
+  const observed = [];
+  const providerSessionId = '019caa6f-8c63-7c81-a542-3dbcf922d065';
+  let callCount = 0;
+
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    runExecutor: ({ runId, sessionId, resumeSessionId, onEvent, onExit }) => {
+      callCount += 1;
+      observed.push({ callCount, runId, sessionId, resumeSessionId: resumeSessionId || null });
+
+      if (callCount === 1) {
+        setTimeout(() => {
+          onEvent({
+            event: 'run.provider_session',
+            runId,
+            sessionId,
+            payload: { provider: 'codex', sessionId: providerSessionId },
+          });
+        }, 5);
+        setTimeout(() => onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'seeded' } }), 10);
+        setTimeout(() => onExit({ code: 0 }), 15);
+        return { abort() {} };
+      }
+
+      setTimeout(() => onEvent({ event: 'run.error', runId, sessionId, payload: { error: 'tool crashed' } }), 5);
+      setTimeout(() => onExit({ code: 1, stderr: 'tool crashed' }), 10);
+      return { abort() {} };
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Preserve mapping' }),
+    }).then((res) => res.json());
+
+    await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'seed' }),
+    });
+    await new Promise((resolve) => setTimeout(resolve, 70));
+
+    const failed = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'fail' }),
+    });
+    assert.equal(failed.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 80));
+
+    assert.equal(observed.length, 2);
+    assert.equal(observed[1].resumeSessionId, providerSessionId);
+
+    const sessionRes = await fetch(`${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(sessionRes.status, 200);
+    const sessionBody = await sessionRes.json();
+    assert.equal(sessionBody.providerState?.codex?.sessionId, providerSessionId);
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('GET /v1/sessions/:id exposes providerState metadata for side-panel hydration', async () => {
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    runExecutor: ({ runId, sessionId, onEvent, onExit }) => {
+      setTimeout(() => {
+        onEvent({
+          event: 'run.provider_session',
+          runId,
+          sessionId,
+          payload: { provider: 'codex', sessionId: '019caa6f-8c63-7c81-a542-3dbcf922d065' },
+        });
+      }, 5);
+      setTimeout(() => {
+        onEvent({
+          event: 'run.usage',
+          runId,
+          sessionId,
+          payload: {
+            modelContextWindow: 258400,
+            totalTokens: 1120,
+            inputTokens: 1000,
+            cachedInputTokens: 700,
+            outputTokens: 120,
+          },
+        });
+      }, 10);
+      setTimeout(() => onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'done' } }), 15);
+      setTimeout(() => onExit({ code: 0 }), 20);
+      return { abort() {} };
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Metadata' }),
+    }).then((res) => res.json());
+
+    const runRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'collect usage' }),
+    });
+    assert.equal(runRes.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 90));
+
+    const sessionRes = await fetch(`${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(sessionRes.status, 200);
+    const sessionBody = await sessionRes.json();
+    assert.equal(sessionBody.sessionId, created.sessionId);
+    assert.equal(sessionBody.providerState?.codex?.sessionId, '019caa6f-8c63-7c81-a542-3dbcf922d065');
+    assert.equal(sessionBody.providerState?.codex?.latestUsage?.modelContextWindow, 258400);
+  } finally {
+    await daemon.stop();
+  }
+});

From 58d78436c528f3a373f7605b03572469a22f6eec Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:06:55 +0530
Subject: [PATCH 129/192] feat(sidepanel): render codex context usage telemetry
 with session hydration

---
 extension/agent-panel-runtime.js             | 35 ++++++++++++
 extension/agent-panel-state.js               | 53 ++++++++++++++++++
 extension/agent-panel.css                    | 56 ++++++++++++++++++++
 extension/agent-panel.html                   |  2 +
 extension/agent-panel.js                     | 42 +++++++++++++--
 test/agent/agent-panel-contract.test.js      |  1 +
 test/agent/agent-panel-runtime.test.js       | 31 +++++++++++
 test/agent/agent-panel-send-contract.test.js | 13 +++++
 test/agent/session-ui-state.test.js          | 29 ++++++++++
 test/agent/sse-events.test.js                | 53 ++++++++++++++++++
 10 files changed, 311 insertions(+), 4 deletions(-)

diff --git a/extension/agent-panel-runtime.js b/extension/agent-panel-runtime.js
index 63fe634..0cb7eef 100644
--- a/extension/agent-panel-runtime.js
+++ b/extension/agent-panel-runtime.js
@@ -27,6 +27,27 @@ export function shouldApplySessionSelection({ requestToken, latestRequestToken,
   );
 }
 
+function escapeHtml(value) {
+  return String(value ?? '')
+    .replace(/&/g, '&amp;')
+    .replace(/</g, '&lt;')
+    .replace(/>/g, '&gt;')
+    .replace(/"/g, '&quot;')
+    .replace(/'/g, '&#39;');
+}
+
+export function renderInlineContent(value) {
+  return escapeHtml(value)
+    .replace(/`([^`]+)`/g, '<code>$1</code>')
+    .replace(/\*\*([^*]+)\*\*/g, '<strong>$1</strong>');
+}
+
+export function getLatestInFlightStepIndex(run = {}) {
+  const steps = Array.isArray(run?.steps) ? run.steps : [];
+  if (!steps.length || run?.done) return -1;
+  return steps.length - 1;
+}
+
 export function classifyRunStepIcon(step = {}) {
   const status = String(step.status || '').toLowerCase();
   if (status === 'failed') return 'failed';
@@ -43,3 +64,17 @@ export function classifyRunStepIcon(step = {}) {
   if (kind === 'tool') return 'tool';
   return 'reasoning';
 }
+
+function normalizeUsageValue(value) {
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed <= 0) return null;
+  return Math.round(parsed);
+}
+
+export function formatContextUsage({ totalTokens, modelContextWindow } = {}) {
+  const total = normalizeUsageValue(totalTokens);
+  const windowSize = normalizeUsageValue(modelContextWindow);
+  if (total == null || windowSize == null) return null;
+  const percent = ((total / windowSize) * 100).toFixed(1);
+  return `${total.toLocaleString()} / ${windowSize.toLocaleString()} (${percent}%)`;
+}
diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 9b665a8..13a6671 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -3,6 +3,7 @@ export const initialState = {
   activeSessionId: null,
   messagesBySession: {},
   runs: {},
+  latestUsageBySession: {},
 };
 
 function firstString(values) {
@@ -125,6 +126,28 @@ function upsertRun(state, runId, patch) {
   };
 }
 
+function normalizeUsageValue(value) {
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed < 0) return null;
+  return Math.round(parsed);
+}
+
+function normalizeUsagePayload(payload) {
+  if (!payload || typeof payload !== 'object') return null;
+  const normalized = {
+    modelContextWindow: normalizeUsageValue(payload.modelContextWindow),
+    totalTokens: normalizeUsageValue(payload.totalTokens),
+    inputTokens: normalizeUsageValue(payload.inputTokens),
+    cachedInputTokens: normalizeUsageValue(payload.cachedInputTokens),
+    outputTokens: normalizeUsageValue(payload.outputTokens),
+    reasoningOutputTokens: normalizeUsageValue(payload.reasoningOutputTokens),
+  };
+  for (const [key, value] of Object.entries(normalized)) {
+    if (value == null) delete normalized[key];
+  }
+  return Object.keys(normalized).length > 0 ? normalized : null;
+}
+
 function normalizeStoredStep(step) {
   if (!step || typeof step !== 'object') return null;
   const label = trimStepLabel(step.label);
@@ -193,6 +216,18 @@ export function reduceState(state = initialState, action = {}) {
     };
   }
 
+  if (action.type === 'session.metadata.loaded') {
+    const usage = normalizeUsagePayload(action.session?.providerState?.codex?.latestUsage);
+    if (!usage || !action.sessionId) return state;
+    return {
+      ...state,
+      latestUsageBySession: {
+        ...(state.latestUsageBySession || {}),
+        [action.sessionId]: usage,
+      },
+    };
+  }
+
   return state;
 }
 
@@ -317,5 +352,23 @@ export function applyEvent(state = initialState, evt = {}) {
     };
   }
 
+  if (evt.event === 'run.usage') {
+    const usage = normalizeUsagePayload(evt.payload);
+    if (!usage) return state;
+    const run = state.runs[evt.runId] || { text: '', done: false, steps: [] };
+    return {
+      ...state,
+      runs: upsertRun(state, evt.runId, {
+        sessionId: evt.sessionId,
+        done: run.done || false,
+        usage,
+      }),
+      latestUsageBySession: {
+        ...(state.latestUsageBySession || {}),
+        [evt.sessionId]: usage,
+      },
+    };
+  }
+
   return state;
 }
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index ffac37e..8bc6444 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -53,6 +53,7 @@ body {
   align-items: center;
   gap: 8px;
   padding: 12px 14px;
+  flex-wrap: wrap;
 }
 
 .pill-btn {
@@ -128,6 +129,22 @@ body {
   justify-content: center;
 }
 
+.context-usage-chip {
+  min-width: 0;
+  max-width: 100%;
+  padding: 0 10px;
+  height: 24px;
+  border-radius: 999px;
+  border: 1px solid rgba(255, 255, 255, 0.2);
+  background: rgba(255, 255, 255, 0.08);
+  color: rgba(255, 255, 255, 0.78);
+  font-size: 11px;
+  line-height: 22px;
+  white-space: nowrap;
+  overflow: hidden;
+  text-overflow: ellipsis;
+}
+
 .status-dot {
   width: 8px;
   height: 8px;
@@ -378,6 +395,21 @@ body {
   white-space: pre-wrap;
 }
 
+.step-label strong {
+  font-weight: 600;
+  color: var(--text);
+}
+
+.step-label code {
+  font-family: 'SF Mono', 'Fira Code', 'Cascadia Code', monospace;
+  font-size: 11px;
+  background: var(--sand);
+  color: var(--crail-dark);
+  padding: 1px 5px;
+  border-radius: 4px;
+  border: 1px solid var(--line);
+}
+
 .run-step-icon {
   width: 13px;
   height: 13px;
@@ -387,6 +419,18 @@ body {
   position: relative;
 }
 
+.step-item.latest .step-label {
+  color: var(--text);
+}
+
+.step-item.latest .run-step-icon {
+  color: var(--crail);
+}
+
+.step-item.pulse .run-step-icon {
+  animation: step-pulse 1.2s ease-in-out infinite;
+}
+
 .run-step-icon::before,
 .run-step-icon::after {
   content: '';
@@ -868,3 +912,15 @@ body {
     transform: rotate(360deg);
   }
 }
+
+@keyframes step-pulse {
+  0%,
+  100% {
+    transform: scale(1);
+    opacity: 1;
+  }
+  50% {
+    transform: scale(1.15);
+    opacity: 0.65;
+  }
+}
diff --git a/extension/agent-panel.html b/extension/agent-panel.html
index 9e20c0f..7af63d6 100644
--- a/extension/agent-panel.html
+++ b/extension/agent-panel.html
@@ -30,6 +30,8 @@
           </svg>
         </button>
 
+        <div id="bf-context-usage" class="context-usage-chip" title="Context: unavailable">Context: unavailable</div>
+
         <div id="bf-agent-status" class="status-circle" title="Starting...">
           <span id="bf-agent-status-icon" class="status-dot" aria-hidden="true"></span>
           <span id="bf-agent-status-text" class="sr-only">Starting...</span>
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 55c6d76..cb7760a 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -3,7 +3,10 @@ import {
   assignSessionRunId,
   classifyRunStepIcon,
   clearSessionRunId,
+  formatContextUsage,
+  getLatestInFlightStepIndex,
   getSessionRunId,
+  renderInlineContent,
   shouldApplySessionSelection,
 } from './agent-panel-runtime.js';
 
@@ -28,6 +31,7 @@ const state = {
 const statusEl = document.getElementById('bf-agent-status');
 const statusIconEl = document.getElementById('bf-agent-status-icon');
 const statusTextEl = document.getElementById('bf-agent-status-text');
+const contextUsageEl = document.getElementById('bf-context-usage');
 const modelTriggerBtn = document.getElementById('bf-model-trigger');
 const modelLabelEl = document.getElementById('bf-model-label');
 const sessionTriggerBtn = document.getElementById('bf-session-trigger');
@@ -101,6 +105,15 @@ function syncStatusIndicator() {
   statusIconEl.textContent = '';
 }
 
+function renderContextUsageChip() {
+  if (!contextUsageEl) return;
+  const sessionId = state.value.activeSessionId;
+  const usage = sessionId ? state.value.latestUsageBySession?.[sessionId] : null;
+  const formatted = formatContextUsage(usage || {});
+  contextUsageEl.textContent = formatted ? `Context: ${formatted}` : 'Context: unavailable';
+  contextUsageEl.title = contextUsageEl.textContent;
+}
+
 function setStatus(kind, text) {
   state.status = { kind, text };
   syncStatusIndicator();
@@ -359,13 +372,19 @@ function renderRunSteps(runId, run) {
   if (!runId || !run || !Array.isArray(run.steps) || run.steps.length === 0) return '';
   const count = run.steps.length;
   const expanded = isRunStepsExpanded(runId);
+  const latestStepIndex = getLatestInFlightStepIndex(run);
 
   const items = run.steps
-    .map((step) => {
+    .map((step, index) => {
       const status = step?.status || 'running';
       const label = step?.label || 'Step';
       const icon = classifyRunStepIcon(step);
-      return `<li class="step-item ${escapeHtml(status)}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="step-label">${escapeHtml(label)}</span></li>`;
+      const isLatest = index === latestStepIndex;
+      const shouldPulse = isLatest && status === 'running';
+      const classes = ['step-item', escapeHtml(status)];
+      if (isLatest) classes.push('latest');
+      if (shouldPulse) classes.push('pulse');
+      return `<li class="${classes.join(' ')}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="step-label">${renderInlineContent(label)}</span></li>`;
     })
     .join('');
 
@@ -383,7 +402,7 @@ function renderRunSteps(runId, run) {
 }
 
 function renderContent(value) {
-  return escapeHtml(value).replace(/`([^`]+)`/g, '<code>$1</code>');
+  return renderInlineContent(value);
 }
 
 function bindTranscriptHandlers() {
@@ -438,7 +457,10 @@ function renderTranscript() {
       chunks.push(`
         <article class="message assistant">
           <div class="msg-meta"><span class="msg-author">BrowserForce</span></div>
-          <div class="thinking-bubble"><div class="spinner"></div><span>Thinking...</span></div>
+          <div class="msg-content-wrap">
+            ${renderRunSteps(sessionRunId, run)}
+            <div class="thinking-bubble"><div class="spinner"></div><span>Thinking...</span></div>
+          </div>
         </article>
       `);
     }
@@ -483,6 +505,7 @@ function renderPopovers() {
 
 function render() {
   renderSelectors();
+  renderContextUsageChip();
   renderModelList();
   renderSessions();
   renderTranscript();
@@ -741,11 +764,22 @@ async function loadMessages(sessionId) {
   dispatch({ type: 'messages.loaded', sessionId, messages: body.messages || [] });
 }
 
+async function loadSessionMetadata(sessionId) {
+  const res = await api(`/v1/sessions/${encodeURIComponent(sessionId)}`, {
+    method: 'GET',
+    headers: {},
+  });
+  await ensureOk(res, 'Failed to load session metadata');
+  const session = await readJsonOrEmpty(res);
+  dispatch({ type: 'session.metadata.loaded', sessionId, session });
+}
+
 async function selectSession(sessionId) {
   state.sessionSelectionToken += 1;
   const selectionToken = state.sessionSelectionToken;
   dispatch({ type: 'session.selected', sessionId });
   await loadMessages(sessionId);
+  await loadSessionMetadata(sessionId);
   if (!shouldApplySessionSelection({
     requestToken: selectionToken,
     latestRequestToken: state.sessionSelectionToken,
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index cfadc94..0c7115d 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -17,6 +17,7 @@ test('agent panel has inline model and session selectors with popovers', () => {
   assert.match(html, /id="bf-tab-attach-banner"/);
   assert.match(html, /id="bf-tab-attach-text"/);
   assert.match(html, /id="bf-attach-current-tab"/);
+  assert.match(html, /id="bf-context-usage"/);
 });
 
 test('agent panel no longer renders title or persistent session sidebar', () => {
diff --git a/test/agent/agent-panel-runtime.test.js b/test/agent/agent-panel-runtime.test.js
index 22fb8f5..7a613cf 100644
--- a/test/agent/agent-panel-runtime.test.js
+++ b/test/agent/agent-panel-runtime.test.js
@@ -4,7 +4,10 @@ import {
   assignSessionRunId,
   classifyRunStepIcon,
   clearSessionRunId,
+  formatContextUsage,
+  getLatestInFlightStepIndex,
   getSessionRunId,
+  renderInlineContent,
   shouldApplySessionSelection,
 } from '../../extension/agent-panel-runtime.js';
 
@@ -48,3 +51,31 @@ test('classifies step icons from reasoning/tool labels', () => {
   assert.equal(classifyRunStepIcon({ kind: 'status', status: 'done', label: 'Done' }), 'done');
   assert.equal(classifyRunStepIcon({ kind: 'status', status: 'failed', label: 'Failed' }), 'failed');
 });
+
+test('renders safe inline markdown for bold and code spans', () => {
+  assert.equal(renderInlineContent('**Inspect active tab**'), '<strong>Inspect active tab</strong>');
+  assert.equal(renderInlineContent('Use `snapshot()` now'), 'Use <code>snapshot()</code> now');
+  assert.equal(
+    renderInlineContent('**<script>alert(1)</script>**'),
+    '<strong>&lt;script&gt;alert(1)&lt;/script&gt;</strong>',
+  );
+});
+
+test('tracks latest step index for active runs only', () => {
+  assert.equal(getLatestInFlightStepIndex({ done: false, steps: [{}, {}, {}] }), 2);
+  assert.equal(getLatestInFlightStepIndex({ done: true, steps: [{}, {}] }), -1);
+  assert.equal(getLatestInFlightStepIndex({ done: false, steps: [] }), -1);
+});
+
+test('formats context usage with percentage when context window is present', () => {
+  assert.equal(
+    formatContextUsage({ totalTokens: 12345, modelContextWindow: 258400 }),
+    '12,345 / 258,400 (4.8%)',
+  );
+});
+
+test('returns null for context usage formatting when values are incomplete', () => {
+  assert.equal(formatContextUsage({ totalTokens: 12345 }), null);
+  assert.equal(formatContextUsage({ modelContextWindow: 258400 }), null);
+  assert.equal(formatContextUsage({ totalTokens: 0, modelContextWindow: 258400 }), null);
+});
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index d108862..1443eaf 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -56,3 +56,16 @@ test('session popover renders per-session timestamp metadata', () => {
   assert.match(js, /updatedAt|createdAt/);
   assert.match(js, /toLocaleString/);
 });
+
+test('in-flight thinking state keeps run steps visible above the thinking bubble', () => {
+  assert.match(js, /if \(run && !run\.done\)/);
+  assert.match(js, /renderRunSteps\(sessionRunId, run\)/);
+  assert.match(js, /class="thinking-bubble"/);
+});
+
+test('status row renders context usage from latestUsageBySession with fallback', () => {
+  assert.match(js, /function renderContextUsageChip\(\)/);
+  assert.match(js, /latestUsageBySession/);
+  assert.match(js, /Context:\s*unavailable/);
+  assert.match(js, /formatted \? `Context: \$\{formatted\}` : 'Context: unavailable'/);
+});
diff --git a/test/agent/session-ui-state.test.js b/test/agent/session-ui-state.test.js
index e7701f5..4f96798 100644
--- a/test/agent/session-ui-state.test.js
+++ b/test/agent/session-ui-state.test.js
@@ -59,3 +59,32 @@ test('messages.loaded hydrates stored run metadata for reopened sessions', () =>
   assert.equal(next.runs.run_1?.steps?.length, 1);
   assert.equal(next.runs.run_1?.steps?.[0]?.label, 'Snapshot page');
 });
+
+test('session.metadata.loaded hydrates persisted codex usage for reopened session', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {},
+    latestUsageBySession: {},
+  };
+
+  const next = reduceState(state, {
+    type: 'session.metadata.loaded',
+    sessionId: 's1',
+    session: {
+      sessionId: 's1',
+      providerState: {
+        codex: {
+          latestUsage: {
+            modelContextWindow: 258400,
+            totalTokens: 1120,
+          },
+        },
+      },
+    },
+  });
+
+  assert.equal(next.latestUsageBySession.s1.modelContextWindow, 258400);
+  assert.equal(next.latestUsageBySession.s1.totalTokens, 1120);
+});
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index 1e91059..cff54f6 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -7,6 +7,7 @@ const baseState = {
   activeSessionId: null,
   messagesBySession: {},
   runs: {},
+  latestUsageBySession: {},
 };
 
 test('chat.delta appends to in-flight run text', () => {
@@ -54,3 +55,55 @@ test('run.error appends a final failed step', () => {
   assert.equal(last.status, 'failed');
   assert.match(last.label, /boom/);
 });
+
+test('run.event is converted into a visible in-flight step', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, {
+    event: 'run.event',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: {
+      type: 'item.started',
+      item: {
+        type: 'reasoning',
+        summary: 'Planning skill invocation',
+      },
+    },
+  });
+  const last = s2.runs.r1.steps.at(-1);
+  assert.equal(last.status, 'running');
+  assert.equal(last.kind, 'reasoning');
+  assert.match(last.label, /Planning skill invocation/);
+});
+
+test('run.usage stores normalized usage for run and session', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, {
+    event: 'run.usage',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: {
+      totalTokens: 1120,
+      modelContextWindow: 258400,
+      cachedInputTokens: 700,
+    },
+  });
+
+  assert.equal(s2.runs.r1.usage.totalTokens, 1120);
+  assert.equal(s2.latestUsageBySession.s1.modelContextWindow, 258400);
+  assert.equal(s2.latestUsageBySession.s1.cachedInputTokens, 700);
+});
+
+test('run.usage accepts missing context window without crashing', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, {
+    event: 'run.usage',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: {
+      totalTokens: 1120,
+    },
+  });
+  assert.equal(s2.latestUsageBySession.s1.totalTokens, 1120);
+  assert.equal(Object.prototype.hasOwnProperty.call(s2.latestUsageBySession.s1, 'modelContextWindow'), false);
+});

From 2c1417d63ca61efbcd1afb80462e9f4d7dba225c Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:07:12 +0530
Subject: [PATCH 130/192] docs(agent): document codex session continuity and
 context telemetry

---
 AGENTS.md | 10 ++++++++++
 README.md |  7 +++++++
 2 files changed, 17 insertions(+)

diff --git a/AGENTS.md b/AGENTS.md
index 5b44d22..540bf0c 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -174,6 +174,16 @@ For side-panel chat UX, **never hardcode or assume a fixed `sessionId`**.
 - Streaming channels (`/events`) must be scoped by explicit selected `sessionId`.
 - Do not infer continuity from "current Codex turn/session" alone; BrowserForce Agent keeps its own session store.
 
+### Codex Provider Session Continuity + Usage Telemetry
+
+For side-panel chat continuity, BrowserForce session metadata stores Codex provider state:
+
+- Persist Codex thread identity at `providerState.codex.sessionId`.
+- On each new run, pass that mapping as `resumeSessionId` so runner can invoke `codex exec resume <id> --json`.
+- Persist latest context/token telemetry at `providerState.codex.latestUsage`.
+- Emit and consume `run.usage` and `run.provider_session` events.
+- Side-panel hydrates usage from `GET /v1/sessions/:sessionId` and shows `Context: unavailable` when telemetry is missing.
+
 ## Security Rules
 
 - Relay binds to `127.0.0.1` ONLY. Never `0.0.0.0`.
diff --git a/README.md b/README.md
index 1d9890f..033da4c 100644
--- a/README.md
+++ b/README.md
@@ -393,7 +393,14 @@ BrowserForce now includes a side-panel chat UI in the Chrome extension for resum
 - Open popup -> `Open BrowserForce Agent` to open the side panel.
 - Use the session list to switch between chats; transcripts hydrate per selected `sessionId`.
 - Session identity is explicit and persisted; there is no fixed/hardcoded chat session ID.
+- BrowserForce session metadata persists Codex continuity state at `providerState.codex.sessionId`.
+  - New runs use `codex exec resume <sessionId> --json` when this mapping exists.
+  - If resume fails with an explicit invalid-session signature, chatd retries once as a fresh run.
 - Streaming uses `fetch` + `ReadableStream` for SSE, not `EventSource`, so the panel can send `Authorization: Bearer ...` headers.
+- Side-panel status includes a context usage chip:
+  - Live updates from `run.usage` SSE events when available.
+  - Hydrates from `GET /v1/sessions/:sessionId` via `providerState.codex.latestUsage`.
+  - Falls back to `Context: unavailable` when telemetry is absent.
 
 Daemon lifecycle:
 

From 6d142f627f923e6d8ce6cff08acfdfb2f5865817 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:19:33 +0530
Subject: [PATCH 131/192] sidepanel: render run timeline inline with ordered
 tool/text events

---
 extension/agent-panel-state.js               | 184 +++++++++++++++----
 extension/agent-panel.css                    | 104 ++++-------
 extension/agent-panel.js                     | 133 +++++++-------
 test/agent/agent-panel-send-contract.test.js |  11 +-
 test/agent/session-ui-state.test.js          |  29 +++
 test/agent/sse-events.test.js                |  30 +++
 6 files changed, 320 insertions(+), 171 deletions(-)

diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 13a6671..1087c24 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -19,14 +19,21 @@ function trimStepLabel(label) {
   return text.length > 160 ? `${text.slice(0, 157)}...` : text;
 }
 
-function pushStep(run, step) {
-  const steps = Array.isArray(run?.steps) ? run.steps.slice() : [];
-  const normalized = {
+function normalizeStep(step) {
+  if (!step || typeof step !== 'object') return null;
+  const label = trimStepLabel(step.label);
+  if (!label) return null;
+  return {
     kind: step.kind || 'reasoning',
     status: step.status || 'running',
-    label: trimStepLabel(step.label),
+    label,
   };
-  if (!normalized.label) return steps;
+}
+
+function pushStep(run, step) {
+  const steps = Array.isArray(run?.steps) ? run.steps.slice() : [];
+  const normalized = normalizeStep(step);
+  if (!normalized || !normalized.label) return steps;
   const last = steps[steps.length - 1];
   if (last && last.label === normalized.label && last.kind === normalized.kind && last.status === normalized.status) {
     return steps;
@@ -36,6 +43,92 @@ function pushStep(run, step) {
   return steps;
 }
 
+function pushTimelineEntry(run, entry) {
+  const timeline = Array.isArray(run?.timeline) ? run.timeline.slice() : [];
+  if (!entry || typeof entry !== 'object') return timeline;
+
+  if (entry.type === 'text') {
+    const text = typeof entry.text === 'string' ? entry.text : '';
+    if (!text) return timeline;
+    const last = timeline[timeline.length - 1];
+    if (last?.type === 'text') {
+      last.text = `${last.text || ''}${text}`;
+    } else {
+      timeline.push({ type: 'text', text });
+    }
+  } else if (entry.type === 'step') {
+    const normalized = normalizeStep(entry);
+    if (!normalized) return timeline;
+    const candidate = { type: 'step', ...normalized };
+    const last = timeline[timeline.length - 1];
+    if (
+      last
+      && last.type === 'step'
+      && last.label === candidate.label
+      && last.kind === candidate.kind
+      && last.status === candidate.status
+    ) {
+      return timeline;
+    }
+    timeline.push(candidate);
+  }
+
+  if (timeline.length > 200) timeline.shift();
+  return timeline;
+}
+
+function normalizeStoredTimelineEntry(entry) {
+  if (!entry || typeof entry !== 'object') return null;
+  if (entry.type === 'text') {
+    const text = typeof entry.text === 'string' ? entry.text : '';
+    if (!text) return null;
+    return { type: 'text', text };
+  }
+  const step = normalizeStep(entry);
+  if (!step) return null;
+  return { type: 'step', ...step };
+}
+
+function fallbackTimelineFromMessage({ steps, text }) {
+  const timeline = [];
+  for (const step of steps) {
+    timeline.push({ type: 'step', ...step });
+  }
+  if (typeof text === 'string' && text) {
+    timeline.push({ type: 'text', text });
+  }
+  return timeline;
+}
+
+function hasTimelineText(timeline) {
+  return Array.isArray(timeline) && timeline.some((entry) => entry?.type === 'text' && entry.text);
+}
+
+function applyFinalTextToTimeline(run, finalText) {
+  let timeline = Array.isArray(run?.timeline) ? run.timeline.slice() : [];
+  const currentText = String(run?.text || '');
+  const resolved = String(finalText || '');
+  if (!resolved) return timeline;
+
+  if (!timeline.length || !hasTimelineText(timeline)) {
+    timeline = pushTimelineEntry({ timeline }, { type: 'text', text: resolved });
+    return timeline;
+  }
+
+  if (resolved === currentText) return timeline;
+
+  if (currentText && resolved.startsWith(currentText)) {
+    const suffix = resolved.slice(currentText.length);
+    if (suffix) {
+      timeline = pushTimelineEntry({ timeline }, { type: 'text', text: suffix });
+    }
+    return timeline;
+  }
+
+  timeline = pushTimelineEntry({ timeline }, { type: 'text', text: resolved });
+  return timeline;
+}
+
 function stepLabelForToolEvent(evt) {
   const payload = evt?.payload || {};
   if (evt.event === 'tool.started') {
@@ -149,14 +242,7 @@ function normalizeUsagePayload(payload) {
 }
 
 function normalizeStoredStep(step) {
-  if (!step || typeof step !== 'object') return null;
-  const label = trimStepLabel(step.label);
-  if (!label) return null;
-  return {
-    kind: step.kind || 'reasoning',
-    status: step.status || 'running',
-    label,
-  };
+  return normalizeStep(step);
 }
 
 function hydrateRunsFromMessages(messages, sessionId, currentRuns) {
@@ -167,13 +253,23 @@ function hydrateRunsFromMessages(messages, sessionId, currentRuns) {
     const steps = Array.isArray(message?.steps)
       ? message.steps.map(normalizeStoredStep).filter(Boolean)
       : [];
+    const timeline = Array.isArray(message?.timeline)
+      ? message.timeline.map(normalizeStoredTimelineEntry).filter(Boolean)
+      : [];
+    const resolvedText = typeof message?.text === 'string' ? message.text : (currentRuns?.[runId]?.text || '');
     hydrated[runId] = {
       ...(currentRuns?.[runId] || { runId, text: '', done: false, steps: [] }),
       runId,
       sessionId,
-      text: typeof message?.text === 'string' ? message.text : (currentRuns?.[runId]?.text || ''),
+      text: resolvedText,
       done: true,
       steps: steps.length > 0 ? steps : (currentRuns?.[runId]?.steps || []),
+      timeline: timeline.length > 0
+        ? timeline
+        : fallbackTimelineFromMessage({
+          steps: steps.length > 0 ? steps : (currentRuns?.[runId]?.steps || []),
+          text: resolvedText,
+        }),
     };
   }
   return hydrated;
@@ -243,30 +339,39 @@ export function applyEvent(state = initialState, evt = {}) {
         done: false,
         error: null,
         steps: [],
+        timeline: [],
       }),
     };
   }
 
   if (evt.event === 'chat.delta') {
-    const run = state.runs[evt.runId] || { text: '', done: false };
+    const run = state.runs[evt.runId] || { text: '', done: false, steps: [], timeline: [] };
     const delta = evt.payload?.delta || '';
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
         sessionId: evt.sessionId,
         text: `${run.text || ''}${delta}`,
+        timeline: pushTimelineEntry(run, { type: 'text', text: delta }),
       }),
     };
   }
 
   if (evt.event === 'chat.final') {
-    const finalText = evt.payload?.text || state.runs[evt.runId]?.text || '';
+    const run = state.runs[evt.runId] || { text: '', done: false, steps: [], timeline: [] };
+    const finalText = evt.payload?.text || run.text || '';
+    const timeline = applyFinalTextToTimeline(run, finalText);
     const currentMessages = state.messagesBySession[evt.sessionId] || [];
     const hasStoredFinal = currentMessages.some(
       (message) => message.runId === evt.runId && message.role === 'assistant',
     );
-    const nextMessages = (!hasStoredFinal && finalText)
-      ? [...currentMessages, { role: 'assistant', text: finalText, runId: evt.runId }]
+    const nextMessages = (!hasStoredFinal && (finalText || timeline.length > 0))
+      ? [...currentMessages, {
+        role: 'assistant',
+        text: finalText,
+        runId: evt.runId,
+        timeline,
+      }]
       : currentMessages;
 
     return {
@@ -279,47 +384,52 @@ export function applyEvent(state = initialState, evt = {}) {
         sessionId: evt.sessionId,
         text: finalText,
         done: true,
+        timeline,
       }),
     };
   }
 
   if (evt.event === 'run.error') {
-    const run = state.runs[evt.runId] || { steps: [] };
+    const run = state.runs[evt.runId] || { steps: [], timeline: [] };
     const error = evt.payload?.error || 'Unknown error';
+    const step = {
+      kind: 'status',
+      status: 'failed',
+      label: `Failed: ${error}`,
+    };
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
         sessionId: evt.sessionId,
         done: true,
         error,
-        steps: pushStep(run, {
-          kind: 'status',
-          status: 'failed',
-          label: `Failed: ${error}`,
-        }),
+        steps: pushStep(run, step),
+        timeline: pushTimelineEntry(run, { type: 'step', ...step }),
       }),
     };
   }
 
   if (evt.event === 'run.aborted') {
-    const run = state.runs[evt.runId] || { steps: [] };
+    const run = state.runs[evt.runId] || { steps: [], timeline: [] };
+    const step = {
+      kind: 'status',
+      status: 'aborted',
+      label: 'Stopped',
+    };
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
         sessionId: evt.sessionId,
         done: true,
         aborted: true,
-        steps: pushStep(run, {
-          kind: 'status',
-          status: 'aborted',
-          label: 'Stopped',
-        }),
+        steps: pushStep(run, step),
+        timeline: pushTimelineEntry(run, { type: 'step', ...step }),
       }),
     };
   }
 
   if (evt.event === 'tool.started' || evt.event === 'tool.delta' || evt.event === 'tool.final') {
-    const run = state.runs[evt.runId] || { text: '', done: false, steps: [] };
+    const run = state.runs[evt.runId] || { text: '', done: false, steps: [], timeline: [] };
     const status = evt.event === 'tool.final'
       ? 'done'
       : 'running';
@@ -327,27 +437,31 @@ export function applyEvent(state = initialState, evt = {}) {
       ? 'reasoning'
       : 'tool';
     const label = stepLabelForToolEvent(evt);
+    const step = { kind, status, label };
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
         sessionId: evt.sessionId,
         done: false,
-        steps: pushStep(run, { kind, status, label }),
+        steps: pushStep(run, step),
+        timeline: pushTimelineEntry(run, { type: 'step', ...step }),
       }),
     };
   }
 
   if (evt.event === 'run.event') {
-    const run = state.runs[evt.runId] || { text: '', done: false, steps: [] };
+    const run = state.runs[evt.runId] || { text: '', done: false, steps: [], timeline: [] };
     const status = stepStatusForRunEvent(evt);
     const kind = stepKindForRunEvent(evt);
     const label = stepLabelForRunEvent(evt);
+    const step = { kind, status, label };
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
         sessionId: evt.sessionId,
         done: false,
-        steps: pushStep(run, { kind, status, label }),
+        steps: pushStep(run, step),
+        timeline: pushTimelineEntry(run, { type: 'step', ...step }),
       }),
     };
   }
@@ -355,7 +469,7 @@ export function applyEvent(state = initialState, evt = {}) {
   if (evt.event === 'run.usage') {
     const usage = normalizeUsagePayload(evt.payload);
     if (!usage) return state;
-    const run = state.runs[evt.runId] || { text: '', done: false, steps: [] };
+    const run = state.runs[evt.runId] || { text: '', done: false, steps: [], timeline: [] };
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 8bc6444..25a5f8c 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -337,49 +337,15 @@ body {
   word-break: break-word;
 }
 
-.run-steps-summary {
-  margin-bottom: 6px;
-}
-
-.steps-toggle {
-  display: inline-flex;
-  align-items: center;
-  gap: 6px;
-  background: transparent;
-  border: 0;
-  cursor: pointer;
-  font-size: 12px;
-  color: var(--text-subtle);
-  padding: 0;
-  transition: color 0.15s;
-}
-
-.steps-toggle:hover {
-  color: var(--text-muted);
-}
-
-.steps-toggle svg {
-  width: 12px;
-  height: 12px;
-  transition: transform 0.2s;
-}
-
-.steps-toggle.open svg {
-  transform: rotate(90deg);
-}
-
-.steps-list {
-  list-style: none;
-  display: none;
-  margin: 8px 0 8px 2px;
-  padding-left: 12px;
-  border-left: 1.5px solid var(--line);
-}
-
-.steps-list.open {
+.run-timeline {
   display: flex;
   flex-direction: column;
   gap: 8px;
+  margin-bottom: 6px;
+}
+
+.timeline-step {
+  padding-left: 2px;
 }
 
 .step-item {
@@ -411,10 +377,13 @@ body {
 }
 
 .run-step-icon {
-  width: 13px;
-  height: 13px;
+  width: 14px;
+  height: 14px;
+  display: inline-flex;
+  align-items: center;
+  justify-content: center;
   flex-shrink: 0;
-  margin-top: 1px;
+  margin-top: 2px;
   color: var(--text-subtle);
   position: relative;
 }
@@ -435,20 +404,21 @@ body {
 .run-step-icon::after {
   content: '';
   position: absolute;
+  box-sizing: border-box;
 }
 
 .run-step-icon.icon-reasoning::before {
-  top: 2px;
-  left: 2px;
-  width: 9px;
-  height: 9px;
+  top: 3px;
+  left: 3px;
+  width: 8px;
+  height: 8px;
   border-radius: 999px;
   background: currentColor;
 }
 
 .run-step-icon.icon-tool::before {
-  top: 1px;
-  left: 1px;
+  top: 2px;
+  left: 2px;
   width: 10px;
   height: 10px;
   border: 1.5px solid currentColor;
@@ -457,7 +427,7 @@ body {
 
 .run-step-icon.icon-view::before {
   top: 4px;
-  left: 0;
+  left: 1px;
   width: 12px;
   height: 6px;
   border: 1.5px solid currentColor;
@@ -466,7 +436,7 @@ body {
 
 .run-step-icon.icon-view::after {
   top: 6px;
-  left: 5px;
+  left: 6px;
   width: 2px;
   height: 2px;
   border: 1.5px solid currentColor;
@@ -475,7 +445,7 @@ body {
 
 .run-step-icon.icon-camera::before {
   top: 3px;
-  left: 0;
+  left: 1px;
   width: 12px;
   height: 7px;
   border: 1.5px solid currentColor;
@@ -484,7 +454,7 @@ body {
 
 .run-step-icon.icon-camera::after {
   top: 1px;
-  left: 4px;
+  left: 5px;
   width: 4px;
   height: 2px;
   border: 1.5px solid currentColor;
@@ -494,7 +464,7 @@ body {
 
 .run-step-icon.icon-plan::before {
   top: 2px;
-  left: 2px;
+  left: 3px;
   width: 2px;
   height: 2px;
   border-radius: 999px;
@@ -504,7 +474,7 @@ body {
 
 .run-step-icon.icon-plan::after {
   top: 2px;
-  left: 6px;
+  left: 7px;
   width: 5px;
   height: 2px;
   border-radius: 2px;
@@ -513,17 +483,17 @@ body {
 }
 
 .run-step-icon.icon-done::before {
-  top: 0;
-  left: 0;
-  width: 11px;
-  height: 11px;
+  top: 1px;
+  left: 1px;
+  width: 12px;
+  height: 12px;
   border: 1.5px solid currentColor;
   border-radius: 999px;
 }
 
 .run-step-icon.icon-done::after {
-  top: 5px;
-  left: 3px;
+  top: 6px;
+  left: 4px;
   width: 5px;
   height: 3px;
   border-left: 1.5px solid currentColor;
@@ -532,17 +502,17 @@ body {
 }
 
 .run-step-icon.icon-failed::before {
-  top: 0;
-  left: 0;
-  width: 11px;
-  height: 11px;
+  top: 1px;
+  left: 1px;
+  width: 12px;
+  height: 12px;
   border: 1.5px solid currentColor;
   border-radius: 999px;
 }
 
 .run-step-icon.icon-failed::after {
-  top: 6px;
-  left: 2px;
+  top: 7px;
+  left: 3px;
   width: 7px;
   height: 1.5px;
   background: currentColor;
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index cb7760a..8aea36d 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -4,7 +4,6 @@ import {
   classifyRunStepIcon,
   clearSessionRunId,
   formatContextUsage,
-  getLatestInFlightStepIndex,
   getSessionRunId,
   renderInlineContent,
   shouldApplySessionSelection,
@@ -17,7 +16,6 @@ const state = {
   currentRunBySession: {},
   editingSessionId: null,
   sessionTitleDrafts: {},
-  expandedRunSteps: {},
   eventController: null,
   eventLoopToken: 0,
   sessionSelectionToken: 0,
@@ -355,48 +353,62 @@ function renderSessions() {
   });
 }
 
-function isRunStepsExpanded(runId) {
-  return !!state.expandedRunSteps?.[runId];
-}
+function normalizeRunTimeline(run, fallbackText = '') {
+  if (!run) return [];
+  if (Array.isArray(run.timeline) && run.timeline.length > 0) {
+    return run.timeline.filter((entry) => {
+      if (!entry || typeof entry !== 'object') return false;
+      if (entry.type === 'text') return typeof entry.text === 'string' && entry.text.length > 0;
+      if (entry.type === 'step') return typeof entry.label === 'string' && entry.label.trim().length > 0;
+      return false;
+    });
+  }
 
-function toggleRunSteps(runId) {
-  if (!runId) return;
-  state.expandedRunSteps = {
-    ...(state.expandedRunSteps || {}),
-    [runId]: !isRunStepsExpanded(runId),
-  };
-  renderTranscript();
+  const steps = Array.isArray(run.steps) ? run.steps : [];
+  const timeline = steps.map((step) => ({
+    type: 'step',
+    kind: step?.kind || 'reasoning',
+    status: step?.status || 'running',
+    label: step?.label || '',
+  }));
+
+  const text = typeof fallbackText === 'string' && fallbackText
+    ? fallbackText
+    : (typeof run.text === 'string' ? run.text : '');
+  if (text) timeline.push({ type: 'text', text });
+  return timeline;
 }
 
-function renderRunSteps(runId, run) {
-  if (!runId || !run || !Array.isArray(run.steps) || run.steps.length === 0) return '';
-  const count = run.steps.length;
-  const expanded = isRunStepsExpanded(runId);
-  const latestStepIndex = getLatestInFlightStepIndex(run);
-
-  const items = run.steps
-    .map((step, index) => {
-      const status = step?.status || 'running';
-      const label = step?.label || 'Step';
-      const icon = classifyRunStepIcon(step);
-      const isLatest = index === latestStepIndex;
-      const shouldPulse = isLatest && status === 'running';
-      const classes = ['step-item', escapeHtml(status)];
-      if (isLatest) classes.push('latest');
-      if (shouldPulse) classes.push('pulse');
-      return `<li class="${classes.join(' ')}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="step-label">${renderInlineContent(label)}</span></li>`;
-    })
-    .join('');
+function getLatestInFlightTimelineStepIndex(run, timeline) {
+  if (!run || run.done) return -1;
+  for (let index = timeline.length - 1; index >= 0; index -= 1) {
+    const entry = timeline[index];
+    if (entry?.type !== 'step') continue;
+    const status = String(entry.status || 'running').toLowerCase();
+    if (status === 'running') return index;
+  }
+  return -1;
+}
 
+function renderRunTimeline(run, fallbackText = '') {
+  const timeline = normalizeRunTimeline(run, fallbackText);
+  if (!timeline.length) return '';
+  const latestStepIndex = getLatestInFlightTimelineStepIndex(run, timeline);
   return `
-    <div class="run-steps-summary">
-      <button type="button" class="steps-toggle ${expanded ? 'open' : ''}" data-run-steps-toggle="${escapeHtml(runId)}">
-        <svg fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2" aria-hidden="true">
-          <path stroke-linecap="round" stroke-linejoin="round" d="M9 5l7 7-7 7"></path>
-        </svg>
-        <strong>${count} step${count === 1 ? '' : 's'}</strong>
-      </button>
-      <ol class="steps-list ${expanded ? 'open' : ''}">${items}</ol>
+    <div class="run-timeline">
+      ${timeline.map((entry, index) => {
+    if (entry.type === 'text') {
+      return `<div class="bubble-assistant"><p>${renderContent(entry.text || '')}</p></div>`;
+    }
+    const status = entry?.status || 'running';
+    const icon = classifyRunStepIcon(entry);
+    const isLatest = index === latestStepIndex;
+    const shouldPulse = isLatest && status === 'running';
+    const classes = ['step-item', 'timeline-step', escapeHtml(status)];
+    if (isLatest) classes.push('latest');
+    if (shouldPulse) classes.push('pulse');
+    return `<div class="${classes.join(' ')}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="step-label">${renderInlineContent(entry.label || 'Step')}</span></div>`;
+  }).join('')}
     </div>
   `;
 }
@@ -406,11 +418,7 @@ function renderContent(value) {
 }
 
 function bindTranscriptHandlers() {
-  transcriptEl.querySelectorAll('button[data-run-steps-toggle]').forEach((button) => {
-    button.addEventListener('click', () => {
-      toggleRunSteps(button.getAttribute('data-run-steps-toggle'));
-    });
-  });
+  // Transcript rows are static render output; no delegated actions required.
 }
 
 function renderTranscript() {
@@ -431,39 +439,30 @@ function renderTranscript() {
     }
 
     const messageRun = msg.runId ? state.value.runs[msg.runId] : null;
+    const timelineHtml = renderRunTimeline(messageRun, msg.text || '');
+    const fallbackHtml = `<div class="bubble-assistant"><p>${renderContent(msg.text || '')}</p></div>`;
     return `
       <article class="message assistant">
         <div class="msg-meta"><span class="msg-author">BrowserForce</span></div>
         <div class="msg-content-wrap">
-          ${renderRunSteps(msg.runId, messageRun)}
-          <div class="bubble-assistant"><p>${renderContent(msg.text || '')}</p></div>
+          ${timelineHtml || fallbackHtml}
         </div>
       </article>
     `;
   });
 
   if (run && !run.done) {
-    if (run.text && run.text.trim()) {
-      chunks.push(`
-        <article class="message assistant">
-          <div class="msg-meta"><span class="msg-author">BrowserForce</span></div>
-          <div class="msg-content-wrap">
-            ${renderRunSteps(sessionRunId, run)}
-            <div class="bubble-assistant"><p>${renderContent(run.text)}</p></div>
-          </div>
-        </article>
-      `);
-    } else {
-      chunks.push(`
-        <article class="message assistant">
-          <div class="msg-meta"><span class="msg-author">BrowserForce</span></div>
-          <div class="msg-content-wrap">
-            ${renderRunSteps(sessionRunId, run)}
-            <div class="thinking-bubble"><div class="spinner"></div><span>Thinking...</span></div>
-          </div>
-        </article>
-      `);
-    }
+    const timelineHtml = renderRunTimeline(run, run.text || '');
+    const shouldShowThinking = !(run.text && run.text.trim());
+    chunks.push(`
+      <article class="message assistant">
+        <div class="msg-meta"><span class="msg-author">BrowserForce</span></div>
+        <div class="msg-content-wrap">
+          ${timelineHtml}
+          ${shouldShowThinking ? '<div class="thinking-bubble"><div class="spinner"></div><span>Thinking...</span></div>' : ''}
+        </div>
+      </article>
+    `);
   }
 
   if (!chunks.length) {
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 1443eaf..50d9370 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -57,12 +57,19 @@ test('session popover renders per-session timestamp metadata', () => {
   assert.match(js, /toLocaleString/);
 });
 
-test('in-flight thinking state keeps run steps visible above the thinking bubble', () => {
+test('in-flight thinking state keeps inline timeline visible above the thinking bubble', () => {
   assert.match(js, /if \(run && !run\.done\)/);
-  assert.match(js, /renderRunSteps\(sessionRunId, run\)/);
+  assert.match(js, /function renderRunTimeline\(run, fallbackText = ''\)/);
+  assert.match(js, /renderRunTimeline\(run, run\.text \|\| ''\)/);
   assert.match(js, /class="thinking-bubble"/);
 });
 
+test('assistant transcript prefers ordered run timeline over grouped run steps', () => {
+  assert.match(js, /function normalizeRunTimeline\(run, fallbackText = ''\)/);
+  assert.match(js, /if \(Array\.isArray\(run\.timeline\) && run\.timeline\.length > 0\)/);
+  assert.match(js, /const timelineHtml = renderRunTimeline\(messageRun, msg\.text \|\| ''\)/);
+});
+
 test('status row renders context usage from latestUsageBySession with fallback', () => {
   assert.match(js, /function renderContextUsageChip\(\)/);
   assert.match(js, /latestUsageBySession/);
diff --git a/test/agent/session-ui-state.test.js b/test/agent/session-ui-state.test.js
index 4f96798..c491fe0 100644
--- a/test/agent/session-ui-state.test.js
+++ b/test/agent/session-ui-state.test.js
@@ -60,6 +60,35 @@ test('messages.loaded hydrates stored run metadata for reopened sessions', () =>
   assert.equal(next.runs.run_1?.steps?.[0]?.label, 'Snapshot page');
 });
 
+test('messages.loaded hydrates stored timeline entries for reopened sessions', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {},
+  };
+
+  const next = reduceState(state, {
+    type: 'messages.loaded',
+    sessionId: 's1',
+    messages: [{
+      role: 'assistant',
+      text: 'Done',
+      runId: 'run_2',
+      timeline: [
+        { type: 'step', kind: 'tool', status: 'done', label: 'execute' },
+        { type: 'text', text: 'Done' },
+      ],
+    }],
+  });
+
+  assert.equal(next.runs.run_2?.done, true);
+  assert.equal(Array.isArray(next.runs.run_2?.timeline), true);
+  assert.equal(next.runs.run_2?.timeline?.length, 2);
+  assert.equal(next.runs.run_2?.timeline?.[0]?.type, 'step');
+  assert.equal(next.runs.run_2?.timeline?.[1]?.type, 'text');
+});
+
 test('session.metadata.loaded hydrates persisted codex usage for reopened session', () => {
   const state = {
     activeSessionId: 's1',
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index cff54f6..3e1c116 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -48,6 +48,36 @@ test('tool and reasoning events are tracked as steps', () => {
   assert.match(s4.runs.r1.steps[1].label, /Planning/);
 });
 
+test('chat and tool events preserve inline timeline order', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, { event: 'chat.delta', runId: 'r1', sessionId: 's1', payload: { delta: 'First chunk. ' } });
+  const s3 = applyEvent(s2, { event: 'tool.started', runId: 'r1', sessionId: 's1', payload: { tool: 'execute' } });
+  const s4 = applyEvent(s3, { event: 'chat.delta', runId: 'r1', sessionId: 's1', payload: { delta: 'Second chunk.' } });
+  const timeline = s4.runs.r1.timeline || [];
+
+  assert.deepEqual(
+    timeline.map((item) => item.type),
+    ['text', 'step', 'text'],
+  );
+  assert.equal(timeline[0]?.text, 'First chunk. ');
+  assert.match(timeline[1]?.label || '', /execute/i);
+  assert.equal(timeline[2]?.text, 'Second chunk.');
+});
+
+test('chat.final stores timeline with assistant transcript message', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, { event: 'chat.delta', runId: 'r1', sessionId: 's1', payload: { delta: 'Done.' } });
+  const s3 = applyEvent(s2, { event: 'tool.started', runId: 'r1', sessionId: 's1', payload: { tool: 'execute' } });
+  const s4 = applyEvent(s3, { event: 'chat.final', runId: 'r1', sessionId: 's1', payload: { text: 'Done.' } });
+  const message = s4.messagesBySession.s1.at(-1);
+
+  assert.equal(message?.role, 'assistant');
+  assert.equal(Array.isArray(message?.timeline), true);
+  assert.equal(message.timeline.length >= 2, true);
+  assert.equal(message.timeline.some((item) => item.type === 'step'), true);
+  assert.equal(message.timeline.some((item) => item.type === 'text'), true);
+});
+
 test('run.error appends a final failed step', () => {
   const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
   const s2 = applyEvent(s1, { event: 'run.error', runId: 'r1', sessionId: 's1', payload: { error: 'boom' } });

From cfc8dae4c743cd9712f83c82c1662ab122c29241 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:19:45 +0530
Subject: [PATCH 132/192] agent: persist ordered run timeline with chat/tool
 events

---
 agent/src/chatd.js               | 105 ++++++++++++++++++++++++++-----
 agent/src/session-store.js       |  43 ++++++++++++-
 test/agent/chatd-api.test.js     |   3 +
 test/agent/session-store.test.js |   8 +++
 4 files changed, 144 insertions(+), 15 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 480138b..2245ac4 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -300,15 +300,22 @@ function trimStepLabel(label) {
   return text.length > 160 ? `${text.slice(0, 157)}...` : text;
 }
 
-function pushRunStep(run, step) {
-  if (!run) return;
-  const steps = Array.isArray(run.steps) ? run.steps : [];
-  const normalized = {
+function normalizeRunStep(step) {
+  if (!step || typeof step !== 'object') return null;
+  const label = trimStepLabel(step.label);
+  if (!label) return null;
+  return {
     kind: String(step?.kind || '').trim() || 'reasoning',
     status: String(step?.status || '').trim() || 'running',
-    label: trimStepLabel(step?.label),
+    label,
   };
-  if (!normalized.label) return;
+}
+
+function pushRunStep(run, step) {
+  if (!run) return;
+  const steps = Array.isArray(run.steps) ? run.steps : [];
+  const normalized = normalizeRunStep(step);
+  if (!normalized || !normalized.label) return;
   const last = steps[steps.length - 1];
   if (last && last.label === normalized.label && last.kind === normalized.kind && last.status === normalized.status) {
     return;
@@ -318,6 +325,64 @@ function pushRunStep(run, step) {
   run.steps = steps;
 }
 
+function pushRunTimelineEntry(run, entry) {
+  if (!run || !entry || typeof entry !== 'object') return;
+  const timeline = Array.isArray(run.timeline) ? run.timeline : [];
+  if (entry.type === 'text') {
+    const text = typeof entry.text === 'string' ? entry.text : '';
+    if (!text) return;
+    const last = timeline[timeline.length - 1];
+    if (last?.type === 'text') {
+      last.text = `${last.text || ''}${text}`;
+    } else {
+      timeline.push({ type: 'text', text });
+    }
+  } else if (entry.type === 'step') {
+    const normalized = normalizeRunStep(entry);
+    if (!normalized) return;
+    const next = { type: 'step', ...normalized };
+    const last = timeline[timeline.length - 1];
+    if (
+      last
+      && last.type === 'step'
+      && last.label === next.label
+      && last.kind === next.kind
+      && last.status === next.status
+    ) {
+      return;
+    }
+    timeline.push(next);
+  } else {
+    return;
+  }
+  if (timeline.length > 200) timeline.shift();
+  run.timeline = timeline;
+}
+
+function runTimelineHasText(run) {
+  return Array.isArray(run?.timeline) && run.timeline.some((entry) => entry?.type === 'text' && entry.text);
+}
+
+function syncFinalTextToRunTimeline(run, finalText) {
+  if (!run) return;
+  const text = String(finalText || '');
+  if (!text) return;
+  const assistantBuffer = String(run.assistantBuffer || '');
+
+  if (!runTimelineHasText(run)) {
+    pushRunTimelineEntry(run, { type: 'text', text });
+    return;
+  }
+  if (assistantBuffer && text.startsWith(assistantBuffer)) {
+    const suffix = text.slice(assistantBuffer.length);
+    if (suffix) pushRunTimelineEntry(run, { type: 'text', text: suffix });
+    return;
+  }
+  if (text !== assistantBuffer) {
+    pushRunTimelineEntry(run, { type: 'text', text });
+  }
+}
+
 function stepLabelForToolEvent(evt) {
   const payload = evt?.payload || {};
   if (evt.event === 'tool.started') {
@@ -402,38 +467,46 @@ function trackRunStep(run, evt) {
   if (!run || !evt?.event) return;
 
   if (evt.event === 'tool.started' || evt.event === 'tool.delta' || evt.event === 'tool.final') {
-    pushRunStep(run, {
+    const step = {
       kind: evt.event === 'tool.delta' ? 'reasoning' : 'tool',
       status: evt.event === 'tool.final' ? 'done' : 'running',
       label: stepLabelForToolEvent(evt),
-    });
+    };
+    pushRunStep(run, step);
+    pushRunTimelineEntry(run, { type: 'step', ...step });
     return;
   }
 
   if (evt.event === 'run.event') {
-    pushRunStep(run, {
+    const step = {
       kind: stepKindForRunEvent(evt),
       status: stepStatusForRunEvent(evt),
       label: stepLabelForRunEvent(evt),
-    });
+    };
+    pushRunStep(run, step);
+    pushRunTimelineEntry(run, { type: 'step', ...step });
     return;
   }
 
   if (evt.event === 'run.error') {
-    pushRunStep(run, {
+    const step = {
       kind: 'status',
       status: 'failed',
       label: `Failed: ${evt.payload?.error || 'Unknown error'}`,
-    });
+    };
+    pushRunStep(run, step);
+    pushRunTimelineEntry(run, { type: 'step', ...step });
     return;
   }
 
   if (evt.event === 'run.aborted') {
-    pushRunStep(run, {
+    const step = {
       kind: 'status',
       status: 'aborted',
       label: 'Stopped',
-    });
+    };
+    pushRunStep(run, step);
+    pushRunTimelineEntry(run, { type: 'step', ...step });
   }
 }
 
@@ -551,12 +624,14 @@ export async function startChatd(opts = {}) {
     if (!run || run.status !== 'running' || run.finalSent) return;
     run.finalSent = true;
     run.status = 'done';
+    syncFinalTextToRunTimeline(run, finalText);
     await appendMessage({
       sessionId: run.sessionId,
       role: 'assistant',
       text: finalText,
       runId: run.runId,
       steps: run.steps,
+      timeline: run.timeline,
       storageRoot,
     });
     broadcast(buildEvent({ event: 'chat.final', runId: run.runId, sessionId: run.sessionId, payload: { text: finalText } }));
@@ -771,6 +846,7 @@ export async function startChatd(opts = {}) {
           abort: null,
           assistantBuffer: '',
           steps: [],
+          timeline: [],
           finalSent: false,
           queue: Promise.resolve(),
           lastError: null,
@@ -803,6 +879,7 @@ export async function startChatd(opts = {}) {
                   const delta = evt.payload?.delta || '';
                   if (delta) {
                     active.assistantBuffer += delta;
+                    pushRunTimelineEntry(active, { type: 'text', text: delta });
                     broadcast(buildEvent({ event: 'chat.delta', runId, sessionId, payload: { delta } }));
                   }
                   return;
diff --git a/agent/src/session-store.js b/agent/src/session-store.js
index 6ff7759..a624f4b 100644
--- a/agent/src/session-store.js
+++ b/agent/src/session-store.js
@@ -71,6 +71,43 @@ function normalizeSteps(steps) {
     .slice(-100);
 }
 
+function normalizeTimelineEntry(entry) {
+  if (!entry || typeof entry !== 'object') return null;
+  if (entry.type === 'text') {
+    const text = typeof entry.text === 'string' ? entry.text : '';
+    if (!text) return null;
+    return { type: 'text', text };
+  }
+  const step = normalizeStep(entry);
+  if (!step) return null;
+  return { type: 'step', ...step };
+}
+
+function normalizeTimeline(timeline) {
+  if (!Array.isArray(timeline)) return [];
+  const entries = [];
+  for (const item of timeline.slice(-200)) {
+    const normalized = normalizeTimelineEntry(item);
+    if (!normalized) continue;
+    const last = entries[entries.length - 1];
+    if (normalized.type === 'text' && last?.type === 'text') {
+      last.text = `${last.text || ''}${normalized.text || ''}`;
+      continue;
+    }
+    if (
+      normalized.type === 'step'
+      && last?.type === 'step'
+      && last.label === normalized.label
+      && last.kind === normalized.kind
+      && last.status === normalized.status
+    ) {
+      continue;
+    }
+    entries.push(normalized);
+  }
+  return entries.slice(-200);
+}
+
 async function ensureStorageRoot(storageRoot) {
   await fs.mkdir(storageRoot, { recursive: true });
 }
@@ -287,7 +324,7 @@ export async function updateSession({ sessionId, patch = {}, storageRoot } = {})
   });
 }
 
-export async function appendMessage({ sessionId, role, text, runId, steps, storageRoot } = {}) {
+export async function appendMessage({ sessionId, role, text, runId, steps, timeline, storageRoot } = {}) {
   assertValidSessionId(sessionId, 'appendMessage');
   if (!role) throw new Error('appendMessage requires role');
   if (typeof text !== 'string') throw new Error('appendMessage requires text');
@@ -311,6 +348,10 @@ export async function appendMessage({ sessionId, role, text, runId, steps, stora
   if (normalizedSteps.length > 0) {
     entry.steps = normalizedSteps;
   }
+  const normalizedTimeline = normalizeTimeline(timeline);
+  if (normalizedTimeline.length > 0) {
+    entry.timeline = normalizedTimeline;
+  }
 
   const logPath = messageLogPath(root, sessionId);
   await fs.appendFile(logPath, `${JSON.stringify(entry)}\n`, 'utf8');
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index b0804dc..3507b3d 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -272,6 +272,9 @@ test('POST /v1/runs persists run steps so reopened sessions can render them', as
     assert.equal(Array.isArray(assistant?.steps), true);
     assert.equal(assistant.steps.length >= 1, true);
     assert.equal(assistant.steps.some((step) => /Inspecting active tab/.test(step?.label || '')), true);
+    assert.equal(Array.isArray(assistant?.timeline), true);
+    assert.equal(assistant.timeline.some((item) => item?.type === 'step'), true);
+    assert.equal(assistant.timeline.some((item) => item?.type === 'text' && /done/i.test(item?.text || '')), true);
   } finally {
     await daemon.stop();
   }
diff --git a/test/agent/session-store.test.js b/test/agent/session-store.test.js
index a134e65..162a95f 100644
--- a/test/agent/session-store.test.js
+++ b/test/agent/session-store.test.js
@@ -42,12 +42,20 @@ test('messages preserve optional run metadata used for transcript rehydration',
     text: 'done',
     runId: 'run_123',
     steps: [{ kind: 'tool', status: 'done', label: 'Snapshot page' }],
+    timeline: [
+      { type: 'step', kind: 'tool', status: 'done', label: 'Snapshot page' },
+      { type: 'text', text: 'done' },
+    ],
     storageRoot,
   });
   const rows = await readMessages({ sessionId, limit: 20, storageRoot });
   const last = rows.at(-1);
   assert.equal(last.runId, 'run_123');
   assert.deepEqual(last.steps, [{ kind: 'tool', status: 'done', label: 'Snapshot page' }]);
+  assert.deepEqual(last.timeline, [
+    { type: 'step', kind: 'tool', status: 'done', label: 'Snapshot page' },
+    { type: 'text', text: 'done' },
+  ]);
 });
 
 test('rejects unsafe session ids', async () => {

From 764f7ada8721b42af69716b90ccde000e2ea6e8d Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:47:46 +0530
Subject: [PATCH 133/192] fix(sidepanel): show context usage only when
 available below composer

---
 extension/agent-panel.css                    | 30 +++++++++-----------
 extension/agent-panel.html                   |  3 +-
 extension/agent-panel.js                     |  8 +++++-
 test/agent/agent-panel-contract.test.js      |  4 +++
 test/agent/agent-panel-send-contract.test.js |  7 +++--
 5 files changed, 29 insertions(+), 23 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 25a5f8c..48051dd 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -53,7 +53,6 @@ body {
   align-items: center;
   gap: 8px;
   padding: 12px 14px;
-  flex-wrap: wrap;
 }
 
 .pill-btn {
@@ -129,22 +128,6 @@ body {
   justify-content: center;
 }
 
-.context-usage-chip {
-  min-width: 0;
-  max-width: 100%;
-  padding: 0 10px;
-  height: 24px;
-  border-radius: 999px;
-  border: 1px solid rgba(255, 255, 255, 0.2);
-  background: rgba(255, 255, 255, 0.08);
-  color: rgba(255, 255, 255, 0.78);
-  font-size: 11px;
-  line-height: 22px;
-  white-space: nowrap;
-  overflow: hidden;
-  text-overflow: ellipsis;
-}
-
 .status-dot {
   width: 8px;
   height: 8px;
@@ -558,6 +541,9 @@ body {
   background: #fff;
   border-top: 1px solid var(--line);
   padding: 10px 12px;
+  display: flex;
+  flex-direction: column;
+  gap: 6px;
 }
 
 .composer-box {
@@ -602,6 +588,16 @@ body {
   flex-shrink: 0;
 }
 
+.context-usage-note {
+  font-size: 10px;
+  line-height: 1.35;
+  color: var(--text-subtle);
+  padding: 0 4px;
+  white-space: nowrap;
+  overflow: hidden;
+  text-overflow: ellipsis;
+}
+
 .btn-stop,
 .btn-send {
   width: 32px;
diff --git a/extension/agent-panel.html b/extension/agent-panel.html
index 7af63d6..e2355a5 100644
--- a/extension/agent-panel.html
+++ b/extension/agent-panel.html
@@ -30,8 +30,6 @@
           </svg>
         </button>
 
-        <div id="bf-context-usage" class="context-usage-chip" title="Context: unavailable">Context: unavailable</div>
-
         <div id="bf-agent-status" class="status-circle" title="Starting...">
           <span id="bf-agent-status-icon" class="status-dot" aria-hidden="true"></span>
           <span id="bf-agent-status-text" class="sr-only">Starting...</span>
@@ -62,6 +60,7 @@
             </button>
           </div>
         </div>
+        <div id="bf-context-usage" class="context-usage-note hidden" aria-live="polite"></div>
       </form>
     </section>
 
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 8aea36d..5b2ca0a 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -108,7 +108,13 @@ function renderContextUsageChip() {
   const sessionId = state.value.activeSessionId;
   const usage = sessionId ? state.value.latestUsageBySession?.[sessionId] : null;
   const formatted = formatContextUsage(usage || {});
-  contextUsageEl.textContent = formatted ? `Context: ${formatted}` : 'Context: unavailable';
+  contextUsageEl.classList.toggle('hidden', !formatted);
+  if (!formatted) {
+    contextUsageEl.textContent = '';
+    contextUsageEl.removeAttribute('title');
+    return;
+  }
+  contextUsageEl.textContent = `Context: ${formatted}`;
   contextUsageEl.title = contextUsageEl.textContent;
 }
 
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index 0c7115d..2eef4ca 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -18,6 +18,10 @@ test('agent panel has inline model and session selectors with popovers', () => {
   assert.match(html, /id="bf-tab-attach-text"/);
   assert.match(html, /id="bf-attach-current-tab"/);
   assert.match(html, /id="bf-context-usage"/);
+  assert.match(
+    html,
+    /id="bf-chat-form"[\s\S]*class="composer-box"[\s\S]*<\/div>\s*<div id="bf-context-usage"/,
+  );
 });
 
 test('agent panel no longer renders title or persistent session sidebar', () => {
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 50d9370..7860f76 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -70,9 +70,10 @@ test('assistant transcript prefers ordered run timeline over grouped run steps',
   assert.match(js, /const timelineHtml = renderRunTimeline\(messageRun, msg\.text \|\| ''\)/);
 });
 
-test('status row renders context usage from latestUsageBySession with fallback', () => {
+test('context usage renderer hides element when unavailable and only shows formatted values', () => {
   assert.match(js, /function renderContextUsageChip\(\)/);
   assert.match(js, /latestUsageBySession/);
-  assert.match(js, /Context:\s*unavailable/);
-  assert.match(js, /formatted \? `Context: \$\{formatted\}` : 'Context: unavailable'/);
+  assert.match(js, /contextUsageEl\.classList\.toggle\('hidden', !formatted\)/);
+  assert.match(js, /contextUsageEl\.textContent = `Context: \$\{formatted\}`/);
+  assert.doesNotMatch(js, /Context:\s*unavailable/);
 });

From e49e899b6376b1c8559eafe83db96e0577c87503 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:48:22 +0530
Subject: [PATCH 134/192] docs(dev): add contributor guide for relay ports and
 streaming debug workflows

---
 README.md           |   2 +
 docs/DEVELOPMENT.md | 143 ++++++++++++++++++++++++++++++++++++++++++++
 2 files changed, 145 insertions(+)
 create mode 100644 docs/DEVELOPMENT.md

diff --git a/README.md b/README.md
index 033da4c..9948bc9 100644
--- a/README.md
+++ b/README.md
@@ -83,6 +83,8 @@ After loading, the extension icon appears in your toolbar (gray = disconnected).
 
 The relay auto-starts when you run any command or connect via MCP — no manual step needed. Extension icon turns green once connected.
 
+Contributor/dev workflows (alternate ports, stream debugging, MCP wiring) are documented in [docs/DEVELOPMENT.md](docs/DEVELOPMENT.md).
+
 To run the relay manually (optional):
 
 ```bash
diff --git a/docs/DEVELOPMENT.md b/docs/DEVELOPMENT.md
new file mode 100644
index 0000000..fa941fe
--- /dev/null
+++ b/docs/DEVELOPMENT.md
@@ -0,0 +1,143 @@
+# BrowserForce Development Guide
+
+This guide is for contributors who need a fast local dev/debug loop.
+
+## Quickstart
+
+1. Install deps:
+
+```bash
+pnpm install
+```
+
+2. Run relay and MCP from this repo:
+
+```bash
+pnpm relay
+pnpm mcp
+```
+
+3. Load extension from this repo in Chrome (`chrome://extensions` -> Load unpacked -> `extension/`).
+
+4. In popup, ensure Relay URL is:
+
+```text
+ws://127.0.0.1:19222/extension
+```
+
+## Run on a Different Relay Port (Local Debug Hack)
+
+Use this when another BrowserForce instance is already running or you want isolated debugging.
+
+1. Start relay on a non-default port:
+
+```bash
+RELAY_PORT=19333 pnpm relay
+```
+
+2. In extension popup, set Relay URL to:
+
+```text
+ws://127.0.0.1:19333/extension
+```
+
+3. Make MCP use the same relay port.
+
+If your MCP client is configured with `npx browserforce@latest mcp`, inject `RELAY_PORT=19333` in the MCP command.
+
+Example shape:
+
+```json
+{
+  "command": "env",
+  "args": ["RELAY_PORT=19333", "npx", "-y", "browserforce@latest", "mcp"]
+}
+```
+
+Fallback (if you cannot pass `RELAY_PORT` in MCP config): set `BF_CDP_URL` to the exact ws URL from `~/.browserforce/cdp-url`.
+
+## Debug Side-Panel Streaming Events
+
+The side-panel receives SSE from chatd (`/v1/events`). You can inspect the same stream directly.
+
+1. Start agent daemon:
+
+```bash
+browserforce agent start
+```
+
+2. Load auth data:
+
+```bash
+PORT=$(jq -r '.port' ~/.browserforce/chatd-url.json)
+TOKEN=$(jq -r '.token' ~/.browserforce/chatd-url.json)
+BASE="http://127.0.0.1:$PORT"
+```
+
+3. Create a test session:
+
+```bash
+SESSION_ID=$(curl -sS "$BASE/v1/sessions" \
+  -H "Authorization: Bearer $TOKEN" \
+  -H "Content-Type: application/json" \
+  -d '{"title":"debug-stream"}' | jq -r '.sessionId')
+echo "$SESSION_ID"
+```
+
+4. Terminal A: watch stream:
+
+```bash
+curl -N -sS "$BASE/v1/events?sessionId=$SESSION_ID" \
+  -H "Authorization: Bearer $TOKEN" \
+| awk '/^data: /{sub(/^data: /,""); print}' \
+| jq -c '{event, runId, sessionId, payload}'
+```
+
+5. Terminal B: trigger a run:
+
+```bash
+curl -sS "$BASE/v1/runs" \
+  -H "Authorization: Bearer $TOKEN" \
+  -H "Content-Type: application/json" \
+  -d "{\"sessionId\":\"$SESSION_ID\",\"message\":\"say hello and stop\"}" | jq
+```
+
+Useful filters:
+
+```bash
+# only assistant deltas
+... | jq -c 'select(.event=="chat.delta")'
+
+# continuity + telemetry
+... | jq -c 'select(.event=="run.provider_session" or .event=="run.usage")'
+```
+
+## Debug CDP Traffic
+
+Relay writes CDP traffic to:
+
+```text
+~/.browserforce/cdp.jsonl
+```
+
+Tail live:
+
+```bash
+tail -f ~/.browserforce/cdp.jsonl | jq -c '{ts, direction, method: (.message.method // "response")}'
+```
+
+Method summary:
+
+```bash
+jq -r '.direction + "\t" + (.message.method // "response")' ~/.browserforce/cdp.jsonl | uniq -c
+```
+
+## Test Commands (Common While Developing)
+
+```bash
+pnpm test
+node --test test/agent/chatd-api.test.js
+node --test test/agent/codex-runner.test.js
+node --test test/agent/session-store.test.js
+node --test test/agent/agent-panel-contract.test.js test/agent/agent-panel-send-contract.test.js
+```

From 6b0cd93e39b7b9392f3e2bfb7289f3cd879034d9 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:52:06 +0530
Subject: [PATCH 135/192] fix(sidepanel): open panel first and attach tab
 asynchronously

---
 extension/agent-panel.js                     | 31 ++++++++++++++++----
 test/agent/agent-panel-send-contract.test.js | 17 +++++++++--
 2 files changed, 41 insertions(+), 7 deletions(-)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 5b2ca0a..06358f6 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -14,6 +14,8 @@ const state = {
   auth: null,
   modelPresets: [{ value: null, label: 'Default' }],
   currentRunBySession: {},
+  initialTabAttachInFlight: false,
+  initialTabAttachStarted: false,
   editingSessionId: null,
   sessionTitleDrafts: {},
   eventController: null,
@@ -108,14 +110,17 @@ function renderContextUsageChip() {
   const sessionId = state.value.activeSessionId;
   const usage = sessionId ? state.value.latestUsageBySession?.[sessionId] : null;
   const formatted = formatContextUsage(usage || {});
-  contextUsageEl.classList.toggle('hidden', !formatted);
-  if (!formatted) {
+  const note = state.initialTabAttachInFlight
+    ? 'Attaching active tab...'
+    : (formatted ? `Context: ${formatted}` : '');
+  contextUsageEl.classList.toggle('hidden', !note);
+  if (!note) {
     contextUsageEl.textContent = '';
     contextUsageEl.removeAttribute('title');
     return;
   }
-  contextUsageEl.textContent = `Context: ${formatted}`;
-  contextUsageEl.title = contextUsageEl.textContent;
+  contextUsageEl.textContent = note;
+  contextUsageEl.title = note;
 }
 
 function setStatus(kind, text) {
@@ -658,6 +663,21 @@ function bindTabAttachWatchers() {
   }
 }
 
+function startInitialTabAttach() {
+  if (state.initialTabAttachStarted) return;
+  state.initialTabAttachStarted = true;
+  state.initialTabAttachInFlight = true;
+  renderContextUsageChip();
+  ensureCurrentTabAttached()
+    .catch(() => {
+      // best-effort only
+    })
+    .finally(() => {
+      state.initialTabAttachInFlight = false;
+      renderContextUsageChip();
+    });
+}
+
 async function getActiveTabContext() {
   if (!chrome?.tabs?.query) return null;
   try {
@@ -1053,8 +1073,9 @@ popoverBackdropEl.addEventListener('click', () => {
   try {
     setComposerEnabled(false);
     setStatus('info', 'Connecting...');
+    render();
+    startInitialTabAttach();
     await loadAuth();
-    await ensureCurrentTabAttached();
     bindTabAttachWatchers();
     try {
       await loadModelPresets();
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 7860f76..f796abd 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -73,7 +73,20 @@ test('assistant transcript prefers ordered run timeline over grouped run steps',
 test('context usage renderer hides element when unavailable and only shows formatted values', () => {
   assert.match(js, /function renderContextUsageChip\(\)/);
   assert.match(js, /latestUsageBySession/);
-  assert.match(js, /contextUsageEl\.classList\.toggle\('hidden', !formatted\)/);
-  assert.match(js, /contextUsageEl\.textContent = `Context: \$\{formatted\}`/);
+  assert.match(js, /const note = state\.initialTabAttachInFlight[\s\S]*formatted[\s\S]*Context: \$\{formatted\}/);
+  assert.match(js, /contextUsageEl\.classList\.toggle\('hidden', !note\)/);
+  assert.match(js, /contextUsageEl\.textContent = note/);
   assert.doesNotMatch(js, /Context:\s*unavailable/);
 });
+
+test('init opens smoothly by starting tab attach asynchronously', () => {
+  assert.match(js, /function startInitialTabAttach\(\)/);
+  assert.match(js, /\(async function init\(\)[\s\S]*startInitialTabAttach\(\);/);
+  assert.doesNotMatch(js, /\(async function init\(\)[\s\S]*await ensureCurrentTabAttached\(\);/);
+});
+
+test('bottom note can show async attach status and still hides when no note is available', () => {
+  assert.match(js, /initialTabAttachInFlight:\s*false/);
+  assert.match(js, /state\.initialTabAttachInFlight\s*\?\s*'Attaching active tab\.\.\.'/);
+  assert.match(js, /contextUsageEl\.classList\.toggle\('hidden', !note\)/);
+});

From 755d85f2b5dd906404aa5ee6c5d8c261880cb357 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 17:57:26 +0530
Subject: [PATCH 136/192] fix(sidepanel): defer first tab attach until after
 initial paint

---
 extension/agent-panel.js                     | 20 ++++++++++++--------
 test/agent/agent-panel-send-contract.test.js |  7 +++++++
 2 files changed, 19 insertions(+), 8 deletions(-)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 06358f6..8ecd4dd 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -668,14 +668,18 @@ function startInitialTabAttach() {
   state.initialTabAttachStarted = true;
   state.initialTabAttachInFlight = true;
   renderContextUsageChip();
-  ensureCurrentTabAttached()
-    .catch(() => {
-      // best-effort only
-    })
-    .finally(() => {
-      state.initialTabAttachInFlight = false;
-      renderContextUsageChip();
-    });
+  window.requestAnimationFrame(() => {
+    window.setTimeout(() => {
+      ensureCurrentTabAttached()
+        .catch(() => {
+          // best-effort only
+        })
+        .finally(() => {
+          state.initialTabAttachInFlight = false;
+          renderContextUsageChip();
+        });
+    }, 0);
+  });
 }
 
 async function getActiveTabContext() {
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index f796abd..6601e56 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -90,3 +90,10 @@ test('bottom note can show async attach status and still hides when no note is a
   assert.match(js, /state\.initialTabAttachInFlight\s*\?\s*'Attaching active tab\.\.\.'/);
   assert.match(js, /contextUsageEl\.classList\.toggle\('hidden', !note\)/);
 });
+
+test('initial tab attach is deferred until after first paint', () => {
+  const fnMatch = js.match(/function startInitialTabAttach\(\)[\s\S]*?\n}\n\nasync function getActiveTabContext/);
+  assert.ok(fnMatch, 'startInitialTabAttach function block should be present');
+  const fnBlock = fnMatch[0];
+  assert.match(fnBlock, /window\.requestAnimationFrame\(\(\)\s*=>\s*\{/);
+});

From ae4196b0c1602a38cb53a33dcce8d8803ec315f9 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:00:32 +0530
Subject: [PATCH 137/192] fix(sidepanel): delay initial tab attach by 2s

---
 extension/agent-panel.js                     | 22 +++++++++-----------
 test/agent/agent-panel-send-contract.test.js |  5 +++--
 2 files changed, 13 insertions(+), 14 deletions(-)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 8ecd4dd..84232e7 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -668,18 +668,16 @@ function startInitialTabAttach() {
   state.initialTabAttachStarted = true;
   state.initialTabAttachInFlight = true;
   renderContextUsageChip();
-  window.requestAnimationFrame(() => {
-    window.setTimeout(() => {
-      ensureCurrentTabAttached()
-        .catch(() => {
-          // best-effort only
-        })
-        .finally(() => {
-          state.initialTabAttachInFlight = false;
-          renderContextUsageChip();
-        });
-    }, 0);
-  });
+  window.setTimeout(() => {
+    ensureCurrentTabAttached()
+      .catch(() => {
+        // best-effort only
+      })
+      .finally(() => {
+        state.initialTabAttachInFlight = false;
+        renderContextUsageChip();
+      });
+  }, 2000);
 }
 
 async function getActiveTabContext() {
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 6601e56..ed2622c 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -91,9 +91,10 @@ test('bottom note can show async attach status and still hides when no note is a
   assert.match(js, /contextUsageEl\.classList\.toggle\('hidden', !note\)/);
 });
 
-test('initial tab attach is deferred until after first paint', () => {
+test('initial tab attach waits 2 seconds before attaching', () => {
   const fnMatch = js.match(/function startInitialTabAttach\(\)[\s\S]*?\n}\n\nasync function getActiveTabContext/);
   assert.ok(fnMatch, 'startInitialTabAttach function block should be present');
   const fnBlock = fnMatch[0];
-  assert.match(fnBlock, /window\.requestAnimationFrame\(\(\)\s*=>\s*\{/);
+  assert.match(fnBlock, /window\.setTimeout\(\(\)\s*=>\s*\{/);
+  assert.match(fnBlock, /},\s*2000\)/);
 });

From e147203a9876a705b66244f76dd162a6f4b4f639 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:06:12 +0530
Subject: [PATCH 138/192] fix(sidepanel): preserve partial assistant output
 when run is stopped

---
 extension/agent-panel-state.js | 25 +++++++++++++++++++++++--
 test/agent/sse-events.test.js  | 14 ++++++++++++++
 2 files changed, 37 insertions(+), 2 deletions(-)

diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 1087c24..48e7bbd 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -410,20 +410,41 @@ export function applyEvent(state = initialState, evt = {}) {
   }
 
   if (evt.event === 'run.aborted') {
-    const run = state.runs[evt.runId] || { steps: [], timeline: [] };
+    const run = state.runs[evt.runId] || { text: '', steps: [], timeline: [] };
     const step = {
       kind: 'status',
       status: 'aborted',
       label: 'Stopped',
     };
+    const timeline = pushTimelineEntry(run, { type: 'step', ...step });
+    const hasContentBeforeStop = Boolean(
+      (typeof run.text === 'string' && run.text)
+      || (Array.isArray(run.timeline) && run.timeline.length > 0),
+    );
+    const currentMessages = state.messagesBySession[evt.sessionId] || [];
+    const hasStoredFinal = currentMessages.some(
+      (message) => message.runId === evt.runId && message.role === 'assistant',
+    );
+    const nextMessages = (!hasStoredFinal && hasContentBeforeStop)
+      ? [...currentMessages, {
+        role: 'assistant',
+        text: run.text || '',
+        runId: evt.runId,
+        timeline,
+      }]
+      : currentMessages;
     return {
       ...state,
+      messagesBySession: {
+        ...state.messagesBySession,
+        [evt.sessionId]: nextMessages,
+      },
       runs: upsertRun(state, evt.runId, {
         sessionId: evt.sessionId,
         done: true,
         aborted: true,
         steps: pushStep(run, step),
-        timeline: pushTimelineEntry(run, { type: 'step', ...step }),
+        timeline,
       }),
     };
   }
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index 3e1c116..3244f14 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -31,6 +31,20 @@ test('run.aborted marks run terminal', () => {
   assert.equal(next.runs.r1.aborted, true);
 });
 
+test('run.aborted preserves partial assistant output in transcript history', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, { event: 'chat.delta', runId: 'r1', sessionId: 's1', payload: { delta: 'Partial answer' } });
+  const s3 = applyEvent(s2, { event: 'run.aborted', runId: 'r1', sessionId: 's1', payload: {} });
+  const message = s3.messagesBySession.s1?.at(-1);
+
+  assert.equal(message?.role, 'assistant');
+  assert.equal(message?.runId, 'r1');
+  assert.equal(message?.text, 'Partial answer');
+  assert.equal(Array.isArray(message?.timeline), true);
+  assert.equal(message.timeline.some((item) => item.type === 'text'), true);
+  assert.equal(message.timeline.some((item) => item.type === 'step' && item.status === 'aborted'), true);
+});
+
 test('tool and reasoning events are tracked as steps', () => {
   const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
   const s2 = applyEvent(s1, { event: 'tool.started', runId: 'r1', sessionId: 's1', payload: { tool: 'fetch' } });

From 4dc0dd5f4d56976da9088cd162fd8d0a8d1575fc Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:07:40 +0530
Subject: [PATCH 139/192] fix(chatd): persist partial assistant output when run
 is aborted

---
 agent/src/chatd.js           | 22 +++++++++++++
 test/agent/chatd-api.test.js | 60 ++++++++++++++++++++++++++++++++++++
 2 files changed, 82 insertions(+)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 2245ac4..a2290ff 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -650,6 +650,27 @@ export async function startChatd(opts = {}) {
     runs.delete(run.runId);
   }
 
+  async function persistAbortedRun(run) {
+    if (!run) return;
+    trackRunStep(run, { event: 'run.aborted', payload: {} });
+    const partialText = String(run.assistantBuffer || '');
+    syncFinalTextToRunTimeline(run, partialText);
+    const hasContent = Boolean(
+      partialText
+      || (Array.isArray(run.timeline) && run.timeline.length > 0),
+    );
+    if (!hasContent) return;
+    await appendMessage({
+      sessionId: run.sessionId,
+      role: 'assistant',
+      text: partialText,
+      runId: run.runId,
+      steps: run.steps,
+      timeline: run.timeline,
+      storageRoot,
+    });
+  }
+
   const server = http.createServer(async (req, res) => {
     try {
       const base = `http://${req.headers.host || '127.0.0.1'}`;
@@ -1015,6 +1036,7 @@ export async function startChatd(opts = {}) {
         }
 
         run.status = 'aborted';
+        await persistAbortedRun(run);
         run.abort?.();
         runs.delete(decodedRunId);
         broadcast(buildEvent({ event: 'run.aborted', runId: decodedRunId, sessionId: run.sessionId, payload: {} }));
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 3507b3d..3398e5c 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -280,6 +280,66 @@ test('POST /v1/runs persists run steps so reopened sessions can render them', as
   }
 });
 
+test('POST /v1/runs abort persists partial assistant output for session reloads', async () => {
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    runExecutor: ({ runId, sessionId, onEvent }) => {
+      setTimeout(() => {
+        onEvent({ event: 'chat.delta', runId, sessionId, payload: { delta: 'Partial answer' } });
+      }, 10);
+      setTimeout(() => {
+        onEvent({ event: 'tool.started', runId, sessionId, payload: { tool: 'snapshot' } });
+      }, 15);
+      return { abort() {} };
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Abort persistence' }),
+    }).then((res) => res.json());
+
+    const runRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'start and stop' }),
+    });
+    assert.equal(runRes.status, 202);
+    const runBody = await runRes.json();
+
+    await new Promise((resolve) => setTimeout(resolve, 60));
+
+    const abortRes = await fetch(`${daemon.baseUrl}/v1/runs/${encodeURIComponent(runBody.runId)}/abort`, {
+      method: 'DELETE',
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(abortRes.status, 200);
+
+    const messagesBody = await fetch(
+      `${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}/messages`,
+      { headers: { authorization: `Bearer ${daemon.token}` } },
+    ).then((res) => res.json());
+    const assistant = (messagesBody.messages || []).at(-1);
+
+    assert.equal(assistant?.role, 'assistant');
+    assert.equal(assistant?.runId, runBody.runId);
+    assert.equal(assistant?.text, 'Partial answer');
+    assert.equal(Array.isArray(assistant?.timeline), true);
+    assert.equal(assistant.timeline.some((item) => item?.type === 'step' && item?.status === 'aborted'), true);
+  } finally {
+    await daemon.stop();
+  }
+});
+
 test('runExecutor synchronous failure does not leak abortable run', async () => {
   const storageRoot = mkdtempSync(join(tmpdir(), 'bf-chatd-run-fail-'));
   let attemptedRunId = null;

From 7d69d82c8463ea491cec6382d9c179ff3b0b4935 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:11:32 +0530
Subject: [PATCH 140/192] feat(sidepanel): collapse tool-call steps with
 expandable details

---
 extension/agent-panel-state.js               | 94 +++++++++++++++++++-
 extension/agent-panel.css                    | 55 ++++++++++++
 extension/agent-panel.js                     | 58 +++++++++++-
 test/agent/agent-panel-send-contract.test.js |  6 ++
 test/agent/sse-events.test.js                | 24 +++++
 5 files changed, 230 insertions(+), 7 deletions(-)

diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 48e7bbd..7cd3fe7 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -19,14 +19,57 @@ function trimStepLabel(label) {
   return text.length > 160 ? `${text.slice(0, 157)}...` : text;
 }
 
+function normalizeStepDetails(details, label = '') {
+  const lines = [];
+  const pushLine = (value) => {
+    const line = String(value || '')
+      .split('\n')
+      .map((part) => part.trim())
+      .filter(Boolean);
+    for (const rawPart of line) {
+      const part = rawPart.replace(/^[-*]\s+/, '').trim();
+      if (!part) continue;
+      if (part === label) continue;
+      if (lines.includes(part)) continue;
+      lines.push(part.length > 220 ? `${part.slice(0, 217)}...` : part);
+      if (lines.length >= 8) return;
+    }
+  };
+  const visit = (value) => {
+    if (value == null) return;
+    if (Array.isArray(value)) {
+      for (const item of value) {
+        if (lines.length >= 8) return;
+        visit(item);
+      }
+      return;
+    }
+    if (typeof value === 'object') {
+      visit(value.text);
+      visit(value.message);
+      visit(value.output);
+      visit(value.command);
+      visit(value.path);
+      visit(value.query);
+      visit(value.pattern);
+      return;
+    }
+    pushLine(value);
+  };
+  visit(details);
+  return lines;
+}
+
 function normalizeStep(step) {
   if (!step || typeof step !== 'object') return null;
   const label = trimStepLabel(step.label);
   if (!label) return null;
+  const details = normalizeStepDetails(step.details, label);
   return {
     kind: step.kind || 'reasoning',
     status: step.status || 'running',
     label,
+    ...(details.length > 0 ? { details } : {}),
   };
 }
 
@@ -35,7 +78,13 @@ function pushStep(run, step) {
   const normalized = normalizeStep(step);
   if (!normalized || !normalized.label) return steps;
   const last = steps[steps.length - 1];
-  if (last && last.label === normalized.label && last.kind === normalized.kind && last.status === normalized.status) {
+  if (
+    last
+    && last.label === normalized.label
+    && last.kind === normalized.kind
+    && last.status === normalized.status
+    && JSON.stringify(last.details || []) === JSON.stringify(normalized.details || [])
+  ) {
     return steps;
   }
   steps.push(normalized);
@@ -67,6 +116,7 @@ function pushTimelineEntry(run, entry) {
       && last.label === candidate.label
       && last.kind === candidate.kind
       && last.status === candidate.status
+      && JSON.stringify(last.details || []) === JSON.stringify(candidate.details || [])
     ) {
       return timeline;
     }
@@ -164,6 +214,24 @@ function stepLabelForToolEvent(evt) {
   return '';
 }
 
+function stepDetailsForToolEvent(evt, label) {
+  const payload = evt?.payload || {};
+  return normalizeStepDetails([
+    payload.details,
+    payload.text,
+    payload.message,
+    payload.delta,
+    payload.command,
+    payload.path,
+    payload.query,
+    payload.pattern,
+    payload.args,
+    payload.paths,
+    payload.items,
+    payload.item,
+  ], label);
+}
+
 function humanizeToken(value) {
   const normalized = String(value || '')
     .trim()
@@ -209,6 +277,24 @@ function stepLabelForRunEvent(evt) {
   ]) || 'Working...';
 }
 
+function stepDetailsForRunEvent(evt, label) {
+  const payload = evt?.payload || {};
+  return normalizeStepDetails([
+    payload.details,
+    payload.text,
+    payload.message,
+    payload.delta,
+    payload.command,
+    payload.path,
+    payload.query,
+    payload.pattern,
+    payload.args,
+    payload.paths,
+    payload.items,
+    payload.item,
+  ], label);
+}
+
 function upsertRun(state, runId, patch) {
   return {
     ...state.runs,
@@ -458,7 +544,8 @@ export function applyEvent(state = initialState, evt = {}) {
       ? 'reasoning'
       : 'tool';
     const label = stepLabelForToolEvent(evt);
-    const step = { kind, status, label };
+    const details = stepDetailsForToolEvent(evt, label);
+    const step = { kind, status, label, ...(details.length > 0 ? { details } : {}) };
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
@@ -475,7 +562,8 @@ export function applyEvent(state = initialState, evt = {}) {
     const status = stepStatusForRunEvent(evt);
     const kind = stepKindForRunEvent(evt);
     const label = stepLabelForRunEvent(evt);
-    const step = { kind, status, label };
+    const details = stepDetailsForRunEvent(evt, label);
+    const step = { kind, status, label, ...(details.length > 0 ? { details } : {}) };
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 48051dd..fa82ecd 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -337,6 +337,61 @@ body {
   gap: 8px;
 }
 
+.step-body {
+  min-width: 0;
+  flex: 1;
+}
+
+.step-toggle {
+  width: 100%;
+  border: 0;
+  background: transparent;
+  padding: 0;
+  color: inherit;
+  cursor: pointer;
+  display: flex;
+  align-items: center;
+  justify-content: space-between;
+  gap: 10px;
+  text-align: left;
+}
+
+.step-item.collapsible .step-label {
+  white-space: nowrap;
+  overflow: hidden;
+  text-overflow: ellipsis;
+}
+
+.step-caret::before {
+  content: '›';
+  display: inline-block;
+  font-size: 14px;
+  color: var(--text-subtle);
+  transition: transform 0.15s ease;
+}
+
+.step-item.collapsible.expanded .step-caret::before {
+  transform: rotate(90deg);
+}
+
+.step-details {
+  list-style: none;
+  margin: 6px 0 0;
+  padding: 0 0 0 4px;
+  display: flex;
+  flex-direction: column;
+  gap: 4px;
+}
+
+.step-details li {
+  font-size: 11.5px;
+  color: var(--text-muted);
+  line-height: 1.4;
+  white-space: pre-wrap;
+  overflow-wrap: anywhere;
+  word-break: break-word;
+}
+
 .step-label {
   font-size: 11.5px;
   color: var(--text-muted);
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 84232e7..eedce97 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -14,6 +14,8 @@ const state = {
   auth: null,
   modelPresets: [{ value: null, label: 'Default' }],
   currentRunBySession: {},
+  expandedTimelineEntries: {},
+  transcriptHandlersBound: false,
   initialTabAttachInFlight: false,
   initialTabAttachStarted: false,
   editingSessionId: null,
@@ -405,6 +407,13 @@ function renderRunTimeline(run, fallbackText = '') {
   const timeline = normalizeRunTimeline(run, fallbackText);
   if (!timeline.length) return '';
   const latestStepIndex = getLatestInFlightTimelineStepIndex(run, timeline);
+  const getTimelineEntryKey = (entry, index) => {
+    const runId = String(run?.runId || 'run');
+    const kind = String(entry?.kind || '');
+    const status = String(entry?.status || '');
+    const label = String(entry?.label || '');
+    return `${runId}:${index}:${kind}:${status}:${label}`;
+  };
   return `
     <div class="run-timeline">
       ${timeline.map((entry, index) => {
@@ -415,10 +424,33 @@ function renderRunTimeline(run, fallbackText = '') {
     const icon = classifyRunStepIcon(entry);
     const isLatest = index === latestStepIndex;
     const shouldPulse = isLatest && status === 'running';
+    const details = Array.isArray(entry?.details) ? entry.details.filter(Boolean) : [];
+    const isCollapsible = details.length > 0;
     const classes = ['step-item', 'timeline-step', escapeHtml(status)];
     if (isLatest) classes.push('latest');
     if (shouldPulse) classes.push('pulse');
-    return `<div class="${classes.join(' ')}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="step-label">${renderInlineContent(entry.label || 'Step')}</span></div>`;
+    if (!isCollapsible) {
+      return `<div class="${classes.join(' ')}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="step-label">${renderInlineContent(entry.label || 'Step')}</span></div>`;
+    }
+    classes.push('collapsible');
+    const key = getTimelineEntryKey(entry, index);
+    const expanded = !!state.expandedTimelineEntries[key];
+    if (expanded) classes.push('expanded');
+    const detailsHtml = details
+      .map((line) => `<li>${renderInlineContent(line)}</li>`)
+      .join('');
+    return `
+      <div class="${classes.join(' ')}">
+        <span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span>
+        <div class="step-body">
+          <button type="button" class="step-toggle" data-step-key="${escapeHtml(key)}" aria-expanded="${expanded ? 'true' : 'false'}">
+            <span class="step-label">${renderInlineContent(entry.label || 'Step')}</span>
+            <span class="step-caret" aria-hidden="true"></span>
+          </button>
+          ${expanded ? `<ul class="step-details">${detailsHtml}</ul>` : ''}
+        </div>
+      </div>
+    `;
   }).join('')}
     </div>
   `;
@@ -429,10 +461,24 @@ function renderContent(value) {
 }
 
 function bindTranscriptHandlers() {
-  // Transcript rows are static render output; no delegated actions required.
+  if (state.transcriptHandlersBound) return;
+  transcriptEl.addEventListener('click', (event) => {
+    const toggleBtn = event.target.closest('button[data-step-key]');
+    if (!toggleBtn || !transcriptEl.contains(toggleBtn)) return;
+    const stepKey = toggleBtn.getAttribute('data-step-key');
+    if (!stepKey) return;
+    const nextExpanded = !state.expandedTimelineEntries[stepKey];
+    state.expandedTimelineEntries = {
+      ...state.expandedTimelineEntries,
+      [stepKey]: nextExpanded,
+    };
+    const scrollTop = transcriptEl.scrollTop;
+    renderTranscript({ preserveScrollTop: scrollTop });
+  });
+  state.transcriptHandlersBound = true;
 }
 
-function renderTranscript() {
+function renderTranscript({ preserveScrollTop = null } = {}) {
   const messages = getActiveMessages();
   const sessionId = state.value.activeSessionId;
   const sessionRunId = getSessionRunId(state.currentRunBySession, sessionId);
@@ -491,7 +537,11 @@ function renderTranscript() {
   }
 
   bindTranscriptHandlers();
-  transcriptEl.scrollTop = transcriptEl.scrollHeight;
+  if (Number.isFinite(preserveScrollTop)) {
+    transcriptEl.scrollTop = preserveScrollTop;
+  } else {
+    transcriptEl.scrollTop = transcriptEl.scrollHeight;
+  }
   syncStatusIndicator();
   syncComposerState();
 }
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index ed2622c..ee367ef 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -98,3 +98,9 @@ test('initial tab attach waits 2 seconds before attaching', () => {
   assert.match(fnBlock, /window\.setTimeout\(\(\)\s*=>\s*\{/);
   assert.match(fnBlock, /},\s*2000\)/);
 });
+
+test('tool-call timeline entries render collapsed toggle rows with click-to-expand details', () => {
+  assert.match(js, /data-step-key=/);
+  assert.match(js, /class="step-details"/);
+  assert.match(js, /closest\('button\[data-step-key\]'\)/);
+});
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index 3244f14..5661388 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -120,6 +120,30 @@ test('run.event is converted into a visible in-flight step', () => {
   assert.match(last.label, /Planning skill invocation/);
 });
 
+test('run.event captures detail lines for collapsible tool-call rendering', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, {
+    event: 'run.event',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: {
+      item: {
+        summary: 'Explored 2 files, 1 search',
+        text: 'Read chatd.js\nSearched for run.aborted\nRead sse-events.test.js',
+      },
+    },
+  });
+  const lastStep = s2.runs.r1.steps.at(-1);
+  const lastTimeline = s2.runs.r1.timeline.at(-1);
+  assert.equal(lastStep?.label, 'Explored 2 files, 1 search');
+  assert.deepEqual(lastStep?.details, [
+    'Read chatd.js',
+    'Searched for run.aborted',
+    'Read sse-events.test.js',
+  ]);
+  assert.deepEqual(lastTimeline?.details, lastStep?.details);
+});
+
 test('run.usage stores normalized usage for run and session', () => {
   const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
   const s2 = applyEvent(s1, {

From 807f8060ec4f7b4ce02f884f58afa5f3f38463c1 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:19:57 +0530
Subject: [PATCH 141/192] sidepanel: restyle composer shell and simplify
 controls

---
 extension/agent-panel.css               | 124 ++++++++++++++++--------
 extension/agent-panel.html              |   8 +-
 test/agent/agent-panel-contract.test.js |   9 ++
 3 files changed, 96 insertions(+), 45 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index fa82ecd..563f723 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -593,28 +593,53 @@ body {
 
 .composer-wrap {
   flex-shrink: 0;
-  background: #fff;
-  border-top: 1px solid var(--line);
-  padding: 10px 12px;
+  background: linear-gradient(180deg, rgba(18, 20, 25, 0) 0%, rgba(18, 20, 25, 0.9) 36%, rgba(18, 20, 25, 0.96) 100%);
+  border-top: 0;
+  padding: 12px;
   display: flex;
   flex-direction: column;
   gap: 6px;
 }
 
 .composer-box {
-  display: flex;
-  align-items: flex-end;
-  gap: 6px;
-  border: 1px solid var(--line);
-  border-radius: 12px;
-  background: var(--linen);
-  padding: 8px 8px 8px 12px;
-  transition: border-color 0.15s, box-shadow 0.15s;
+  display: grid;
+  grid-template-columns: minmax(0, 1fr) auto;
+  align-items: center;
+  gap: 10px;
+  border: 1px solid rgba(255, 255, 255, 0.1);
+  border-radius: 32px;
+  background: linear-gradient(180deg, #33363A 0%, #2A2D30 100%);
+  padding: 10px 12px;
+  min-height: 56px;
+  box-shadow: 0 12px 28px rgba(0, 0, 0, 0.35);
+  transition: border-color 0.16s, box-shadow 0.16s, min-height 0.16s, padding 0.16s;
 }
 
 .composer-box:focus-within {
-  border-color: var(--crail);
-  box-shadow: 0 0 0 3px rgba(193, 95, 60, 0.12);
+  border-color: rgba(255, 255, 255, 0.22);
+  box-shadow:
+    0 0 0 1px rgba(255, 255, 255, 0.14),
+    0 16px 34px rgba(0, 0, 0, 0.4);
+}
+
+.composer-box.is-multiline {
+  align-items: stretch;
+  grid-template-rows: minmax(0, 1fr) auto;
+  min-height: 138px;
+  padding-top: 14px;
+  padding-bottom: 12px;
+}
+
+.composer-box.is-multiline .composer-textarea {
+  grid-column: 1 / -1;
+  grid-row: 1;
+  margin: 0 2px 2px;
+}
+
+.composer-box.is-multiline .composer-actions {
+  grid-column: 2;
+  grid-row: 2;
+  align-self: end;
 }
 
 .composer-textarea {
@@ -623,30 +648,40 @@ body {
   background: transparent;
   border: 0;
   outline: none;
-  font-size: 13.5px;
+  font-size: 15px;
   font-family: inherit;
-  color: var(--text);
-  line-height: 1.55;
+  color: rgba(255, 255, 255, 0.95);
+  line-height: 1.38;
   min-height: 22px;
   max-height: 160px;
   overflow-y: auto;
+  padding: 2px 0;
 }
 
 .composer-textarea::placeholder {
-  color: var(--text-subtle);
+  color: rgba(255, 255, 255, 0.56);
+}
+
+.composer-textarea::-webkit-scrollbar {
+  width: 4px;
+}
+
+.composer-textarea::-webkit-scrollbar-thumb {
+  background: rgba(255, 255, 255, 0.2);
+  border-radius: 999px;
 }
 
 .composer-actions {
   display: flex;
-  align-items: center;
-  gap: 4px;
+  align-items: flex-end;
+  gap: 6px;
   flex-shrink: 0;
 }
 
 .context-usage-note {
   font-size: 10px;
   line-height: 1.35;
-  color: var(--text-subtle);
+  color: rgba(255, 255, 255, 0.42);
   padding: 0 4px;
   white-space: nowrap;
   overflow: hidden;
@@ -655,62 +690,69 @@ body {
 
 .btn-stop,
 .btn-send {
-  width: 32px;
-  height: 32px;
-  border-radius: 8px;
+  width: 44px;
+  height: 44px;
+  border-radius: 999px;
   border: 0;
   display: flex;
   align-items: center;
   justify-content: center;
-  transition: background 0.15s, opacity 0.15s, color 0.15s;
+  transition: background 0.15s, opacity 0.15s, color 0.15s, transform 0.15s;
 }
 
 .btn-stop {
-  background: transparent;
-  cursor: not-allowed;
-  color: var(--text-subtle);
-  opacity: 0.3;
+  background: rgba(255, 90, 90, 0.14);
+  cursor: pointer;
+  color: #FF5A5A;
+  opacity: 1;
 }
 
 .btn-stop.active {
-  cursor: pointer;
-  opacity: 1;
-  color: #EF4444;
+  background: rgba(255, 90, 90, 0.14);
 }
 
 .btn-stop.active:hover {
-  background: #FEF2F2;
+  background: rgba(255, 90, 90, 0.2);
+}
+
+.btn-stop:disabled {
+  opacity: 0.5;
+  cursor: not-allowed;
 }
 
 .btn-stop svg {
-  width: 15px;
-  height: 15px;
+  width: 20px;
+  height: 20px;
 }
 
 .btn-send {
-  background: var(--crail);
+  background: #2B78F6;
   color: #fff;
   cursor: pointer;
-  box-shadow: 0 2px 6px rgba(193, 95, 60, 0.3);
+  box-shadow: 0 8px 18px rgba(43, 120, 246, 0.45);
 }
 
 .btn-send:hover:not(:disabled) {
-  background: var(--crail-dark);
+  background: #2269D9;
+  transform: translateY(-1px);
 }
 
 .btn-send:active:not(:disabled) {
-  background: var(--crail-press);
+  background: #1A5CC7;
+  transform: translateY(0);
 }
 
 .btn-send:disabled {
-  opacity: 0.35;
+  background: rgba(255, 255, 255, 0.18);
+  color: rgba(255, 255, 255, 0.62);
+  opacity: 1;
   cursor: not-allowed;
   box-shadow: none;
 }
 
 .btn-send svg {
-  width: 13px;
-  height: 13px;
+  width: 18px;
+  height: 18px;
 }
 
 .popover-backdrop {
diff --git a/extension/agent-panel.html b/extension/agent-panel.html
index e2355a5..53a2344 100644
--- a/extension/agent-panel.html
+++ b/extension/agent-panel.html
@@ -45,10 +45,10 @@
       <div id="bf-transcript" class="transcript"></div>
       <form id="bf-chat-form" class="composer-wrap">
         <div class="composer-box">
-          <textarea id="bf-chat-input" class="composer-textarea" rows="1" placeholder="Message BrowserForce Agent"></textarea>
-        <div class="composer-actions">
-            <button id="bf-stop-run" type="button" class="btn-stop" aria-label="Stop" title="Stop" disabled>
-              <svg viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" aria-hidden="true">
+          <textarea id="bf-chat-input" class="composer-textarea" rows="1" placeholder="Ask anything"></textarea>
+          <div class="composer-actions">
+            <button id="bf-stop-run" type="button" class="btn-stop" aria-label="Stop" title="Stop" disabled hidden>
+              <svg class="icon-stop" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" aria-hidden="true">
                 <circle cx="12" cy="12" r="10"></circle>
                 <rect x="9" y="9" width="6" height="6" rx="1" fill="currentColor" stroke="none"></rect>
               </svg>
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index 2eef4ca..e81a647 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -37,3 +37,12 @@ test('agent panel keeps horizontal overflow contained in transcript cards', () =
   assert.match(css, /\.transcript[\s\S]*overflow-x:\s*hidden/);
   assert.match(css, /\.bubble-assistant code[\s\S]*overflow-wrap:\s*anywhere/);
 });
+
+test('agent panel composer matches compact/expanded shell structure', () => {
+  assert.doesNotMatch(html, /id="bf-attach-btn"/);
+  assert.doesNotMatch(html, /icon-mic/);
+  assert.match(html, /id="bf-stop-run"[\s\S]*icon-stop/);
+  assert.match(html, /id="bf-send-btn"/);
+  assert.match(css, /\.composer-box\.is-multiline/);
+  assert.match(css, /\.btn-send[\s\S]*border-radius:\s*999px/);
+});

From 63a64ba9c016ac819fd5c37d374895d18bc4ddb6 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:20:06 +0530
Subject: [PATCH 142/192] sidepanel: toggle multiline state and swap send/stop
 visibility

---
 extension/agent-panel.js                     | 14 ++++++++++++++
 test/agent/agent-panel-send-contract.test.js | 12 ++++++++++++
 2 files changed, 26 insertions(+)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index eedce97..4ee34f5 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -46,6 +46,7 @@ const modelListEl = document.getElementById('bf-model-list');
 const switchSessionListEl = document.getElementById('bf-switch-session-list');
 const transcriptEl = document.getElementById('bf-transcript');
 const chatFormEl = document.getElementById('bf-chat-form');
+const composerBoxEl = chatFormEl.querySelector('.composer-box');
 const chatInputEl = document.getElementById('bf-chat-input');
 const stopRunBtn = document.getElementById('bf-stop-run');
 const sendBtn = chatFormEl.querySelector('button[type="submit"]');
@@ -81,6 +82,12 @@ function autoResizeInput() {
   chatInputEl.style.height = `${Math.min(chatInputEl.scrollHeight, 160)}px`;
 }
 
+function syncComposerLayoutState() {
+  const lineHeight = Number.parseFloat(window.getComputedStyle(chatInputEl).lineHeight) || 21;
+  const isMultiline = chatInputEl.scrollHeight > (lineHeight * 1.6);
+  composerBoxEl.classList.toggle('is-multiline', isMultiline);
+}
+
 function syncComposerState() {
   const enabled = !chatInputEl.disabled;
   const hasText = chatInputEl.value.trim().length > 0;
@@ -88,7 +95,9 @@ function syncComposerState() {
 
   stopRunBtn.disabled = !enabled || !runInProgress;
   stopRunBtn.classList.toggle('active', enabled && runInProgress);
+  stopRunBtn.hidden = !runInProgress;
   sendBtn.disabled = !enabled || runInProgress || !hasText;
+  sendBtn.hidden = runInProgress;
 }
 
 function syncStatusIndicator() {
@@ -133,6 +142,7 @@ function setStatus(kind, text) {
 function setComposerEnabled(enabled) {
   chatInputEl.disabled = !enabled;
   autoResizeInput();
+  syncComposerLayoutState();
   syncComposerState();
 }
 
@@ -1062,9 +1072,12 @@ chatFormEl.addEventListener('submit', async (event) => {
     await sendMessage(text);
     chatInputEl.value = '';
     autoResizeInput();
+    syncComposerLayoutState();
     syncComposerState();
   } catch (error) {
     chatInputEl.value = text;
+    autoResizeInput();
+    syncComposerLayoutState();
     syncComposerState();
     setStatus('error', error?.message || 'Failed to send message');
   }
@@ -1096,6 +1109,7 @@ if (attachCurrentTabBtn) {
 
 chatInputEl.addEventListener('input', () => {
   autoResizeInput();
+  syncComposerLayoutState();
   syncComposerState();
 });
 
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index ee367ef..1bc5ec1 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -104,3 +104,15 @@ test('tool-call timeline entries render collapsed toggle rows with click-to-expa
   assert.match(js, /class="step-details"/);
   assert.match(js, /closest\('button\[data-step-key\]'\)/);
 });
+
+test('composer toggles single-line and multiline visual state from textarea height', () => {
+  assert.match(js, /const composerBoxEl = chatFormEl\.querySelector\('\.composer-box'\)/);
+  assert.match(js, /function syncComposerLayoutState\(\)/);
+  assert.match(js, /composerBoxEl\.classList\.toggle\('is-multiline', isMultiline\)/);
+  assert.match(js, /autoResizeInput\(\);[\s\S]*syncComposerLayoutState\(\);/);
+});
+
+test('send and stop buttons are mutually exclusive based on run state', () => {
+  assert.match(js, /stopRunBtn\.hidden\s*=\s*!runInProgress/);
+  assert.match(js, /sendBtn\.hidden\s*=\s*runInProgress/);
+});

From c50858001f93fbfa1ce8deefb21cf1ab722c66b9 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:44:27 +0530
Subject: [PATCH 143/192] sidepanel: prevent empty input from triggering
 multiline composer

---
 extension/agent-panel.js | 9 +++++++--
 1 file changed, 7 insertions(+), 2 deletions(-)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 4ee34f5..51a9957 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -83,8 +83,13 @@ function autoResizeInput() {
 }
 
 function syncComposerLayoutState() {
-  const lineHeight = Number.parseFloat(window.getComputedStyle(chatInputEl).lineHeight) || 21;
-  const isMultiline = chatInputEl.scrollHeight > (lineHeight * 1.6);
+  const styles = window.getComputedStyle(chatInputEl);
+  const lineHeight = Number.parseFloat(styles.lineHeight) || 21;
+  const paddingTop = Number.parseFloat(styles.paddingTop) || 0;
+  const paddingBottom = Number.parseFloat(styles.paddingBottom) || 0;
+  const singleLineHeight = lineHeight + paddingTop + paddingBottom;
+  const hasContent = chatInputEl.value.trim().length > 0;
+  const isMultiline = hasContent && chatInputEl.scrollHeight > (singleLineHeight + 6);
   composerBoxEl.classList.toggle('is-multiline', isMultiline);
 }
 

From 82fff68a1ae57b25f4c552770c7b0bffa0675be7 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:44:31 +0530
Subject: [PATCH 144/192] sidepanel: shrink composer and switch to light blue
 style

---
 extension/agent-panel.css | 72 +++++++++++++++++++--------------------
 1 file changed, 35 insertions(+), 37 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 563f723..00143cd 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -593,47 +593,45 @@ body {
 
 .composer-wrap {
   flex-shrink: 0;
-  background: linear-gradient(180deg, rgba(18, 20, 25, 0) 0%, rgba(18, 20, 25, 0.9) 36%, rgba(18, 20, 25, 0.96) 100%);
-  border-top: 0;
-  padding: 12px;
+  background: #f3f5f8;
+  border-top: 1px solid #d8dee6;
+  padding: 8px 12px;
   display: flex;
   flex-direction: column;
-  gap: 6px;
+  gap: 4px;
 }
 
 .composer-box {
   display: grid;
   grid-template-columns: minmax(0, 1fr) auto;
   align-items: center;
-  gap: 10px;
-  border: 1px solid rgba(255, 255, 255, 0.1);
-  border-radius: 32px;
-  background: linear-gradient(180deg, #33363A 0%, #2A2D30 100%);
-  padding: 10px 12px;
-  min-height: 56px;
-  box-shadow: 0 12px 28px rgba(0, 0, 0, 0.35);
+  gap: 8px;
+  border: 1px solid #8ea8c2;
+  border-radius: 28px;
+  background: linear-gradient(90deg, #6e93b7 0%, #759ec1 100%);
+  padding: 6px 10px;
+  min-height: 46px;
+  box-shadow: 0 2px 8px rgba(25, 46, 68, 0.16);
   transition: border-color 0.16s, box-shadow 0.16s, min-height 0.16s, padding 0.16s;
 }
 
 .composer-box:focus-within {
-  border-color: rgba(255, 255, 255, 0.22);
-  box-shadow:
-    0 0 0 1px rgba(255, 255, 255, 0.14),
-    0 16px 34px rgba(0, 0, 0, 0.4);
+  border-color: #6f8eac;
+  box-shadow: 0 0 0 2px rgba(111, 142, 172, 0.28);
 }
 
 .composer-box.is-multiline {
   align-items: stretch;
   grid-template-rows: minmax(0, 1fr) auto;
-  min-height: 138px;
-  padding-top: 14px;
-  padding-bottom: 12px;
+  min-height: 68px;
+  padding-top: 8px;
+  padding-bottom: 8px;
 }
 
 .composer-box.is-multiline .composer-textarea {
   grid-column: 1 / -1;
   grid-row: 1;
-  margin: 0 2px 2px;
+  margin: 0;
 }
 
 .composer-box.is-multiline .composer-actions {
@@ -650,16 +648,16 @@ body {
   outline: none;
   font-size: 15px;
   font-family: inherit;
-  color: rgba(255, 255, 255, 0.95);
+  color: #f1f7ff;
   line-height: 1.38;
-  min-height: 22px;
-  max-height: 160px;
+  min-height: 20px;
+  max-height: 84px;
   overflow-y: auto;
-  padding: 2px 0;
+  padding: 0;
 }
 
 .composer-textarea::placeholder {
-  color: rgba(255, 255, 255, 0.56);
+  color: rgba(228, 240, 252, 0.74);
 }
 
 .composer-textarea::-webkit-scrollbar {
@@ -667,7 +665,7 @@ body {
 }
 
 .composer-textarea::-webkit-scrollbar-thumb {
-  background: rgba(255, 255, 255, 0.2);
+  background: rgba(241, 247, 255, 0.4);
   border-radius: 999px;
 }
 
@@ -681,7 +679,7 @@ body {
 .context-usage-note {
   font-size: 10px;
   line-height: 1.35;
-  color: rgba(255, 255, 255, 0.42);
+  color: #6f7f92;
   padding: 0 4px;
   white-space: nowrap;
   overflow: hidden;
@@ -690,8 +688,8 @@ body {
 
 .btn-stop,
 .btn-send {
-  width: 44px;
-  height: 44px;
+  width: 36px;
+  height: 36px;
   border-radius: 999px;
   border: 0;
   display: flex;
@@ -701,7 +699,7 @@ body {
 }
 
 .btn-stop {
-  background: rgba(255, 90, 90, 0.14);
+  background: rgba(255, 90, 90, 0.18);
   cursor: pointer;
   color: #FF5A5A;
   opacity: 1;
@@ -712,7 +710,7 @@ body {
 }
 
 .btn-stop.active:hover {
-  background: rgba(255, 90, 90, 0.2);
+  background: rgba(255, 90, 90, 0.28);
 }
 
 .btn-stop:disabled {
@@ -721,15 +719,15 @@ body {
 }
 
 .btn-stop svg {
-  width: 20px;
-  height: 20px;
+  width: 16px;
+  height: 16px;
 }
 
 .btn-send {
   background: #2B78F6;
   color: #fff;
   cursor: pointer;
-  box-shadow: 0 8px 18px rgba(43, 120, 246, 0.45);
+  box-shadow: 0 3px 8px rgba(43, 120, 246, 0.35);
 }
 
 .btn-send:hover:not(:disabled) {
@@ -743,16 +741,16 @@ body {
 }
 
 .btn-send:disabled {
-  background: rgba(255, 255, 255, 0.18);
-  color: rgba(255, 255, 255, 0.62);
+  background: rgba(214, 226, 239, 0.82);
+  color: #8da2b9;
   opacity: 1;
   cursor: not-allowed;
   box-shadow: none;
 }
 
 .btn-send svg {
-  width: 18px;
-  height: 18px;
+  width: 15px;
+  height: 15px;
 }
 
 .popover-backdrop {

From 1ecc42fd085fccbc4811252f4627fd17a5d2a5d0 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:54:57 +0530
Subject: [PATCH 145/192] sidepanel: reconcile stale run state to hide stop
 when stream ends

---
 extension/agent-panel.js                     | 16 ++++++++++++++++
 test/agent/agent-panel-send-contract.test.js |  7 +++++++
 2 files changed, 23 insertions(+)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 51a9957..4e0e5e2 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -77,6 +77,18 @@ function isActiveRunInProgress() {
   return !!(run && !run.done);
 }
 
+function reconcileSessionRunState(sessionId) {
+  if (!sessionId) return false;
+  const runId = getSessionRunId(state.currentRunBySession, sessionId);
+  if (!runId) return false;
+  const run = state.value.runs[runId] || null;
+  if (!run || run.done) {
+    state.currentRunBySession = clearSessionRunId(state.currentRunBySession, sessionId, runId);
+    return true;
+  }
+  return false;
+}
+
 function autoResizeInput() {
   chatInputEl.style.height = 'auto';
   chatInputEl.style.height = `${Math.min(chatInputEl.scrollHeight, 160)}px`;
@@ -854,6 +866,9 @@ async function loadMessages(sessionId) {
   await ensureOk(res, 'Failed to load messages');
   const body = await readJsonOrEmpty(res);
   dispatch({ type: 'messages.loaded', sessionId, messages: body.messages || [] });
+  if (reconcileSessionRunState(sessionId)) {
+    render();
+  }
 }
 
 async function loadSessionMetadata(sessionId) {
@@ -1020,6 +1035,7 @@ function connectEvents(sessionId) {
         }
 
         backoffMs = 250;
+        await loadMessages(sessionId).catch(() => {});
         await consumeEventStream(response.body, loopToken);
       } catch {
         if (controller.signal.aborted || state.eventLoopToken !== loopToken) break;
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 1bc5ec1..885818d 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -116,3 +116,10 @@ test('send and stop buttons are mutually exclusive based on run state', () => {
   assert.match(js, /stopRunBtn\.hidden\s*=\s*!runInProgress/);
   assert.match(js, /sendBtn\.hidden\s*=\s*runInProgress/);
 });
+
+test('stale run pointer is reconciled from loaded messages so stop does not stay visible forever', () => {
+  assert.match(js, /function reconcileSessionRunState\(sessionId\)/);
+  assert.match(js, /if \(!run \|\| run\.done\)/);
+  assert.match(js, /state\.currentRunBySession = clearSessionRunId\(state\.currentRunBySession, sessionId, runId\)/);
+  assert.match(js, /async function loadMessages\(sessionId\)[\s\S]*reconcileSessionRunState\(sessionId\)/);
+});

From 92916bfa43e044ec7fdfbb7127984e13af072884 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 18:55:00 +0530
Subject: [PATCH 146/192] sidepanel: align composer section with brand light
 palette

---
 extension/agent-panel.css | 78 +++++++++++++++++++--------------------
 1 file changed, 39 insertions(+), 39 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 00143cd..2a9bd6f 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -593,39 +593,39 @@ body {
 
 .composer-wrap {
   flex-shrink: 0;
-  background: #f3f5f8;
-  border-top: 1px solid #d8dee6;
-  padding: 8px 12px;
+  background: var(--pampas);
+  border-top: 1px solid var(--line);
+  padding: 6px 12px;
   display: flex;
   flex-direction: column;
-  gap: 4px;
+  gap: 3px;
 }
 
 .composer-box {
   display: grid;
   grid-template-columns: minmax(0, 1fr) auto;
   align-items: center;
-  gap: 8px;
-  border: 1px solid #8ea8c2;
-  border-radius: 28px;
-  background: linear-gradient(90deg, #6e93b7 0%, #759ec1 100%);
-  padding: 6px 10px;
-  min-height: 46px;
-  box-shadow: 0 2px 8px rgba(25, 46, 68, 0.16);
+  gap: 6px;
+  border: 1px solid var(--line);
+  border-radius: 20px;
+  background: var(--linen);
+  padding: 4px 8px;
+  min-height: 38px;
+  box-shadow: none;
   transition: border-color 0.16s, box-shadow 0.16s, min-height 0.16s, padding 0.16s;
 }
 
 .composer-box:focus-within {
-  border-color: #6f8eac;
-  box-shadow: 0 0 0 2px rgba(111, 142, 172, 0.28);
+  border-color: var(--crail);
+  box-shadow: 0 0 0 2px rgba(193, 95, 60, 0.15);
 }
 
 .composer-box.is-multiline {
   align-items: stretch;
   grid-template-rows: minmax(0, 1fr) auto;
-  min-height: 68px;
-  padding-top: 8px;
-  padding-bottom: 8px;
+  min-height: 54px;
+  padding-top: 6px;
+  padding-bottom: 6px;
 }
 
 .composer-box.is-multiline .composer-textarea {
@@ -646,18 +646,18 @@ body {
   background: transparent;
   border: 0;
   outline: none;
-  font-size: 15px;
+  font-size: 14px;
   font-family: inherit;
-  color: #f1f7ff;
-  line-height: 1.38;
-  min-height: 20px;
-  max-height: 84px;
+  color: var(--text);
+  line-height: 1.35;
+  min-height: 18px;
+  max-height: 56px;
   overflow-y: auto;
   padding: 0;
 }
 
 .composer-textarea::placeholder {
-  color: rgba(228, 240, 252, 0.74);
+  color: var(--text-subtle);
 }
 
 .composer-textarea::-webkit-scrollbar {
@@ -665,7 +665,7 @@ body {
 }
 
 .composer-textarea::-webkit-scrollbar-thumb {
-  background: rgba(241, 247, 255, 0.4);
+  background: var(--line);
   border-radius: 999px;
 }
 
@@ -679,7 +679,7 @@ body {
 .context-usage-note {
   font-size: 10px;
   line-height: 1.35;
-  color: #6f7f92;
+  color: var(--text-subtle);
   padding: 0 4px;
   white-space: nowrap;
   overflow: hidden;
@@ -688,8 +688,8 @@ body {
 
 .btn-stop,
 .btn-send {
-  width: 36px;
-  height: 36px;
+  width: 28px;
+  height: 28px;
   border-radius: 999px;
   border: 0;
   display: flex;
@@ -699,9 +699,9 @@ body {
 }
 
 .btn-stop {
-  background: rgba(255, 90, 90, 0.18);
+  background: rgba(178, 83, 52, 0.14);
   cursor: pointer;
-  color: #FF5A5A;
+  color: var(--error);
   opacity: 1;
 }
 
@@ -710,7 +710,7 @@ body {
 }
 
 .btn-stop.active:hover {
-  background: rgba(255, 90, 90, 0.28);
+  background: rgba(178, 83, 52, 0.2);
 }
 
 .btn-stop:disabled {
@@ -719,38 +719,38 @@ body {
 }
 
 .btn-stop svg {
-  width: 16px;
-  height: 16px;
+  width: 12px;
+  height: 12px;
 }
 
 .btn-send {
-  background: #2B78F6;
+  background: var(--crail);
   color: #fff;
   cursor: pointer;
-  box-shadow: 0 3px 8px rgba(43, 120, 246, 0.35);
+  box-shadow: none;
 }
 
 .btn-send:hover:not(:disabled) {
-  background: #2269D9;
+  background: var(--crail-dark);
   transform: translateY(-1px);
 }
 
 .btn-send:active:not(:disabled) {
-  background: #1A5CC7;
+  background: var(--crail-press);
   transform: translateY(0);
 }
 
 .btn-send:disabled {
-  background: rgba(214, 226, 239, 0.82);
-  color: #8da2b9;
+  background: #d9d4cb;
+  color: #9d9487;
   opacity: 1;
   cursor: not-allowed;
   box-shadow: none;
 }
 
 .btn-send svg {
-  width: 15px;
-  height: 15px;
+  width: 12px;
+  height: 12px;
 }
 
 .popover-backdrop {

From c420a81bdb0921e9052b1639617e7197fc2f419d Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Tue, 3 Mar 2026 20:02:47 +0530
Subject: [PATCH 147/192] chore(release): bump versions to 1.0.18

---
 extension/manifest.json | 2 +-
 mcp/package.json        | 2 +-
 package.json            | 2 +-
 3 files changed, 3 insertions(+), 3 deletions(-)

diff --git a/extension/manifest.json b/extension/manifest.json
index 2730d36..766c2c3 100644
--- a/extension/manifest.json
+++ b/extension/manifest.json
@@ -1,7 +1,7 @@
 {
   "manifest_version": 3,
   "name": "BrowserForce",
-  "version": "1.0.0",
+  "version": "1.0.18",
   "description": "Give AI agents your real Chrome browser — your logins, cookies, and tabs. Works with OpenClaw, Claude, and any MCP agent.",
   "permissions": [
     "debugger",
diff --git a/mcp/package.json b/mcp/package.json
index 63b727e..c6f4d6e 100644
--- a/mcp/package.json
+++ b/mcp/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce-mcp",
-  "version": "1.0.0",
+  "version": "1.0.18",
   "private": true,
   "type": "module",
   "description": "MCP server exposing Chrome browser control via BrowserForce",
diff --git a/package.json b/package.json
index 1efaefd..cc56975 100644
--- a/package.json
+++ b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "browserforce",
-  "version": "1.0.17",
+  "version": "1.0.18",
   "type": "module",
   "description": "Give AI agents your real Chrome browser with progressive examples: simple reads, form interactions, multi-tab workflows, and state persistence. Search X and GitHub, extract ProductHunt data, test forms, compare A/B variants, monitor status pages. Works with OpenClaw, Claude, and any MCP agent.",
   "homepage": "https://github.com/ivalsaraj/browserforce",

From 23ed580517a971ce4fb6a17083519a20c31435dd Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 09:57:59 +0530
Subject: [PATCH 148/192] fix(sidepanel): refresh tab-attach banner after async
 auto-attach

---
 extension/agent-panel.js                     | 1 +
 test/agent/agent-panel-send-contract.test.js | 7 +++++++
 2 files changed, 8 insertions(+)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 4e0e5e2..773a3e4 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -753,6 +753,7 @@ function startInitialTabAttach() {
       .finally(() => {
         state.initialTabAttachInFlight = false;
         renderContextUsageChip();
+        scheduleTabAttachRefresh(0);
       });
   }, 2000);
 }
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 885818d..2c062db 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -99,6 +99,13 @@ test('initial tab attach waits 2 seconds before attaching', () => {
   assert.match(fnBlock, /},\s*2000\)/);
 });
 
+test('initial async attach always refreshes banner state after completion', () => {
+  const fnMatch = js.match(/function startInitialTabAttach\(\)[\s\S]*?\n}\n\nasync function getActiveTabContext/);
+  assert.ok(fnMatch, 'startInitialTabAttach function block should be present');
+  const fnBlock = fnMatch[0];
+  assert.match(fnBlock, /\.finally\(\(\)\s*=>\s*\{[\s\S]*scheduleTabAttachRefresh\(0\);[\s\S]*\}\)/);
+});
+
 test('tool-call timeline entries render collapsed toggle rows with click-to-expand details', () => {
   assert.match(js, /data-step-key=/);
   assert.match(js, /class="step-details"/);

From 9e3f43c552fa167eb8fc636bbaace36a7707dae4 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 09:58:27 +0530
Subject: [PATCH 149/192] fix(sidepanel): use light-theme tokens for tab-attach
 banner

---
 extension/agent-panel.css               |  8 ++++----
 test/agent/agent-panel-contract.test.js | 11 +++++++++++
 2 files changed, 15 insertions(+), 4 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 2a9bd6f..e7c14a1 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -154,15 +154,15 @@ body {
   align-items: center;
   justify-content: space-between;
   gap: 8px;
-  background: var(--card-bg);
-  color: var(--muted);
+  background: var(--linen);
+  color: var(--text-muted);
   font-size: 12px;
 }
 
 .tab-attach-btn {
   border: 1px solid var(--line);
-  background: var(--accent-soft);
-  color: var(--accent-soft-text);
+  background: var(--crail-soft);
+  color: var(--crail-dark);
   border-radius: 8px;
   padding: 4px 8px;
   font-size: 12px;
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index e81a647..057b852 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -38,6 +38,17 @@ test('agent panel keeps horizontal overflow contained in transcript cards', () =
   assert.match(css, /\.bubble-assistant code[\s\S]*overflow-wrap:\s*anywhere/);
 });
 
+test('tab attach banner uses defined light-theme tokens only', () => {
+  assert.match(css, /\.tab-attach[\s\S]*background:\s*var\(--linen\)/);
+  assert.match(css, /\.tab-attach[\s\S]*color:\s*var\(--text-muted\)/);
+  assert.match(css, /\.tab-attach-btn[\s\S]*background:\s*var\(--crail-soft\)/);
+  assert.match(css, /\.tab-attach-btn[\s\S]*color:\s*var\(--crail-dark\)/);
+  assert.doesNotMatch(css, /var\(--card-bg\)/);
+  assert.doesNotMatch(css, /var\(--muted\)/);
+  assert.doesNotMatch(css, /var\(--accent-soft\)/);
+  assert.doesNotMatch(css, /var\(--accent-soft-text\)/);
+});
+
 test('agent panel composer matches compact/expanded shell structure', () => {
   assert.doesNotMatch(html, /id="bf-attach-btn"/);
   assert.doesNotMatch(html, /icon-mic/);

From a75a5925190779a86f72bda907c7c9a9a6b74ae5 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 10:03:24 +0530
Subject: [PATCH 150/192] fix(agent): tighten MCP failure guidance and hidden
 composer button styles

---
 agent/src/chatd.js                      | 3 +++
 extension/agent-panel.css               | 4 ++++
 test/agent/agent-panel-contract.test.js | 4 ++++
 test/agent/chatd-api.test.js            | 2 ++
 4 files changed, 13 insertions(+)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index a2290ff..08b3bc9 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -525,6 +525,9 @@ function buildRunPrompt({ message, browserContext }) {
   lines.push('When the user asks what you can see, asks about this page/tab, or requests a summary of the current page, inspect the active page and answer directly.');
   lines.push('Use BrowserForce browser tools to read the current page content before replying in these cases.');
   lines.push('Do not ask for permission to inspect, and do not say you only have tab metadata.');
+  lines.push('If BrowserForce MCP, relay, or browser tool calls fail, state the exact error message and stop.');
+  lines.push('Do not infer page contents from title/URL/tab metadata, cached logs, or web search when live inspection fails.');
+  lines.push('After reporting the error, provide one concrete recovery action focused on MCP/relay health.');
   lines.push('If the request is still ambiguous after inspecting, ask one focused clarifying question.');
   lines.push('');
   lines.push(`User request: ${message}`);
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index e7c14a1..0572f4d 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -676,6 +676,10 @@ body {
   flex-shrink: 0;
 }
 
+.composer-actions button[hidden] {
+  display: none;
+}
+
 .context-usage-note {
   font-size: 10px;
   line-height: 1.35;
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index 057b852..f0d8a64 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -57,3 +57,7 @@ test('agent panel composer matches compact/expanded shell structure', () => {
   assert.match(css, /\.composer-box\.is-multiline/);
   assert.match(css, /\.btn-send[\s\S]*border-radius:\s*999px/);
 });
+
+test('composer action buttons respect hidden attribute for send/stop swapping', () => {
+  assert.match(css, /\.composer-actions button\[hidden\][\s\S]*display:\s*none/);
+});
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 3398e5c..3997364 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -429,6 +429,8 @@ test('POST /v1/runs includes active tab context in runExecutor prompt', async ()
     assert.match(prompt, /Active tab URL: https:\/\/example\.com\/pricing/);
     assert.match(prompt, /inspect the active page and answer directly/i);
     assert.match(prompt, /do not ask for permission to inspect/i);
+    assert.match(prompt, /state the exact error message/i);
+    assert.match(prompt, /do not infer page contents from title\/url\/tab metadata/i);
     assert.match(prompt, /User request:\s*summarize this page/i);
   } finally {
     await daemon.stop();

From 29ec6c01421d6adb3b79db73873c36ae5ca1d66d Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 12:35:02 +0530
Subject: [PATCH 151/192] Fix logs viewer relay auth in extension options page

---
 extension/options.js              | 3 +++
 test/agent/popup-contract.test.js | 6 ++++++
 2 files changed, 9 insertions(+)

diff --git a/extension/options.js b/extension/options.js
index 4a2688c..f8288e7 100644
--- a/extension/options.js
+++ b/extension/options.js
@@ -148,9 +148,12 @@ async function pollOnce() {
 }
 
 async function fetchJson(pathname) {
+  const extensionId = chrome?.runtime?.id;
+  const headers = extensionId ? { 'x-browserforce-extension-id': extensionId } : {};
   const response = await fetch(`${state.relayHttpBase}${pathname}`, {
     method: 'GET',
     cache: 'no-store',
+    headers,
   });
 
   if (!response.ok) {
diff --git a/test/agent/popup-contract.test.js b/test/agent/popup-contract.test.js
index 3995a17..7530b38 100644
--- a/test/agent/popup-contract.test.js
+++ b/test/agent/popup-contract.test.js
@@ -3,7 +3,13 @@ import test from 'node:test';
 import assert from 'node:assert/strict';
 
 const html = fs.readFileSync('extension/popup.html', 'utf8');
+const optionsJs = fs.readFileSync('extension/options.js', 'utf8');
 
 test('popup includes Open BrowserForce Agent button', () => {
   assert.match(html, /Open BrowserForce Agent/);
 });
+
+test('logs viewer requests include extension identity header', () => {
+  assert.match(optionsJs, /chrome\?\.runtime\?\.id/);
+  assert.match(optionsJs, /'x-browserforce-extension-id'/);
+});

From eb7ce1f5adea021f59d149f6ca35578c66bda3e1 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 12:46:55 +0530
Subject: [PATCH 152/192] agent-panel: collapse tool lifecycle rows and parse
 modern codex events

---
 agent/src/chatd.js                  | 202 ++++++++++++++++++++++++--
 agent/src/codex-runner.js           | 217 ++++++++++++++++++++++++++++
 agent/src/session-store.js          |  70 ++++++++-
 extension/agent-panel-state.js      | 215 +++++++++++++++++++++++++--
 extension/agent-panel.css           |   6 +-
 extension/agent-panel.js            |   2 +
 test/agent/codex-runner.test.js     |  67 +++++++++
 test/agent/session-store.test.js    |  23 +++
 test/agent/session-ui-state.test.js |  30 ++++
 test/agent/sse-events.test.js       |  48 +++++-
 10 files changed, 846 insertions(+), 34 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 08b3bc9..e652241 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -300,14 +300,43 @@ function trimStepLabel(label) {
   return text.length > 160 ? `${text.slice(0, 157)}...` : text;
 }
 
+function trimStepKey(key) {
+  const text = String(key || '').trim();
+  if (!text) return '';
+  return text.length > 220 ? text.slice(0, 220) : text;
+}
+
+function normalizeStepStatus(status) {
+  const normalized = String(status || '').trim().toLowerCase();
+  if (!normalized) return 'running';
+  if (normalized === 'completed' || normalized === 'success' || normalized === 'succeeded') return 'done';
+  return normalized;
+}
+
+function isTerminalStepStatus(status) {
+  const normalized = normalizeStepStatus(status);
+  return normalized === 'done' || normalized === 'failed' || normalized === 'aborted';
+}
+
+function isGenericToolLabel(label) {
+  const normalized = String(label || '').trim().toLowerCase();
+  return normalized === 'tool call started' || normalized === 'tool call completed' || normalized === 'working...';
+}
+
+function detailsEqual(a, b) {
+  return JSON.stringify(a || []) === JSON.stringify(b || []);
+}
+
 function normalizeRunStep(step) {
   if (!step || typeof step !== 'object') return null;
   const label = trimStepLabel(step.label);
   if (!label) return null;
   return {
     kind: String(step?.kind || '').trim() || 'reasoning',
-    status: String(step?.status || '').trim() || 'running',
+    status: normalizeStepStatus(step?.status),
     label,
+    ...(trimStepKey(step.key) ? { key: trimStepKey(step.key) } : {}),
+    ...(Array.isArray(step.details) && step.details.length > 0 ? { details: step.details } : {}),
   };
 }
 
@@ -316,8 +345,61 @@ function pushRunStep(run, step) {
   const steps = Array.isArray(run.steps) ? run.steps : [];
   const normalized = normalizeRunStep(step);
   if (!normalized || !normalized.label) return;
+  const keyedIndex = normalized.key
+    ? (() => {
+      for (let idx = steps.length - 1; idx >= 0; idx -= 1) {
+        if (steps[idx]?.key === normalized.key) return idx;
+      }
+      return -1;
+    })()
+    : -1;
+  if (keyedIndex >= 0) {
+    const existing = steps[keyedIndex];
+    steps[keyedIndex] = {
+      ...existing,
+      ...normalized,
+      label: (isGenericToolLabel(normalized.label) && existing?.label) ? existing.label : normalized.label,
+      details: normalized.details && normalized.details.length > 0 ? normalized.details : existing?.details,
+    };
+    run.steps = steps;
+    return;
+  }
+
+  if (!normalized.key && isTerminalStepStatus(normalized.status)) {
+    let fallbackIndex = -1;
+    for (let idx = steps.length - 1; idx >= 0; idx -= 1) {
+      const entry = steps[idx];
+      if (
+        entry
+        && !entry.key
+        && String(entry.kind || '') === normalized.kind
+        && String(entry.label || '') === normalized.label
+        && !isTerminalStepStatus(entry.status)
+      ) {
+        fallbackIndex = idx;
+        break;
+      }
+    }
+    if (fallbackIndex >= 0) {
+      const existing = steps[fallbackIndex];
+      steps[fallbackIndex] = {
+        ...existing,
+        ...normalized,
+        details: normalized.details && normalized.details.length > 0 ? normalized.details : existing?.details,
+      };
+      run.steps = steps;
+      return;
+    }
+  }
+
   const last = steps[steps.length - 1];
-  if (last && last.label === normalized.label && last.kind === normalized.kind && last.status === normalized.status) {
+  if (
+    last
+    && last.label === normalized.label
+    && last.kind === normalized.kind
+    && last.status === normalized.status
+    && detailsEqual(last.details, normalized.details)
+  ) {
     return;
   }
   steps.push(normalized);
@@ -341,17 +423,64 @@ function pushRunTimelineEntry(run, entry) {
     const normalized = normalizeRunStep(entry);
     if (!normalized) return;
     const next = { type: 'step', ...normalized };
-    const last = timeline[timeline.length - 1];
-    if (
-      last
-      && last.type === 'step'
-      && last.label === next.label
-      && last.kind === next.kind
-      && last.status === next.status
-    ) {
-      return;
+    const keyedIndex = next.key
+      ? (() => {
+        for (let idx = timeline.length - 1; idx >= 0; idx -= 1) {
+          const item = timeline[idx];
+          if (item?.type === 'step' && item.key === next.key) return idx;
+        }
+        return -1;
+      })()
+      : -1;
+    if (keyedIndex >= 0) {
+      const existing = timeline[keyedIndex];
+      timeline[keyedIndex] = {
+        ...existing,
+        ...next,
+        label: (isGenericToolLabel(next.label) && existing?.label) ? existing.label : next.label,
+        details: next.details && next.details.length > 0 ? next.details : existing?.details,
+      };
+    } else {
+      if (!next.key && isTerminalStepStatus(next.status)) {
+        let fallbackIndex = -1;
+        for (let idx = timeline.length - 1; idx >= 0; idx -= 1) {
+          const item = timeline[idx];
+          if (
+            item
+            && item.type === 'step'
+            && !item.key
+            && String(item.kind || '') === next.kind
+            && String(item.label || '') === next.label
+            && !isTerminalStepStatus(item.status)
+          ) {
+            fallbackIndex = idx;
+            break;
+          }
+        }
+        if (fallbackIndex >= 0) {
+          const existing = timeline[fallbackIndex];
+          timeline[fallbackIndex] = {
+            ...existing,
+            ...next,
+            details: next.details && next.details.length > 0 ? next.details : existing?.details,
+          };
+          run.timeline = timeline;
+          return;
+        }
+      }
+      const last = timeline[timeline.length - 1];
+      if (
+        last
+        && last.type === 'step'
+        && last.label === next.label
+        && last.kind === next.kind
+        && last.status === next.status
+        && detailsEqual(last.details, next.details)
+      ) {
+        return;
+      }
+      timeline.push(next);
     }
-    timeline.push(next);
   } else {
     return;
   }
@@ -387,20 +516,20 @@ function stepLabelForToolEvent(evt) {
   const payload = evt?.payload || {};
   if (evt.event === 'tool.started') {
     return firstString([
+      payload.command,
       payload.title,
       payload.name,
       payload.tool,
       payload.toolName,
-      payload.command,
     ]) || 'Tool call started';
   }
   if (evt.event === 'tool.final') {
     return firstString([
+      payload.command,
       payload.title,
       payload.name,
       payload.tool,
       payload.toolName,
-      payload.command,
     ]) || 'Tool call completed';
   }
   if (evt.event === 'tool.delta') {
@@ -418,6 +547,21 @@ function stepLabelForToolEvent(evt) {
   return '';
 }
 
+function stepKeyForToolEvent(evt) {
+  const payload = evt?.payload || {};
+  const key = firstString([
+    payload.stepKey,
+    payload.step_key,
+    payload.callId,
+    payload.call_id,
+    payload.toolCallId,
+    payload.tool_call_id,
+    payload.id,
+  ]);
+  if (!key) return '';
+  return key.startsWith('tool:') ? key : `tool:${key}`;
+}
+
 function humanizeToken(value) {
   const normalized = String(value || '')
     .trim()
@@ -463,6 +607,25 @@ function stepLabelForRunEvent(evt) {
   ]) || 'Working...';
 }
 
+function stepKeyForRunEvent(evt) {
+  const payload = evt?.payload || {};
+  const item = payload?.item && typeof payload.item === 'object' ? payload.item : {};
+  const key = firstString([
+    payload.stepKey,
+    payload.step_key,
+    item.stepKey,
+    item.step_key,
+    payload.callId,
+    payload.call_id,
+    item.callId,
+    item.call_id,
+    item.id,
+    payload.id,
+  ]);
+  if (!key) return '';
+  return key.startsWith('tool:') ? key : `tool:${key}`;
+}
+
 function trackRunStep(run, evt) {
   if (!run || !evt?.event) return;
 
@@ -471,6 +634,7 @@ function trackRunStep(run, evt) {
       kind: evt.event === 'tool.delta' ? 'reasoning' : 'tool',
       status: evt.event === 'tool.final' ? 'done' : 'running',
       label: stepLabelForToolEvent(evt),
+      ...(stepKeyForToolEvent(evt) ? { key: stepKeyForToolEvent(evt) } : {}),
     };
     pushRunStep(run, step);
     pushRunTimelineEntry(run, { type: 'step', ...step });
@@ -482,6 +646,7 @@ function trackRunStep(run, evt) {
       kind: stepKindForRunEvent(evt),
       status: stepStatusForRunEvent(evt),
       label: stepLabelForRunEvent(evt),
+      ...(stepKeyForRunEvent(evt) ? { key: stepKeyForRunEvent(evt) } : {}),
     };
     pushRunStep(run, step);
     pushRunTimelineEntry(run, { type: 'step', ...step });
@@ -909,6 +1074,15 @@ export async function startChatd(opts = {}) {
                   return;
                 }
 
+                if (evt.event === 'chat.commentary') {
+                  const delta = evt.payload?.delta || '';
+                  if (delta) {
+                    pushRunTimelineEntry(active, { type: 'text', text: delta });
+                    broadcast(buildEvent({ event: 'chat.commentary', runId, sessionId, payload: { delta } }));
+                  }
+                  return;
+                }
+
                 if (evt.event === 'chat.final') {
                   const text = evt.payload?.text || active.assistantBuffer || '';
                   await finalizeRun(active, text);
diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
index 776f039..6272038 100644
--- a/agent/src/codex-runner.js
+++ b/agent/src/codex-runner.js
@@ -48,6 +48,214 @@ function toUsagePayload(source = {}) {
   };
 }
 
+function firstString(values) {
+  for (const value of values) {
+    if (typeof value === 'string' && value.trim()) return value.trim();
+  }
+  return '';
+}
+
+function safeParseJson(value) {
+  if (typeof value !== 'string' || !value.trim()) return null;
+  try {
+    return JSON.parse(value);
+  } catch {
+    return null;
+  }
+}
+
+function normalizeToolIdentity(payload = {}, fallbackCallId = '') {
+  const callId = firstString([
+    payload.callId,
+    payload.call_id,
+    payload.toolCallId,
+    payload.tool_call_id,
+    payload.id,
+    fallbackCallId,
+  ]);
+  const stepKey = firstString([
+    payload.stepKey,
+    payload.step_key,
+    callId ? `tool:${callId}` : '',
+  ]);
+  return {
+    ...payload,
+    ...(callId ? { callId } : {}),
+    ...(stepKey ? { stepKey } : {}),
+  };
+}
+
+function quoteForShell(value) {
+  const source = String(value || '');
+  if (!source) return '';
+  return source.replace(/'/g, `'\"'\"'`);
+}
+
+function toolCommandLabel({ name, parsedArgs, rawArgs }) {
+  if (parsedArgs && typeof parsedArgs === 'object') {
+    const cmd = firstString([
+      parsedArgs.cmd,
+      parsedArgs.command,
+    ]);
+    if (cmd) {
+      return `/bin/zsh -lc '${quoteForShell(cmd)}'`;
+    }
+  }
+
+  if (typeof rawArgs === 'string' && rawArgs.trim() && rawArgs.trim().startsWith('{')) {
+    const parsed = safeParseJson(rawArgs);
+    if (parsed && typeof parsed === 'object') {
+      const cmd = firstString([parsed.cmd, parsed.command]);
+      if (cmd) return `/bin/zsh -lc '${quoteForShell(cmd)}'`;
+    }
+  }
+
+  if (name === 'exec_command' && typeof rawArgs === 'string' && rawArgs.trim()) {
+    return rawArgs.trim().length > 160 ? `${rawArgs.trim().slice(0, 157)}...` : rawArgs.trim();
+  }
+  return '';
+}
+
+function messageTextFromContent(content) {
+  if (typeof content === 'string') return content;
+  if (!Array.isArray(content)) return '';
+  const parts = [];
+  for (const item of content) {
+    if (!item || typeof item !== 'object') continue;
+    const text = firstString([item.text, item.message, item.delta]);
+    if (text) parts.push(text);
+  }
+  return parts.join('');
+}
+
+function normalizeResponseItem({ runId, sessionId, payload }) {
+  if (!payload || typeof payload !== 'object') return null;
+  const itemType = String(payload.type || '').toLowerCase();
+
+  if (itemType === 'message') {
+    const role = String(payload.role || '').toLowerCase();
+    if (role !== 'assistant') return null;
+    const text = firstString([
+      payload.text,
+      payload.message,
+      messageTextFromContent(payload.content),
+    ]);
+    if (!text) return null;
+    const phase = String(payload.phase || '').toLowerCase();
+    if (phase === 'final_answer') {
+      return envelope({ event: 'chat.final', runId, sessionId, payload: { text, phase } });
+    }
+    return envelope({ event: 'chat.commentary', runId, sessionId, payload: { delta: text, phase } });
+  }
+
+  if (itemType === 'function_call' || itemType === 'custom_tool_call') {
+    const callId = firstString([payload.call_id, payload.callId, payload.id]);
+    const parsedArgs = safeParseJson(payload.arguments);
+    const command = toolCommandLabel({
+      name: String(payload.name || ''),
+      parsedArgs,
+      rawArgs: payload.arguments,
+    });
+    return envelope({
+      event: 'tool.started',
+      runId,
+      sessionId,
+      payload: normalizeToolIdentity({
+        ...payload,
+        ...(command ? { command } : {}),
+        ...(parsedArgs && typeof parsedArgs === 'object' ? { args: parsedArgs } : {}),
+      }, callId),
+    });
+  }
+
+  if (itemType === 'function_call_output' || itemType === 'custom_tool_call_output') {
+    const callId = firstString([payload.call_id, payload.callId, payload.id]);
+    return envelope({
+      event: 'tool.final',
+      runId,
+      sessionId,
+      payload: normalizeToolIdentity(payload, callId),
+    });
+  }
+
+  if (itemType === 'reasoning') {
+    const text = firstString([
+      payload.text,
+      payload.message,
+      ...(Array.isArray(payload.summary)
+        ? payload.summary
+          .map((summaryItem) => summaryItem?.text || summaryItem?.summary_text || '')
+          .filter(Boolean)
+        : []),
+    ]);
+    if (!text) return null;
+    return envelope({
+      event: 'tool.delta',
+      runId,
+      sessionId,
+      payload: { type: 'reasoning', text },
+    });
+  }
+
+  return envelope({ event: 'run.event', runId, sessionId, payload });
+}
+
+function normalizeEventMsg({ runId, sessionId, payload }) {
+  if (!payload || typeof payload !== 'object') return null;
+  const payloadType = String(payload.type || '').toLowerCase();
+
+  if (payloadType === 'token_count' && payload.info && typeof payload.info === 'object') {
+    const usage = payload.info.total_token_usage && typeof payload.info.total_token_usage === 'object'
+      ? payload.info.total_token_usage
+      : {};
+    return envelope({
+      event: 'run.usage',
+      runId,
+      sessionId,
+      payload: toUsagePayload({
+        ...usage,
+        model_context_window: payload.info.model_context_window,
+        reasoning_output_tokens: payload.info.reasoning_output_tokens,
+      }),
+    });
+  }
+
+  if (payloadType === 'agent_reasoning') {
+    const text = firstString([payload.text, payload.message]);
+    if (!text) return null;
+    return envelope({
+      event: 'tool.delta',
+      runId,
+      sessionId,
+      payload: { type: 'reasoning', text },
+    });
+  }
+
+  if (payloadType === 'agent_message') {
+    const text = firstString([payload.message, payload.text]);
+    if (!text) return null;
+    const phase = String(payload.phase || '').toLowerCase();
+    if (phase === 'final_answer') {
+      return envelope({ event: 'chat.final', runId, sessionId, payload: { text, phase } });
+    }
+    return envelope({ event: 'chat.commentary', runId, sessionId, payload: { delta: text, phase } });
+  }
+
+  if (payloadType === 'task_started') {
+    return envelope({ event: 'run.started', runId, sessionId, payload });
+  }
+
+  if (payloadType === 'task_complete') {
+    const text = firstString([payload.last_agent_message, payload.message, payload.text]);
+    if (text) {
+      return envelope({ event: 'chat.final', runId, sessionId, payload: { text } });
+    }
+    return envelope({ event: 'run.event', runId, sessionId, payload });
+  }
+
+  return envelope({ event: 'run.event', runId, sessionId, payload });
+}
+
 export function normalizeCodexLine({ runId, sessionId, line }) {
   const parsed = safeParse(line);
   if (!parsed || typeof parsed !== 'object') {
@@ -56,6 +264,14 @@ export function normalizeCodexLine({ runId, sessionId, line }) {
 
   const type = String(parsed.type || '').toLowerCase();
 
+  if (type === 'response_item') {
+    return normalizeResponseItem({ runId, sessionId, payload: parsed.payload });
+  }
+
+  if (type === 'event_msg') {
+    return normalizeEventMsg({ runId, sessionId, payload: parsed.payload });
+  }
+
   if (type === 'thread.started') {
     const providerSessionId = String(parsed.thread_id || '').trim();
     if (providerSessionId) {
@@ -198,6 +414,7 @@ export function startCodexRun({
   stdoutLines.on('line', (line) => {
     try {
       const evt = normalizeCodexLine({ runId, sessionId, line });
+      if (!evt) return;
       onEvent?.(evt);
     } catch (error) {
       onError?.(error);
diff --git a/agent/src/session-store.js b/agent/src/session-store.js
index a624f4b..148f7cf 100644
--- a/agent/src/session-store.js
+++ b/agent/src/session-store.js
@@ -55,11 +55,23 @@ function normalizeStep(step) {
   const label = String(step.label || '').trim();
   if (!label) return null;
   const kind = String(step.kind || '').trim() || 'reasoning';
-  const status = String(step.status || '').trim() || 'running';
+  const normalizedStatus = String(step.status || '').trim().toLowerCase();
+  const status = normalizedStatus === 'completed' || normalizedStatus === 'success' || normalizedStatus === 'succeeded'
+    ? 'done'
+    : (normalizedStatus || 'running');
+  const key = String(step.key || '').trim();
+  const details = Array.isArray(step.details)
+    ? step.details
+      .map((item) => String(item || '').trim())
+      .filter(Boolean)
+      .slice(0, 8)
+    : [];
   return {
     kind,
     status,
     label: label.length > 160 ? `${label.slice(0, 157)}...` : label,
+    ...(key ? { key: key.length > 220 ? key.slice(0, 220) : key } : {}),
+    ...(details.length > 0 ? { details } : {}),
   };
 }
 
@@ -86,6 +98,11 @@ function normalizeTimelineEntry(entry) {
 function normalizeTimeline(timeline) {
   if (!Array.isArray(timeline)) return [];
   const entries = [];
+  const isTerminal = (status) => ['done', 'failed', 'aborted'].includes(String(status || '').toLowerCase());
+  const isGenericLabel = (label) => {
+    const normalized = String(label || '').trim().toLowerCase();
+    return normalized === 'tool call started' || normalized === 'tool call completed' || normalized === 'working...';
+  };
   for (const item of timeline.slice(-200)) {
     const normalized = normalizeTimelineEntry(item);
     if (!normalized) continue;
@@ -94,12 +111,63 @@ function normalizeTimeline(timeline) {
       last.text = `${last.text || ''}${normalized.text || ''}`;
       continue;
     }
+    if (normalized.type === 'step' && normalized.key) {
+      const index = (() => {
+        for (let idx = entries.length - 1; idx >= 0; idx -= 1) {
+          const entry = entries[idx];
+          if (entry?.type === 'step' && entry.key === normalized.key) return idx;
+        }
+        return -1;
+      })();
+      if (index >= 0) {
+        const existing = entries[index];
+        entries[index] = {
+          ...existing,
+          ...normalized,
+          label: (isGenericLabel(normalized.label) && existing?.label) ? existing.label : normalized.label,
+          details: normalized.details && normalized.details.length > 0 ? normalized.details : existing?.details,
+        };
+        continue;
+      }
+    }
+    if (
+      normalized.type === 'step'
+      && !normalized.key
+      && isTerminal(normalized.status)
+    ) {
+      const index = (() => {
+        for (let idx = entries.length - 1; idx >= 0; idx -= 1) {
+          const entry = entries[idx];
+          if (
+            entry
+            && entry.type === 'step'
+            && !entry.key
+            && entry.kind === normalized.kind
+            && entry.label === normalized.label
+            && !isTerminal(entry.status)
+          ) {
+            return idx;
+          }
+        }
+        return -1;
+      })();
+      if (index >= 0) {
+        const existing = entries[index];
+        entries[index] = {
+          ...existing,
+          ...normalized,
+          details: normalized.details && normalized.details.length > 0 ? normalized.details : existing?.details,
+        };
+        continue;
+      }
+    }
     if (
       normalized.type === 'step'
       && last?.type === 'step'
       && last.label === normalized.label
       && last.kind === normalized.kind
       && last.status === normalized.status
+      && last.key === normalized.key
     ) {
       continue;
     }
diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 7cd3fe7..5661b98 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -19,6 +19,33 @@ function trimStepLabel(label) {
   return text.length > 160 ? `${text.slice(0, 157)}...` : text;
 }
 
+function trimStepKey(key) {
+  const text = String(key || '').trim();
+  if (!text) return '';
+  return text.length > 220 ? text.slice(0, 220) : text;
+}
+
+function normalizeStepStatus(status) {
+  const normalized = String(status || '').trim().toLowerCase();
+  if (!normalized) return 'running';
+  if (normalized === 'completed' || normalized === 'success' || normalized === 'succeeded') return 'done';
+  return normalized;
+}
+
+function isTerminalStepStatus(status) {
+  const normalized = normalizeStepStatus(status);
+  return normalized === 'done' || normalized === 'failed' || normalized === 'aborted';
+}
+
+function isGenericToolLabel(label) {
+  const normalized = String(label || '').trim().toLowerCase();
+  return normalized === 'tool call started' || normalized === 'tool call completed' || normalized === 'working...';
+}
+
+function detailsEqual(a, b) {
+  return JSON.stringify(a || []) === JSON.stringify(b || []);
+}
+
 function normalizeStepDetails(details, label = '') {
   const lines = [];
   const pushLine = (value) => {
@@ -67,8 +94,9 @@ function normalizeStep(step) {
   const details = normalizeStepDetails(step.details, label);
   return {
     kind: step.kind || 'reasoning',
-    status: step.status || 'running',
+    status: normalizeStepStatus(step.status),
     label,
+    ...(trimStepKey(step.key) ? { key: trimStepKey(step.key) } : {}),
     ...(details.length > 0 ? { details } : {}),
   };
 }
@@ -77,13 +105,58 @@ function pushStep(run, step) {
   const steps = Array.isArray(run?.steps) ? run.steps.slice() : [];
   const normalized = normalizeStep(step);
   if (!normalized || !normalized.label) return steps;
+  const keyedIndex = normalized.key
+    ? (() => {
+      for (let idx = steps.length - 1; idx >= 0; idx -= 1) {
+        if (steps[idx]?.key === normalized.key) return idx;
+      }
+      return -1;
+    })()
+    : -1;
+  if (keyedIndex >= 0) {
+    const existing = steps[keyedIndex];
+    steps[keyedIndex] = {
+      ...existing,
+      ...normalized,
+      label: (isGenericToolLabel(normalized.label) && existing?.label) ? existing.label : normalized.label,
+      details: normalized.details && normalized.details.length > 0 ? normalized.details : existing?.details,
+    };
+    return steps;
+  }
+
+  if (!normalized.key && isTerminalStepStatus(normalized.status)) {
+    let fallbackIndex = -1;
+    for (let idx = steps.length - 1; idx >= 0; idx -= 1) {
+      const entry = steps[idx];
+      if (
+        entry
+        && !entry.key
+        && String(entry.kind || '') === normalized.kind
+        && String(entry.label || '') === normalized.label
+        && !isTerminalStepStatus(entry.status)
+      ) {
+        fallbackIndex = idx;
+        break;
+      }
+    }
+    if (fallbackIndex >= 0) {
+      const existing = steps[fallbackIndex];
+      steps[fallbackIndex] = {
+        ...existing,
+        ...normalized,
+        details: normalized.details && normalized.details.length > 0 ? normalized.details : existing?.details,
+      };
+      return steps;
+    }
+  }
+
   const last = steps[steps.length - 1];
   if (
     last
     && last.label === normalized.label
     && last.kind === normalized.kind
     && last.status === normalized.status
-    && JSON.stringify(last.details || []) === JSON.stringify(normalized.details || [])
+    && detailsEqual(last.details, normalized.details)
   ) {
     return steps;
   }
@@ -109,6 +182,53 @@ function pushTimelineEntry(run, entry) {
     const normalized = normalizeStep(entry);
     if (!normalized) return timeline;
     const candidate = { type: 'step', ...normalized };
+    const keyedIndex = candidate.key
+      ? (() => {
+        for (let idx = timeline.length - 1; idx >= 0; idx -= 1) {
+          const item = timeline[idx];
+          if (item?.type === 'step' && item.key === candidate.key) return idx;
+        }
+        return -1;
+      })()
+      : -1;
+    if (keyedIndex >= 0) {
+      const existing = timeline[keyedIndex];
+      timeline[keyedIndex] = {
+        ...existing,
+        ...candidate,
+        label: (isGenericToolLabel(candidate.label) && existing?.label) ? existing.label : candidate.label,
+        details: candidate.details && candidate.details.length > 0 ? candidate.details : existing?.details,
+      };
+      return timeline;
+    }
+
+    if (!candidate.key && isTerminalStepStatus(candidate.status)) {
+      let fallbackIndex = -1;
+      for (let idx = timeline.length - 1; idx >= 0; idx -= 1) {
+        const item = timeline[idx];
+        if (
+          item
+          && item.type === 'step'
+          && !item.key
+          && String(item.kind || '') === candidate.kind
+          && String(item.label || '') === candidate.label
+          && !isTerminalStepStatus(item.status)
+        ) {
+          fallbackIndex = idx;
+          break;
+        }
+      }
+      if (fallbackIndex >= 0) {
+        const existing = timeline[fallbackIndex];
+        timeline[fallbackIndex] = {
+          ...existing,
+          ...candidate,
+          details: candidate.details && candidate.details.length > 0 ? candidate.details : existing?.details,
+        };
+        return timeline;
+      }
+    }
+
     const last = timeline[timeline.length - 1];
     if (
       last
@@ -116,7 +236,7 @@ function pushTimelineEntry(run, entry) {
       && last.label === candidate.label
       && last.kind === candidate.kind
       && last.status === candidate.status
-      && JSON.stringify(last.details || []) === JSON.stringify(candidate.details || [])
+      && detailsEqual(last.details, candidate.details)
     ) {
       return timeline;
     }
@@ -139,13 +259,24 @@ function normalizeStoredTimelineEntry(entry) {
   return { type: 'step', ...step };
 }
 
+function normalizeStoredTimeline(timeline) {
+  if (!Array.isArray(timeline)) return [];
+  let entries = [];
+  for (const item of timeline.slice(-200)) {
+    const normalized = normalizeStoredTimelineEntry(item);
+    if (!normalized) continue;
+    entries = pushTimelineEntry({ timeline: entries }, normalized);
+  }
+  return entries;
+}
+
 function fallbackTimelineFromMessage({ steps, text }) {
-  const timeline = [];
+  let timeline = [];
   for (const step of steps) {
-    timeline.push({ type: 'step', ...step });
+    timeline = pushTimelineEntry({ timeline }, { type: 'step', ...step });
   }
   if (typeof text === 'string' && text) {
-    timeline.push({ type: 'text', text });
+    timeline = pushTimelineEntry({ timeline }, { type: 'text', text });
   }
   return timeline;
 }
@@ -183,20 +314,20 @@ function stepLabelForToolEvent(evt) {
   const payload = evt?.payload || {};
   if (evt.event === 'tool.started') {
     return firstString([
+      payload.command,
       payload.title,
       payload.name,
       payload.tool,
       payload.toolName,
-      payload.command,
     ]) || 'Tool call started';
   }
   if (evt.event === 'tool.final') {
     return firstString([
+      payload.command,
       payload.title,
       payload.name,
       payload.tool,
       payload.toolName,
-      payload.command,
     ]) || 'Tool call completed';
   }
   if (evt.event === 'tool.delta') {
@@ -214,6 +345,21 @@ function stepLabelForToolEvent(evt) {
   return '';
 }
 
+function stepKeyForToolEvent(evt) {
+  const payload = evt?.payload || {};
+  const key = firstString([
+    payload.stepKey,
+    payload.step_key,
+    payload.callId,
+    payload.call_id,
+    payload.toolCallId,
+    payload.tool_call_id,
+    payload.id,
+  ]);
+  if (!key) return '';
+  return key.startsWith('tool:') ? key : `tool:${key}`;
+}
+
 function stepDetailsForToolEvent(evt, label) {
   const payload = evt?.payload || {};
   return normalizeStepDetails([
@@ -277,6 +423,25 @@ function stepLabelForRunEvent(evt) {
   ]) || 'Working...';
 }
 
+function stepKeyForRunEvent(evt) {
+  const payload = evt?.payload || {};
+  const item = payload?.item && typeof payload.item === 'object' ? payload.item : {};
+  const key = firstString([
+    payload.stepKey,
+    payload.step_key,
+    item.stepKey,
+    item.step_key,
+    payload.callId,
+    payload.call_id,
+    item.callId,
+    item.call_id,
+    item.id,
+    payload.id,
+  ]);
+  if (!key) return '';
+  return key.startsWith('tool:') ? key : `tool:${key}`;
+}
+
 function stepDetailsForRunEvent(evt, label) {
   const payload = evt?.payload || {};
   return normalizeStepDetails([
@@ -339,9 +504,7 @@ function hydrateRunsFromMessages(messages, sessionId, currentRuns) {
     const steps = Array.isArray(message?.steps)
       ? message.steps.map(normalizeStoredStep).filter(Boolean)
       : [];
-    const timeline = Array.isArray(message?.timeline)
-      ? message.timeline.map(normalizeStoredTimelineEntry).filter(Boolean)
-      : [];
+    const timeline = normalizeStoredTimeline(message?.timeline);
     const resolvedText = typeof message?.text === 'string' ? message.text : (currentRuns?.[runId]?.text || '');
     hydrated[runId] = {
       ...(currentRuns?.[runId] || { runId, text: '', done: false, steps: [] }),
@@ -443,6 +606,18 @@ export function applyEvent(state = initialState, evt = {}) {
     };
   }
 
+  if (evt.event === 'chat.commentary') {
+    const run = state.runs[evt.runId] || { text: '', done: false, steps: [], timeline: [] };
+    const delta = evt.payload?.delta || '';
+    return {
+      ...state,
+      runs: upsertRun(state, evt.runId, {
+        sessionId: evt.sessionId,
+        timeline: pushTimelineEntry(run, { type: 'text', text: delta }),
+      }),
+    };
+  }
+
   if (evt.event === 'chat.final') {
     const run = state.runs[evt.runId] || { text: '', done: false, steps: [], timeline: [] };
     const finalText = evt.payload?.text || run.text || '';
@@ -545,7 +720,14 @@ export function applyEvent(state = initialState, evt = {}) {
       : 'tool';
     const label = stepLabelForToolEvent(evt);
     const details = stepDetailsForToolEvent(evt, label);
-    const step = { kind, status, label, ...(details.length > 0 ? { details } : {}) };
+    const stepKey = stepKeyForToolEvent(evt);
+    const step = {
+      kind,
+      status,
+      label,
+      ...(stepKey ? { key: stepKey } : {}),
+      ...(details.length > 0 ? { details } : {}),
+    };
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
@@ -563,7 +745,14 @@ export function applyEvent(state = initialState, evt = {}) {
     const kind = stepKindForRunEvent(evt);
     const label = stepLabelForRunEvent(evt);
     const details = stepDetailsForRunEvent(evt, label);
-    const step = { kind, status, label, ...(details.length > 0 ? { details } : {}) };
+    const stepKey = stepKeyForRunEvent(evt);
+    const step = {
+      kind,
+      status,
+      label,
+      ...(stepKey ? { key: stepKey } : {}),
+      ...(details.length > 0 ? { details } : {}),
+    };
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 0572f4d..a9a774a 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -328,7 +328,7 @@ body {
 }
 
 .timeline-step {
-  padding-left: 2px;
+  padding-left: 0;
 }
 
 .step-item {
@@ -350,7 +350,7 @@ body {
   color: inherit;
   cursor: pointer;
   display: flex;
-  align-items: center;
+  align-items: flex-start;
   justify-content: space-between;
   gap: 10px;
   text-align: left;
@@ -421,7 +421,7 @@ body {
   align-items: center;
   justify-content: center;
   flex-shrink: 0;
-  margin-top: 2px;
+  margin-top: 1px;
   color: var(--text-subtle);
   position: relative;
 }
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 773a3e4..89c9401 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -436,6 +436,8 @@ function renderRunTimeline(run, fallbackText = '') {
   const latestStepIndex = getLatestInFlightTimelineStepIndex(run, timeline);
   const getTimelineEntryKey = (entry, index) => {
     const runId = String(run?.runId || 'run');
+    const stableKey = String(entry?.key || '').trim();
+    if (stableKey) return `${runId}:${stableKey}`;
     const kind = String(entry?.kind || '');
     const status = String(entry?.status || '');
     const label = String(entry?.label || '');
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
index b6f2205..bfd55c0 100644
--- a/test/agent/codex-runner.test.js
+++ b/test/agent/codex-runner.test.js
@@ -117,3 +117,70 @@ test('maps codex thread.started provider session id event to run.provider_sessio
   assert.equal(evt.payload.provider, 'codex');
   assert.equal(evt.payload.sessionId, '019caa6f-8c63-7c81-a542-3dbcf922d065');
 });
+
+test('maps response_item function_call/function_call_output into keyed tool lifecycle events', () => {
+  const start = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: JSON.stringify({
+      type: 'response_item',
+      payload: {
+        type: 'function_call',
+        call_id: 'call_123',
+        name: 'exec_command',
+        arguments: JSON.stringify({ cmd: 'rg --files' }),
+      },
+    }),
+  });
+  const done = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: JSON.stringify({
+      type: 'response_item',
+      payload: {
+        type: 'function_call_output',
+        call_id: 'call_123',
+        output: 'ok',
+      },
+    }),
+  });
+
+  assert.equal(start.event, 'tool.started');
+  assert.equal(start.payload.callId, 'call_123');
+  assert.equal(start.payload.command, "/bin/zsh -lc 'rg --files'");
+  assert.equal(done.event, 'tool.final');
+  assert.equal(done.payload.callId, 'call_123');
+  assert.equal(done.payload.stepKey, 'tool:call_123');
+});
+
+test('maps event_msg agent_message commentary to chat.commentary and final_answer to chat.final', () => {
+  const commentary = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: JSON.stringify({
+      type: 'event_msg',
+      payload: {
+        type: 'agent_message',
+        phase: 'commentary',
+        message: 'Inspecting files',
+      },
+    }),
+  });
+  const final = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: JSON.stringify({
+      type: 'event_msg',
+      payload: {
+        type: 'agent_message',
+        phase: 'final_answer',
+        message: 'All done',
+      },
+    }),
+  });
+
+  assert.equal(commentary.event, 'chat.commentary');
+  assert.equal(commentary.payload.delta, 'Inspecting files');
+  assert.equal(final.event, 'chat.final');
+  assert.equal(final.payload.text, 'All done');
+});
diff --git a/test/agent/session-store.test.js b/test/agent/session-store.test.js
index 162a95f..e17fccf 100644
--- a/test/agent/session-store.test.js
+++ b/test/agent/session-store.test.js
@@ -58,6 +58,29 @@ test('messages preserve optional run metadata used for transcript rehydration',
   ]);
 });
 
+test('messages preserve step key metadata used for lifecycle collapse on reload', async () => {
+  const { sessionId } = await createSession({ title: 'Run step keys', storageRoot });
+  await appendMessage({
+    sessionId,
+    role: 'assistant',
+    text: 'done',
+    runId: 'run_456',
+    steps: [{ kind: 'tool', status: 'done', label: 'Run command', key: 'tool:call_1' }],
+    timeline: [
+      { type: 'step', kind: 'tool', status: 'done', label: 'Run command', key: 'tool:call_1' },
+      { type: 'text', text: 'done' },
+    ],
+    storageRoot,
+  });
+  const rows = await readMessages({ sessionId, limit: 20, storageRoot });
+  const last = rows.at(-1);
+  assert.deepEqual(last.steps, [{ kind: 'tool', status: 'done', label: 'Run command', key: 'tool:call_1' }]);
+  assert.deepEqual(last.timeline, [
+    { type: 'step', kind: 'tool', status: 'done', label: 'Run command', key: 'tool:call_1' },
+    { type: 'text', text: 'done' },
+  ]);
+});
+
 test('rejects unsafe session ids', async () => {
   await assert.rejects(
     appendMessage({ sessionId: '../escape', role: 'user', text: 'x', storageRoot }),
diff --git a/test/agent/session-ui-state.test.js b/test/agent/session-ui-state.test.js
index c491fe0..a5781bb 100644
--- a/test/agent/session-ui-state.test.js
+++ b/test/agent/session-ui-state.test.js
@@ -89,6 +89,36 @@ test('messages.loaded hydrates stored timeline entries for reopened sessions', (
   assert.equal(next.runs.run_2?.timeline?.[1]?.type, 'text');
 });
 
+test('messages.loaded collapses legacy running+done duplicate tool entries on reload', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {},
+  };
+
+  const next = reduceState(state, {
+    type: 'messages.loaded',
+    sessionId: 's1',
+    messages: [{
+      role: 'assistant',
+      text: 'Done',
+      runId: 'run_3',
+      timeline: [
+        { type: 'step', kind: 'tool', status: 'running', label: "/bin/zsh -lc 'rg --files'" },
+        { type: 'step', kind: 'tool', status: 'done', label: "/bin/zsh -lc 'rg --files'" },
+        { type: 'text', text: 'Done' },
+      ],
+    }],
+  });
+
+  const timeline = next.runs.run_3?.timeline || [];
+  assert.equal(timeline.length, 2);
+  assert.equal(timeline[0]?.type, 'step');
+  assert.equal(timeline[0]?.status, 'done');
+  assert.equal(timeline[1]?.type, 'text');
+});
+
 test('session.metadata.loaded hydrates persisted codex usage for reopened session', () => {
   const state = {
     activeSessionId: 's1',
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index 5661388..be9f34a 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -57,9 +57,10 @@ test('tool and reasoning events are tracked as steps', () => {
   const s4 = applyEvent(s3, { event: 'tool.final', runId: 'r1', sessionId: 's1', payload: { tool: 'fetch' } });
 
   assert.equal(Array.isArray(s4.runs.r1.steps), true);
-  assert.equal(s4.runs.r1.steps.length, 3);
-  assert.match(s4.runs.r1.steps[0].label, /fetch/i);
-  assert.match(s4.runs.r1.steps[1].label, /Planning/);
+  assert.equal(s4.runs.r1.steps.length, 2);
+  assert.equal(s4.runs.r1.steps.filter((step) => /fetch/i.test(step?.label || '')).length, 1);
+  assert.equal(s4.runs.r1.steps.some((step) => /Planning/.test(step?.label || '')), true);
+  assert.equal(s4.runs.r1.steps.find((step) => /fetch/i.test(step?.label || ''))?.status, 'done');
 });
 
 test('chat and tool events preserve inline timeline order', () => {
@@ -78,6 +79,47 @@ test('chat and tool events preserve inline timeline order', () => {
   assert.equal(timeline[2]?.text, 'Second chunk.');
 });
 
+test('tool.final replaces matching in-flight tool step at original timeline position', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, { event: 'chat.delta', runId: 'r1', sessionId: 's1', payload: { delta: 'Before. ' } });
+  const s3 = applyEvent(s2, {
+    event: 'tool.started',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: { tool: 'execute', callId: 'call_1', stepKey: 'tool:call_1' },
+  });
+  const s4 = applyEvent(s3, { event: 'chat.delta', runId: 'r1', sessionId: 's1', payload: { delta: 'After.' } });
+  const s5 = applyEvent(s4, {
+    event: 'tool.final',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: { callId: 'call_1', stepKey: 'tool:call_1' },
+  });
+
+  const timeline = s5.runs.r1.timeline || [];
+  assert.deepEqual(timeline.map((item) => item.type), ['text', 'step', 'text']);
+  assert.equal(timeline[1]?.status, 'done');
+  assert.equal(timeline[1]?.key, 'tool:call_1');
+  assert.equal((s5.runs.r1.steps || []).filter((item) => item?.key === 'tool:call_1').length, 1);
+});
+
+test('chat.commentary text stays inline but does not pollute final assistant message text', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, {
+    event: 'chat.commentary',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: { delta: 'Inspecting files...' },
+  });
+  const s3 = applyEvent(s2, { event: 'chat.final', runId: 'r1', sessionId: 's1', payload: { text: 'Final answer.' } });
+
+  const timeline = s3.runs.r1.timeline || [];
+  assert.equal(timeline.some((item) => item?.type === 'text' && /Inspecting files/.test(item?.text || '')), true);
+  assert.equal(timeline.some((item) => item?.type === 'text' && /Final answer/.test(item?.text || '')), true);
+  assert.equal(s3.runs.r1.text, 'Final answer.');
+  assert.equal(s3.messagesBySession.s1.at(-1)?.text, 'Final answer.');
+});
+
 test('chat.final stores timeline with assistant transcript message', () => {
   const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
   const s2 = applyEvent(s1, { event: 'chat.delta', runId: 'r1', sessionId: 's1', payload: { delta: 'Done.' } });

From 351db29f73bcbc00916f88c7958f24d1668d8f3b Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 13:22:06 +0530
Subject: [PATCH 153/192] agent: fix phase-less final text loss and legacy tool
 collapse

---
 agent/src/chatd.js                  | 28 ++++++++++++------------
 agent/src/codex-runner.js           | 23 ++++++++++++++++----
 agent/src/session-store.js          | 18 +++++++++-------
 extension/agent-panel-state.js      | 28 ++++++++++++------------
 test/agent/codex-runner.test.js     | 33 +++++++++++++++++++++++++++++
 test/agent/session-ui-state.test.js | 29 +++++++++++++++++++++++++
 test/agent/sse-events.test.js       | 24 +++++++++++++++++++++
 7 files changed, 141 insertions(+), 42 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index e652241..c279f44 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -323,6 +323,15 @@ function isGenericToolLabel(label) {
   return normalized === 'tool call started' || normalized === 'tool call completed' || normalized === 'working...';
 }
 
+function shouldLegacyTerminalCollapseMatch(existing, candidate) {
+  if (!existing || existing.key) return false;
+  if (isTerminalStepStatus(existing.status)) return false;
+  if (String(existing.kind || '') !== String(candidate.kind || '')) return false;
+  const wildcardLabel = candidate.kind === 'tool' && isGenericToolLabel(candidate.label);
+  if (wildcardLabel) return true;
+  return String(existing.label || '') === String(candidate.label || '');
+}
+
 function detailsEqual(a, b) {
   return JSON.stringify(a || []) === JSON.stringify(b || []);
 }
@@ -369,13 +378,7 @@ function pushRunStep(run, step) {
     let fallbackIndex = -1;
     for (let idx = steps.length - 1; idx >= 0; idx -= 1) {
       const entry = steps[idx];
-      if (
-        entry
-        && !entry.key
-        && String(entry.kind || '') === normalized.kind
-        && String(entry.label || '') === normalized.label
-        && !isTerminalStepStatus(entry.status)
-      ) {
+      if (shouldLegacyTerminalCollapseMatch(entry, normalized)) {
         fallbackIndex = idx;
         break;
       }
@@ -385,6 +388,7 @@ function pushRunStep(run, step) {
       steps[fallbackIndex] = {
         ...existing,
         ...normalized,
+        label: (isGenericToolLabel(normalized.label) && existing?.label) ? existing.label : normalized.label,
         details: normalized.details && normalized.details.length > 0 ? normalized.details : existing?.details,
       };
       run.steps = steps;
@@ -445,14 +449,7 @@ function pushRunTimelineEntry(run, entry) {
         let fallbackIndex = -1;
         for (let idx = timeline.length - 1; idx >= 0; idx -= 1) {
           const item = timeline[idx];
-          if (
-            item
-            && item.type === 'step'
-            && !item.key
-            && String(item.kind || '') === next.kind
-            && String(item.label || '') === next.label
-            && !isTerminalStepStatus(item.status)
-          ) {
+          if (item?.type === 'step' && shouldLegacyTerminalCollapseMatch(item, next)) {
             fallbackIndex = idx;
             break;
           }
@@ -462,6 +459,7 @@ function pushRunTimelineEntry(run, entry) {
           timeline[fallbackIndex] = {
             ...existing,
             ...next,
+            label: (isGenericToolLabel(next.label) && existing?.label) ? existing.label : next.label,
             details: next.details && next.details.length > 0 ? next.details : existing?.details,
           };
           run.timeline = timeline;
diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
index 6272038..e7d9bfe 100644
--- a/agent/src/codex-runner.js
+++ b/agent/src/codex-runner.js
@@ -128,6 +128,15 @@ function messageTextFromContent(content) {
   return parts.join('');
 }
 
+function isFinalPhase(phase) {
+  return String(phase || '').trim().toLowerCase() === 'final_answer';
+}
+
+function isCommentaryPhase(phase) {
+  const normalized = String(phase || '').trim().toLowerCase();
+  return normalized === 'commentary' || normalized === 'analysis' || normalized === 'thinking';
+}
+
 function normalizeResponseItem({ runId, sessionId, payload }) {
   if (!payload || typeof payload !== 'object') return null;
   const itemType = String(payload.type || '').toLowerCase();
@@ -142,10 +151,13 @@ function normalizeResponseItem({ runId, sessionId, payload }) {
     ]);
     if (!text) return null;
     const phase = String(payload.phase || '').toLowerCase();
-    if (phase === 'final_answer') {
+    if (isFinalPhase(phase)) {
       return envelope({ event: 'chat.final', runId, sessionId, payload: { text, phase } });
     }
-    return envelope({ event: 'chat.commentary', runId, sessionId, payload: { delta: text, phase } });
+    if (isCommentaryPhase(phase)) {
+      return envelope({ event: 'chat.commentary', runId, sessionId, payload: { delta: text, phase } });
+    }
+    return envelope({ event: 'chat.delta', runId, sessionId, payload: { delta: text, phase } });
   }
 
   if (itemType === 'function_call' || itemType === 'custom_tool_call') {
@@ -235,10 +247,13 @@ function normalizeEventMsg({ runId, sessionId, payload }) {
     const text = firstString([payload.message, payload.text]);
     if (!text) return null;
     const phase = String(payload.phase || '').toLowerCase();
-    if (phase === 'final_answer') {
+    if (isFinalPhase(phase)) {
       return envelope({ event: 'chat.final', runId, sessionId, payload: { text, phase } });
     }
-    return envelope({ event: 'chat.commentary', runId, sessionId, payload: { delta: text, phase } });
+    if (isCommentaryPhase(phase)) {
+      return envelope({ event: 'chat.commentary', runId, sessionId, payload: { delta: text, phase } });
+    }
+    return envelope({ event: 'chat.delta', runId, sessionId, payload: { delta: text, phase } });
   }
 
   if (payloadType === 'task_started') {
diff --git a/agent/src/session-store.js b/agent/src/session-store.js
index 148f7cf..567f7e3 100644
--- a/agent/src/session-store.js
+++ b/agent/src/session-store.js
@@ -103,6 +103,14 @@ function normalizeTimeline(timeline) {
     const normalized = String(label || '').trim().toLowerCase();
     return normalized === 'tool call started' || normalized === 'tool call completed' || normalized === 'working...';
   };
+  const shouldLegacyTerminalCollapseMatch = (existing, candidate) => {
+    if (!existing || existing.key) return false;
+    if (isTerminal(existing.status)) return false;
+    if (String(existing.kind || '') !== String(candidate.kind || '')) return false;
+    const wildcardLabel = candidate.kind === 'tool' && isGenericLabel(candidate.label);
+    if (wildcardLabel) return true;
+    return String(existing.label || '') === String(candidate.label || '');
+  };
   for (const item of timeline.slice(-200)) {
     const normalized = normalizeTimelineEntry(item);
     if (!normalized) continue;
@@ -138,14 +146,7 @@ function normalizeTimeline(timeline) {
       const index = (() => {
         for (let idx = entries.length - 1; idx >= 0; idx -= 1) {
           const entry = entries[idx];
-          if (
-            entry
-            && entry.type === 'step'
-            && !entry.key
-            && entry.kind === normalized.kind
-            && entry.label === normalized.label
-            && !isTerminal(entry.status)
-          ) {
+          if (entry?.type === 'step' && shouldLegacyTerminalCollapseMatch(entry, normalized)) {
             return idx;
           }
         }
@@ -156,6 +157,7 @@ function normalizeTimeline(timeline) {
         entries[index] = {
           ...existing,
           ...normalized,
+          label: (isGenericLabel(normalized.label) && existing?.label) ? existing.label : normalized.label,
           details: normalized.details && normalized.details.length > 0 ? normalized.details : existing?.details,
         };
         continue;
diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 5661b98..490e7d9 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -42,6 +42,15 @@ function isGenericToolLabel(label) {
   return normalized === 'tool call started' || normalized === 'tool call completed' || normalized === 'working...';
 }
 
+function shouldLegacyTerminalCollapseMatch(existing, candidate) {
+  if (!existing || existing.key) return false;
+  if (isTerminalStepStatus(existing.status)) return false;
+  if (String(existing.kind || '') !== String(candidate.kind || '')) return false;
+  const wildcardLabel = candidate.kind === 'tool' && isGenericToolLabel(candidate.label);
+  if (wildcardLabel) return true;
+  return String(existing.label || '') === String(candidate.label || '');
+}
+
 function detailsEqual(a, b) {
   return JSON.stringify(a || []) === JSON.stringify(b || []);
 }
@@ -128,13 +137,7 @@ function pushStep(run, step) {
     let fallbackIndex = -1;
     for (let idx = steps.length - 1; idx >= 0; idx -= 1) {
       const entry = steps[idx];
-      if (
-        entry
-        && !entry.key
-        && String(entry.kind || '') === normalized.kind
-        && String(entry.label || '') === normalized.label
-        && !isTerminalStepStatus(entry.status)
-      ) {
+      if (shouldLegacyTerminalCollapseMatch(entry, normalized)) {
         fallbackIndex = idx;
         break;
       }
@@ -144,6 +147,7 @@ function pushStep(run, step) {
       steps[fallbackIndex] = {
         ...existing,
         ...normalized,
+        label: (isGenericToolLabel(normalized.label) && existing?.label) ? existing.label : normalized.label,
         details: normalized.details && normalized.details.length > 0 ? normalized.details : existing?.details,
       };
       return steps;
@@ -206,14 +210,7 @@ function pushTimelineEntry(run, entry) {
       let fallbackIndex = -1;
       for (let idx = timeline.length - 1; idx >= 0; idx -= 1) {
         const item = timeline[idx];
-        if (
-          item
-          && item.type === 'step'
-          && !item.key
-          && String(item.kind || '') === candidate.kind
-          && String(item.label || '') === candidate.label
-          && !isTerminalStepStatus(item.status)
-        ) {
+        if (item?.type === 'step' && shouldLegacyTerminalCollapseMatch(item, candidate)) {
           fallbackIndex = idx;
           break;
         }
@@ -223,6 +220,7 @@ function pushTimelineEntry(run, entry) {
         timeline[fallbackIndex] = {
           ...existing,
           ...candidate,
+          label: (isGenericToolLabel(candidate.label) && existing?.label) ? existing.label : candidate.label,
           details: candidate.details && candidate.details.length > 0 ? candidate.details : existing?.details,
         };
         return timeline;
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
index bfd55c0..b9e3749 100644
--- a/test/agent/codex-runner.test.js
+++ b/test/agent/codex-runner.test.js
@@ -184,3 +184,36 @@ test('maps event_msg agent_message commentary to chat.commentary and final_answe
   assert.equal(final.event, 'chat.final');
   assert.equal(final.payload.text, 'All done');
 });
+
+test('maps agent_message without phase to chat.delta so text is not dropped', () => {
+  const evt = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: JSON.stringify({
+      type: 'event_msg',
+      payload: {
+        type: 'agent_message',
+        message: 'Hello without phase',
+      },
+    }),
+  });
+  assert.equal(evt.event, 'chat.delta');
+  assert.equal(evt.payload.delta, 'Hello without phase');
+});
+
+test('maps response_item assistant message without phase to chat.delta', () => {
+  const evt = normalizeCodexLine({
+    runId: 'r1',
+    sessionId: 's1',
+    line: JSON.stringify({
+      type: 'response_item',
+      payload: {
+        type: 'message',
+        role: 'assistant',
+        content: [{ type: 'output_text', text: 'Final text without phase' }],
+      },
+    }),
+  });
+  assert.equal(evt.event, 'chat.delta');
+  assert.equal(evt.payload.delta, 'Final text without phase');
+});
diff --git a/test/agent/session-ui-state.test.js b/test/agent/session-ui-state.test.js
index a5781bb..40f6c73 100644
--- a/test/agent/session-ui-state.test.js
+++ b/test/agent/session-ui-state.test.js
@@ -119,6 +119,35 @@ test('messages.loaded collapses legacy running+done duplicate tool entries on re
   assert.equal(timeline[1]?.type, 'text');
 });
 
+test('messages.loaded collapses generic terminal tool row onto latest running row', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {},
+  };
+
+  const next = reduceState(state, {
+    type: 'messages.loaded',
+    sessionId: 's1',
+    messages: [{
+      role: 'assistant',
+      text: 'Done',
+      runId: 'run_4',
+      timeline: [
+        { type: 'step', kind: 'tool', status: 'running', label: "/bin/zsh -lc 'cat skills/browserforce/SKILL.md'" },
+        { type: 'step', kind: 'tool', status: 'done', label: 'Tool call completed' },
+        { type: 'text', text: 'Done' },
+      ],
+    }],
+  });
+
+  const timeline = next.runs.run_4?.timeline || [];
+  assert.equal(timeline.length, 2);
+  assert.equal(timeline[0]?.status, 'done');
+  assert.match(timeline[0]?.label || '', /cat skills\/browserforce\/SKILL\.md/);
+});
+
 test('session.metadata.loaded hydrates persisted codex usage for reopened session', () => {
   const state = {
     activeSessionId: 's1',
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index be9f34a..3d7f0aa 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -103,6 +103,30 @@ test('tool.final replaces matching in-flight tool step at original timeline posi
   assert.equal((s5.runs.r1.steps || []).filter((item) => item?.key === 'tool:call_1').length, 1);
 });
 
+test('tool.final with generic label collapses latest in-flight non-keyed tool step', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, {
+    event: 'tool.started',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: { tool: 'execute', command: "/bin/zsh -lc 'rg --files'" },
+  });
+  const s3 = applyEvent(s2, {
+    event: 'tool.final',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: {},
+  });
+
+  const steps = s3.runs.r1.steps || [];
+  const timeline = s3.runs.r1.timeline || [];
+  assert.equal(steps.length, 1);
+  assert.equal(steps[0]?.status, 'done');
+  assert.match(steps[0]?.label || '', /rg --files/);
+  assert.equal(timeline.filter((item) => item.type === 'step').length, 1);
+  assert.equal(timeline.find((item) => item.type === 'step')?.status, 'done');
+});
+
 test('chat.commentary text stays inline but does not pollute final assistant message text', () => {
   const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
   const s2 = applyEvent(s1, {

From da880cfbed5f39925c383fe6709e28fbe71216bd Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 14:58:40 +0530
Subject: [PATCH 154/192] agent-panel: restore execute step details for
 collapsible tool rows

---
 agent/src/chatd.js             | 108 ++++++++++++++++++++++++++++++++-
 extension/agent-panel-state.js |   8 ++-
 test/agent/chatd-api.test.js   |  65 ++++++++++++++++++++
 test/agent/sse-events.test.js  |  62 +++++++++++++++++++
 4 files changed, 239 insertions(+), 4 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index c279f44..5d2a6bb 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -336,6 +336,50 @@ function detailsEqual(a, b) {
   return JSON.stringify(a || []) === JSON.stringify(b || []);
 }
 
+function normalizeStepDetails(details, label = '') {
+  const lines = [];
+  const pushLine = (value) => {
+    const parts = String(value || '')
+      .split('\n')
+      .map((part) => part.trim())
+      .filter(Boolean);
+    for (const rawPart of parts) {
+      const part = rawPart.replace(/^[-*]\s+/, '').trim();
+      if (!part) continue;
+      if (part === label) continue;
+      if (lines.includes(part)) continue;
+      lines.push(part.length > 220 ? `${part.slice(0, 217)}...` : part);
+      if (lines.length >= 8) return;
+    }
+  };
+  const visit = (value) => {
+    if (value == null) return;
+    if (Array.isArray(value)) {
+      for (const item of value) {
+        if (lines.length >= 8) return;
+        visit(item);
+      }
+      return;
+    }
+    if (typeof value === 'object') {
+      visit(value.text);
+      visit(value.message);
+      visit(value.output);
+      visit(value.command);
+      visit(value.cmd);
+      visit(value.code);
+      visit(value.arguments);
+      visit(value.path);
+      visit(value.query);
+      visit(value.pattern);
+      return;
+    }
+    pushLine(value);
+  };
+  visit(details);
+  return lines;
+}
+
 function normalizeRunStep(step) {
   if (!step || typeof step !== 'object') return null;
   const label = trimStepLabel(step.label);
@@ -545,6 +589,27 @@ function stepLabelForToolEvent(evt) {
   return '';
 }
 
+function stepDetailsForToolEvent(evt, label) {
+  const payload = evt?.payload || {};
+  return normalizeStepDetails([
+    payload.details,
+    payload.text,
+    payload.message,
+    payload.delta,
+    payload.command,
+    payload.cmd,
+    payload.code,
+    payload.arguments,
+    payload.path,
+    payload.query,
+    payload.pattern,
+    payload.args,
+    payload.paths,
+    payload.items,
+    payload.item,
+  ], label);
+}
+
 function stepKeyForToolEvent(evt) {
   const payload = evt?.payload || {};
   const key = firstString([
@@ -624,15 +689,49 @@ function stepKeyForRunEvent(evt) {
   return key.startsWith('tool:') ? key : `tool:${key}`;
 }
 
+function stepDetailsForRunEvent(evt, label) {
+  const payload = evt?.payload || {};
+  const item = payload?.item && typeof payload.item === 'object' ? payload.item : {};
+  return normalizeStepDetails([
+    payload.details,
+    payload.text,
+    payload.message,
+    payload.delta,
+    payload.command,
+    payload.path,
+    payload.query,
+    payload.pattern,
+    payload.args,
+    payload.paths,
+    payload.items,
+    payload.item,
+    item?.details,
+    item?.text,
+    item?.message,
+    item?.summary,
+    item?.command,
+    item?.path,
+    item?.query,
+    item?.pattern,
+    item?.args,
+    item?.paths,
+  ], label);
+}
+
 function trackRunStep(run, evt) {
   if (!run || !evt?.event) return;
 
   if (evt.event === 'tool.started' || evt.event === 'tool.delta' || evt.event === 'tool.final') {
+    const label = stepLabelForToolEvent(evt);
+    const details = stepDetailsForToolEvent(evt, label);
     const step = {
-      kind: evt.event === 'tool.delta' ? 'reasoning' : 'tool',
+      kind: (evt.event === 'tool.delta' && String(evt?.payload?.type || '').toLowerCase() === 'reasoning')
+        ? 'reasoning'
+        : 'tool',
       status: evt.event === 'tool.final' ? 'done' : 'running',
-      label: stepLabelForToolEvent(evt),
+      label,
       ...(stepKeyForToolEvent(evt) ? { key: stepKeyForToolEvent(evt) } : {}),
+      ...(details.length > 0 ? { details } : {}),
     };
     pushRunStep(run, step);
     pushRunTimelineEntry(run, { type: 'step', ...step });
@@ -640,11 +739,14 @@ function trackRunStep(run, evt) {
   }
 
   if (evt.event === 'run.event') {
+    const label = stepLabelForRunEvent(evt);
+    const details = stepDetailsForRunEvent(evt, label);
     const step = {
       kind: stepKindForRunEvent(evt),
       status: stepStatusForRunEvent(evt),
-      label: stepLabelForRunEvent(evt),
+      label,
       ...(stepKeyForRunEvent(evt) ? { key: stepKeyForRunEvent(evt) } : {}),
+      ...(details.length > 0 ? { details } : {}),
     };
     pushRunStep(run, step);
     pushRunTimelineEntry(run, { type: 'step', ...step });
diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 490e7d9..781da48 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -85,6 +85,9 @@ function normalizeStepDetails(details, label = '') {
       visit(value.message);
       visit(value.output);
       visit(value.command);
+      visit(value.cmd);
+      visit(value.code);
+      visit(value.arguments);
       visit(value.path);
       visit(value.query);
       visit(value.pattern);
@@ -366,6 +369,9 @@ function stepDetailsForToolEvent(evt, label) {
     payload.message,
     payload.delta,
     payload.command,
+    payload.cmd,
+    payload.code,
+    payload.arguments,
     payload.path,
     payload.query,
     payload.pattern,
@@ -713,7 +719,7 @@ export function applyEvent(state = initialState, evt = {}) {
     const status = evt.event === 'tool.final'
       ? 'done'
       : 'running';
-    const kind = evt.event === 'tool.delta'
+    const kind = (evt.event === 'tool.delta' && String(evt?.payload?.type || '').toLowerCase() === 'reasoning')
       ? 'reasoning'
       : 'tool';
     const label = stepLabelForToolEvent(evt);
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 3997364..27d56cc 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -280,6 +280,71 @@ test('POST /v1/runs persists run steps so reopened sessions can render them', as
   }
 });
 
+test('POST /v1/runs persists execute tool details for collapsible timeline rows', async () => {
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    runExecutor: ({ runId, sessionId, onEvent, onExit }) => {
+      setTimeout(() => {
+        onEvent({
+          event: 'tool.started',
+          runId,
+          sessionId,
+          payload: {
+            name: 'execute',
+            args: {
+              code: "const tree = await snapshot();\nreturn tree;",
+            },
+          },
+        });
+      }, 5);
+      setTimeout(() => {
+        onEvent({ event: 'tool.final', runId, sessionId, payload: { name: 'execute' } });
+      }, 10);
+      setTimeout(() => {
+        onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'done' } });
+      }, 15);
+      setTimeout(() => onExit({ code: 0 }), 20);
+      return { abort() {} };
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Execute details' }),
+    }).then((res) => res.json());
+
+    const runRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'hi' }),
+    });
+    assert.equal(runRes.status, 202);
+
+    await new Promise((resolve) => setTimeout(resolve, 80));
+
+    const messagesBody = await fetch(
+      `${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}/messages`,
+      { headers: { authorization: `Bearer ${daemon.token}` } },
+    ).then((res) => res.json());
+    const assistant = (messagesBody.messages || []).at(-1);
+    const executeStep = (assistant?.timeline || []).find((item) => item?.type === 'step' && /execute/i.test(item?.label || ''));
+
+    assert.equal(Array.isArray(executeStep?.details), true);
+    assert.equal(executeStep.details.some((line) => /snapshot/.test(line)), true);
+  } finally {
+    await daemon.stop();
+  }
+});
+
 test('POST /v1/runs abort persists partial assistant output for session reloads', async () => {
   const daemon = await startChatd({
     port: 0,
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index 3d7f0aa..0426840 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -103,6 +103,47 @@ test('tool.final replaces matching in-flight tool step at original timeline posi
   assert.equal((s5.runs.r1.steps || []).filter((item) => item?.key === 'tool:call_1').length, 1);
 });
 
+test('stderr tool lifecycle collapses into one keyed step with expandable details', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, {
+    event: 'tool.started',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: { tool: 'stderr', stepKey: 'tool:stderr', title: 'Codex stderr' },
+  });
+  const s3 = applyEvent(s2, {
+    event: 'tool.delta',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: {
+      tool: 'stderr',
+      type: 'stderr',
+      stepKey: 'tool:stderr',
+      message: 'Codex stderr (2 lines)',
+      details: ['warn line 1', 'warn line 2'],
+    },
+  });
+  const s4 = applyEvent(s3, {
+    event: 'tool.final',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: {
+      tool: 'stderr',
+      stepKey: 'tool:stderr',
+      title: 'Codex stderr (2 lines)',
+      details: ['warn line 1', 'warn line 2'],
+    },
+  });
+
+  const steps = s4.runs.r1.steps || [];
+  const stderrStep = steps.find((item) => item?.key === 'tool:stderr');
+  assert.equal(steps.filter((item) => item?.key === 'tool:stderr').length, 1);
+  assert.equal(stderrStep?.kind, 'tool');
+  assert.equal(stderrStep?.status, 'done');
+  assert.match(stderrStep?.label || '', /2 lines/);
+  assert.deepEqual(stderrStep?.details, ['warn line 1', 'warn line 2']);
+});
+
 test('tool.final with generic label collapses latest in-flight non-keyed tool step', () => {
   const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
   const s2 = applyEvent(s1, {
@@ -127,6 +168,27 @@ test('tool.final with generic label collapses latest in-flight non-keyed tool st
   assert.equal(timeline.find((item) => item.type === 'step')?.status, 'done');
 });
 
+test('execute tool step captures code details for collapsible rendering', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, {
+    event: 'tool.started',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: {
+      name: 'execute',
+      args: {
+        code: "const rows = await snapshot();\nreturn rows;",
+      },
+    },
+  });
+
+  const step = s2.runs.r1.steps.find((item) => /execute/i.test(item?.label || ''));
+  assert.deepEqual(step?.details, [
+    'const rows = await snapshot();',
+    'return rows;',
+  ]);
+});
+
 test('chat.commentary text stays inline but does not pollute final assistant message text', () => {
   const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
   const s2 = applyEvent(s1, {

From 2706f0b92f0476767d2fd55f8d3401b43f4b9436 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 15:00:16 +0530
Subject: [PATCH 155/192] feat(agent): summarize codex stderr as collapsible
 timeline step

---
 agent/src/codex-runner.js       | 114 +++++++++++++++++++++++++++++++-
 test/agent/codex-runner.test.js |  64 +++++++++++++++++-
 2 files changed, 176 insertions(+), 2 deletions(-)

diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
index e7d9bfe..2f61a20 100644
--- a/agent/src/codex-runner.js
+++ b/agent/src/codex-runner.js
@@ -20,6 +20,66 @@ function safeParse(line) {
   }
 }
 
+function braceDelta(text) {
+  const source = String(text || '');
+  let delta = 0;
+  for (const ch of source) {
+    if (ch === '{') delta += 1;
+    else if (ch === '}') delta -= 1;
+  }
+  return delta;
+}
+
+export function shouldSuppressCodexStderrLine(line, state = {}) {
+  const text = String(line || '');
+  if (!text.trim()) return false;
+
+  if (!Number.isInteger(state.authJsonDepth) || state.authJsonDepth < 0) {
+    state.authJsonDepth = 0;
+  }
+
+  if (state.authJsonDepth > 0) {
+    state.authJsonDepth += braceDelta(text);
+    if (state.authJsonDepth < 0) state.authJsonDepth = 0;
+    return true;
+  }
+
+  const lower = text.toLowerCase();
+  const isAuthRefreshLine = lower.includes('codex_core::auth: failed to refresh token');
+  if (!isAuthRefreshLine) return false;
+
+  const startsJsonBlock = lower.includes('401 unauthorized') && text.includes('{');
+  if (startsJsonBlock) {
+    state.authJsonDepth = Math.max(0, braceDelta(text));
+    return true;
+  }
+
+  return (
+    lower.includes('refresh token was already used')
+    || lower.includes('already been used to generate a new access token')
+    || lower.includes('refresh_token_reused')
+  );
+}
+
+export function buildCodexStderrStepPayload({ count, lines } = {}) {
+  const normalizedLines = Array.isArray(lines)
+    ? lines
+      .map((line) => String(line || '').trim())
+      .filter(Boolean)
+      .slice(-8)
+    : [];
+  const numericCount = Number.isInteger(count) && count > 0
+    ? count
+    : normalizedLines.length;
+  const suffix = numericCount === 1 ? 'line' : 'lines';
+  return {
+    stream: 'stderr',
+    type: 'stderr',
+    message: `Codex stderr (${numericCount} ${suffix})`,
+    details: normalizedLines,
+  };
+}
+
 function toCount(value) {
   const parsed = Number(value);
   if (!Number.isFinite(parsed) || parsed < 0) return null;
@@ -424,6 +484,12 @@ export function startCodexRun({
   });
 
   const stderrChunks = [];
+  const stderrFilterState = {};
+  const stderrStepState = {
+    started: false,
+    count: 0,
+    lines: [],
+  };
 
   const stdoutLines = readline.createInterface({ input: child.stdout });
   stdoutLines.on('line', (line) => {
@@ -441,11 +507,40 @@ export function startCodexRun({
     if (!line) return;
     stderrChunks.push(String(line));
     if (stderrChunks.length > 200) stderrChunks.shift();
+    if (shouldSuppressCodexStderrLine(line, stderrFilterState)) return;
+
+    if (!stderrStepState.started) {
+      stderrStepState.started = true;
+      onEvent?.(envelope({
+        event: 'tool.started',
+        runId,
+        sessionId,
+        payload: {
+          stream: 'stderr',
+          tool: 'stderr',
+          stepKey: 'tool:stderr',
+          title: 'Codex stderr',
+        },
+      }));
+    }
+
+    stderrStepState.count += 1;
+    stderrStepState.lines.push(String(line));
+    if (stderrStepState.lines.length > 32) stderrStepState.lines.shift();
+    const payload = buildCodexStderrStepPayload({
+      count: stderrStepState.count,
+      lines: stderrStepState.lines,
+    });
+
     onEvent?.(envelope({
       event: 'tool.delta',
       runId,
       sessionId,
-      payload: { stream: 'stderr', text: line },
+      payload: {
+        ...payload,
+        tool: 'stderr',
+        stepKey: 'tool:stderr',
+      },
     }));
   });
 
@@ -454,6 +549,23 @@ export function startCodexRun({
   });
 
   child.on('close', (code, signal) => {
+    if (stderrStepState.started) {
+      const payload = buildCodexStderrStepPayload({
+        count: stderrStepState.count,
+        lines: stderrStepState.lines,
+      });
+      onEvent?.(envelope({
+        event: 'tool.final',
+        runId,
+        sessionId,
+        payload: {
+          ...payload,
+          tool: 'stderr',
+          stepKey: 'tool:stderr',
+          title: payload.message,
+        },
+      }));
+    }
     onExit?.({ code, signal, stderr: stderrChunks.join('\n') });
   });
 
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
index b9e3749..20a6d8f 100644
--- a/test/agent/codex-runner.test.js
+++ b/test/agent/codex-runner.test.js
@@ -1,6 +1,11 @@
 import test from 'node:test';
 import assert from 'node:assert/strict';
-import { buildCodexExecArgs, normalizeCodexLine } from '../../agent/src/codex-runner.js';
+import {
+  buildCodexStderrStepPayload,
+  buildCodexExecArgs,
+  normalizeCodexLine,
+  shouldSuppressCodexStderrLine,
+} from '../../agent/src/codex-runner.js';
 
 test('maps text delta line to chat.delta event', () => {
   const evt = normalizeCodexLine({
@@ -73,6 +78,63 @@ test('maps transient codex error line to non-fatal tool event', () => {
   assert.match(evt.payload.message, /Reconnecting/);
 });
 
+test('suppresses refresh_token_reused auth stderr block', () => {
+  const state = {};
+  assert.equal(
+    shouldSuppressCodexStderrLine(
+      '2026-03-04T08:56:12.804579Z ERROR codex_core::auth: Failed to refresh token: 401 Unauthorized: {',
+      state,
+    ),
+    true,
+  );
+  assert.equal(shouldSuppressCodexStderrLine('  "error": {', state), true);
+  assert.equal(shouldSuppressCodexStderrLine('    "code": "refresh_token_reused"', state), true);
+  assert.equal(shouldSuppressCodexStderrLine('  }', state), true);
+  assert.equal(shouldSuppressCodexStderrLine('}', state), true);
+  assert.equal(
+    shouldSuppressCodexStderrLine(
+      '2026-03-04T08:56:12.804795Z ERROR codex_core::auth: Failed to refresh token: Your access token could not be refreshed because your refresh token was already used. Please log out and sign in again.',
+      state,
+    ),
+    true,
+  );
+});
+
+test('does not suppress non-auth stderr lines', () => {
+  const state = {};
+  assert.equal(
+    shouldSuppressCodexStderrLine(
+      '2026-03-04T08:56:14.841913Z  WARN codex_core::shell_snapshot: Failed to delete shell snapshot',
+      state,
+    ),
+    false,
+  );
+  assert.equal(
+    shouldSuppressCodexStderrLine(
+      '2026-03-04T08:56:23.764711Z ERROR rmcp::transport::async_rw: Error reading from stream',
+      state,
+    ),
+    false,
+  );
+});
+
+test('builds human-readable stderr summary payload with singular/plural label', () => {
+  const single = buildCodexStderrStepPayload({ count: 1, lines: ['first warning'] });
+  assert.equal(single.message, 'Codex stderr (1 line)');
+  assert.deepEqual(single.details, ['first warning']);
+
+  const plural = buildCodexStderrStepPayload({ count: 2, lines: ['first', 'second'] });
+  assert.equal(plural.message, 'Codex stderr (2 lines)');
+  assert.deepEqual(plural.details, ['first', 'second']);
+});
+
+test('stderr summary payload keeps only latest detail lines', () => {
+  const lines = Array.from({ length: 11 }, (_, i) => `line-${i + 1}`);
+  const payload = buildCodexStderrStepPayload({ count: lines.length, lines });
+  assert.equal(payload.message, 'Codex stderr (11 lines)');
+  assert.deepEqual(payload.details, lines.slice(-8));
+});
+
 test('maps codex turn.completed usage into run.usage event', () => {
   const line = JSON.stringify({
     type: 'turn.completed',

From 4aecb35084bd0cf04d452168533e358b090daeba Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 15:10:35 +0530
Subject: [PATCH 156/192] agent-panel: clean tool command labels and keep
 execute details

---
 agent/src/chatd.js                  | 28 +++++++++++++++++++--
 agent/src/codex-runner.js           | 31 +++++++++++++++++------
 agent/src/session-store.js          | 21 ++++++++++++++--
 extension/agent-panel-state.js      | 39 +++++++++++++++++++++++++++--
 test/agent/codex-runner.test.js     |  2 +-
 test/agent/session-ui-state.test.js | 32 +++++++++++++++++++++++
 test/agent/sse-events.test.js       | 28 +++++++++++++++++++++
 7 files changed, 167 insertions(+), 14 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 5d2a6bb..3696876 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -294,8 +294,25 @@ function firstString(values) {
   return '';
 }
 
+const SHELL_LC_WRAPPER_RE = /^(?:\/usr\/bin\/env\s+)?(?:\/bin\/)?(?:zsh|bash|sh)\s+-lc\s+([\s\S]+)$/i;
+
+function unwrapShellLcCommand(value) {
+  const text = String(value || '').trim();
+  if (!text) return '';
+  const match = text.match(SHELL_LC_WRAPPER_RE);
+  if (!match) return text;
+  let command = String(match[1] || '').trim();
+  if (!command) return text;
+  if (command.length >= 2 && command.startsWith("'") && command.endsWith("'")) {
+    command = command.slice(1, -1).replace(/'"'"'/g, "'");
+  } else if (command.length >= 2 && command.startsWith('"') && command.endsWith('"')) {
+    command = command.slice(1, -1).replace(/\\"/g, '"').replace(/\\\\/g, '\\');
+  }
+  return command.trim() || text;
+}
+
 function trimStepLabel(label) {
-  const text = String(label || '').trim();
+  const text = unwrapShellLcCommand(label);
   if (!text) return '';
   return text.length > 160 ? `${text.slice(0, 157)}...` : text;
 }
@@ -339,7 +356,7 @@ function detailsEqual(a, b) {
 function normalizeStepDetails(details, label = '') {
   const lines = [];
   const pushLine = (value) => {
-    const parts = String(value || '')
+    const parts = unwrapShellLcCommand(value)
       .split('\n')
       .map((part) => part.trim())
       .filter(Boolean);
@@ -368,6 +385,11 @@ function normalizeStepDetails(details, label = '') {
       visit(value.command);
       visit(value.cmd);
       visit(value.code);
+      visit(value.input);
+      visit(value.args);
+      visit(value.parameters);
+      visit(value.params);
+      visit(value.payload);
       visit(value.arguments);
       visit(value.path);
       visit(value.query);
@@ -715,6 +737,8 @@ function stepDetailsForRunEvent(evt, label) {
     item?.pattern,
     item?.args,
     item?.paths,
+    item?.input,
+    item?.arguments,
   ], label);
 }
 
diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
index 2f61a20..828fb61 100644
--- a/agent/src/codex-runner.js
+++ b/agent/src/codex-runner.js
@@ -145,10 +145,27 @@ function normalizeToolIdentity(payload = {}, fallbackCallId = '') {
   };
 }
 
-function quoteForShell(value) {
-  const source = String(value || '');
-  if (!source) return '';
-  return source.replace(/'/g, `'\"'\"'`);
+const SHELL_LC_WRAPPER_RE = /^(?:\/usr\/bin\/env\s+)?(?:\/bin\/)?(?:zsh|bash|sh)\s+-lc\s+([\s\S]+)$/i;
+
+function unwrapShellLcCommand(value) {
+  const text = String(value || '').trim();
+  if (!text) return '';
+  const match = text.match(SHELL_LC_WRAPPER_RE);
+  if (!match) return text;
+  let command = String(match[1] || '').trim();
+  if (!command) return text;
+  if (command.length >= 2 && command.startsWith("'") && command.endsWith("'")) {
+    command = command.slice(1, -1).replace(/'"'"'/g, "'");
+  } else if (command.length >= 2 && command.startsWith('"') && command.endsWith('"')) {
+    command = command.slice(1, -1).replace(/\\"/g, '"').replace(/\\\\/g, '\\');
+  }
+  return command.trim() || text;
+}
+
+function trimCommandLabel(value) {
+  const command = unwrapShellLcCommand(value);
+  if (!command) return '';
+  return command.length > 160 ? `${command.slice(0, 157)}...` : command;
 }
 
 function toolCommandLabel({ name, parsedArgs, rawArgs }) {
@@ -158,7 +175,7 @@ function toolCommandLabel({ name, parsedArgs, rawArgs }) {
       parsedArgs.command,
     ]);
     if (cmd) {
-      return `/bin/zsh -lc '${quoteForShell(cmd)}'`;
+      return trimCommandLabel(cmd);
     }
   }
 
@@ -166,12 +183,12 @@ function toolCommandLabel({ name, parsedArgs, rawArgs }) {
     const parsed = safeParseJson(rawArgs);
     if (parsed && typeof parsed === 'object') {
       const cmd = firstString([parsed.cmd, parsed.command]);
-      if (cmd) return `/bin/zsh -lc '${quoteForShell(cmd)}'`;
+      if (cmd) return trimCommandLabel(cmd);
     }
   }
 
   if (name === 'exec_command' && typeof rawArgs === 'string' && rawArgs.trim()) {
-    return rawArgs.trim().length > 160 ? `${rawArgs.trim().slice(0, 157)}...` : rawArgs.trim();
+    return trimCommandLabel(rawArgs);
   }
   return '';
 }
diff --git a/agent/src/session-store.js b/agent/src/session-store.js
index 567f7e3..eb58851 100644
--- a/agent/src/session-store.js
+++ b/agent/src/session-store.js
@@ -50,9 +50,26 @@ function normalizeRunId(runId) {
   return normalized;
 }
 
+const SHELL_LC_WRAPPER_RE = /^(?:\/usr\/bin\/env\s+)?(?:\/bin\/)?(?:zsh|bash|sh)\s+-lc\s+([\s\S]+)$/i;
+
+function unwrapShellLcCommand(value) {
+  const text = String(value || '').trim();
+  if (!text) return '';
+  const match = text.match(SHELL_LC_WRAPPER_RE);
+  if (!match) return text;
+  let command = String(match[1] || '').trim();
+  if (!command) return text;
+  if (command.length >= 2 && command.startsWith("'") && command.endsWith("'")) {
+    command = command.slice(1, -1).replace(/'"'"'/g, "'");
+  } else if (command.length >= 2 && command.startsWith('"') && command.endsWith('"')) {
+    command = command.slice(1, -1).replace(/\\"/g, '"').replace(/\\\\/g, '\\');
+  }
+  return command.trim() || text;
+}
+
 function normalizeStep(step) {
   if (!step || typeof step !== 'object') return null;
-  const label = String(step.label || '').trim();
+  const label = unwrapShellLcCommand(step.label);
   if (!label) return null;
   const kind = String(step.kind || '').trim() || 'reasoning';
   const normalizedStatus = String(step.status || '').trim().toLowerCase();
@@ -62,7 +79,7 @@ function normalizeStep(step) {
   const key = String(step.key || '').trim();
   const details = Array.isArray(step.details)
     ? step.details
-      .map((item) => String(item || '').trim())
+      .map((item) => unwrapShellLcCommand(item))
       .filter(Boolean)
       .slice(0, 8)
     : [];
diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 781da48..c13c0f5 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -13,8 +13,25 @@ function firstString(values) {
   return '';
 }
 
+const SHELL_LC_WRAPPER_RE = /^(?:\/usr\/bin\/env\s+)?(?:\/bin\/)?(?:zsh|bash|sh)\s+-lc\s+([\s\S]+)$/i;
+
+function unwrapShellLcCommand(value) {
+  const text = String(value || '').trim();
+  if (!text) return '';
+  const match = text.match(SHELL_LC_WRAPPER_RE);
+  if (!match) return text;
+  let command = String(match[1] || '').trim();
+  if (!command) return text;
+  if (command.length >= 2 && command.startsWith("'") && command.endsWith("'")) {
+    command = command.slice(1, -1).replace(/'"'"'/g, "'");
+  } else if (command.length >= 2 && command.startsWith('"') && command.endsWith('"')) {
+    command = command.slice(1, -1).replace(/\\"/g, '"').replace(/\\\\/g, '\\');
+  }
+  return command.trim() || text;
+}
+
 function trimStepLabel(label) {
-  const text = String(label || '').trim();
+  const text = unwrapShellLcCommand(label);
   if (!text) return '';
   return text.length > 160 ? `${text.slice(0, 157)}...` : text;
 }
@@ -58,7 +75,7 @@ function detailsEqual(a, b) {
 function normalizeStepDetails(details, label = '') {
   const lines = [];
   const pushLine = (value) => {
-    const line = String(value || '')
+    const line = unwrapShellLcCommand(value)
       .split('\n')
       .map((part) => part.trim())
       .filter(Boolean);
@@ -87,6 +104,11 @@ function normalizeStepDetails(details, label = '') {
       visit(value.command);
       visit(value.cmd);
       visit(value.code);
+      visit(value.input);
+      visit(value.args);
+      visit(value.parameters);
+      visit(value.params);
+      visit(value.payload);
       visit(value.arguments);
       visit(value.path);
       visit(value.query);
@@ -448,6 +470,7 @@ function stepKeyForRunEvent(evt) {
 
 function stepDetailsForRunEvent(evt, label) {
   const payload = evt?.payload || {};
+  const item = payload?.item && typeof payload.item === 'object' ? payload.item : {};
   return normalizeStepDetails([
     payload.details,
     payload.text,
@@ -461,6 +484,18 @@ function stepDetailsForRunEvent(evt, label) {
     payload.paths,
     payload.items,
     payload.item,
+    item?.details,
+    item?.text,
+    item?.message,
+    item?.summary,
+    item?.command,
+    item?.path,
+    item?.query,
+    item?.pattern,
+    item?.args,
+    item?.paths,
+    item?.input,
+    item?.arguments,
   ], label);
 }
 
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
index 20a6d8f..3ec4166 100644
--- a/test/agent/codex-runner.test.js
+++ b/test/agent/codex-runner.test.js
@@ -209,7 +209,7 @@ test('maps response_item function_call/function_call_output into keyed tool life
 
   assert.equal(start.event, 'tool.started');
   assert.equal(start.payload.callId, 'call_123');
-  assert.equal(start.payload.command, "/bin/zsh -lc 'rg --files'");
+  assert.equal(start.payload.command, 'rg --files');
   assert.equal(done.event, 'tool.final');
   assert.equal(done.payload.callId, 'call_123');
   assert.equal(done.payload.stepKey, 'tool:call_123');
diff --git a/test/agent/session-ui-state.test.js b/test/agent/session-ui-state.test.js
index 40f6c73..c80dbdf 100644
--- a/test/agent/session-ui-state.test.js
+++ b/test/agent/session-ui-state.test.js
@@ -148,6 +148,38 @@ test('messages.loaded collapses generic terminal tool row onto latest running ro
   assert.match(timeline[0]?.label || '', /cat skills\/browserforce\/SKILL\.md/);
 });
 
+test('messages.loaded strips shell wrapper prefixes from tool labels and details', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {},
+  };
+
+  const next = reduceState(state, {
+    type: 'messages.loaded',
+    sessionId: 's1',
+    messages: [{
+      role: 'assistant',
+      text: 'Done',
+      runId: 'run_5',
+      timeline: [{
+        type: 'step',
+        kind: 'tool',
+        status: 'done',
+        label: "/bin/zsh -lc \"sed -n '1,220p' AGENTS.local.md\"",
+        details: [
+          "/bin/zsh -lc 'rg --files'",
+        ],
+      }],
+    }],
+  });
+
+  const step = next.runs.run_5?.timeline?.[0];
+  assert.equal(step?.label, "sed -n '1,220p' AGENTS.local.md");
+  assert.deepEqual(step?.details, ['rg --files']);
+});
+
 test('session.metadata.loaded hydrates persisted codex usage for reopened session', () => {
   const state = {
     activeSessionId: 's1',
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index 0426840..e2596aa 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -272,6 +272,34 @@ test('run.event captures detail lines for collapsible tool-call rendering', () =
   assert.deepEqual(lastTimeline?.details, lastStep?.details);
 });
 
+test('run.event extracts execute code from nested item input for collapsible details', () => {
+  const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
+  const s2 = applyEvent(s1, {
+    event: 'run.event',
+    runId: 'r1',
+    sessionId: 's1',
+    payload: {
+      type: 'item.completed',
+      item: {
+        id: 'item_2',
+        type: 'custom_tool_call',
+        name: 'execute',
+        status: 'completed',
+        input: {
+          code: "const rows = await snapshot();\nreturn rows;",
+        },
+      },
+    },
+  });
+
+  const step = (s2.runs.r1.steps || []).find((item) => item?.key === 'tool:item_2');
+  assert.equal(step?.label, 'execute');
+  assert.deepEqual(step?.details, [
+    'const rows = await snapshot();',
+    'return rows;',
+  ]);
+});
+
 test('run.usage stores normalized usage for run and session', () => {
   const s1 = applyEvent(baseState, { event: 'run.started', runId: 'r1', sessionId: 's1', payload: {} });
   const s2 = applyEvent(s1, {

From 1cae1553e52b2511f6e64f20ea09638a0aba4088 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 15:11:54 +0530
Subject: [PATCH 157/192] agent-panel: set composer textarea font size to 12px

---
 extension/agent-panel.css | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index a9a774a..a7074d7 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -646,7 +646,7 @@ body {
   background: transparent;
   border: 0;
   outline: none;
-  font-size: 14px;
+  font-size: 12px;
   font-family: inherit;
   color: var(--text);
   line-height: 1.35;

From 493d26d0d2bada7a2ccc64f069ac1e7250cc9971 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 15:35:58 +0530
Subject: [PATCH 158/192] popup: close on open-agent click and replace
 auto-mode border with bottom note

---
 extension/popup.css               | 14 +++++++++-----
 extension/popup.html              |  2 ++
 extension/popup.js                | 14 ++++++++------
 test/agent/popup-contract.test.js | 14 ++++++++++++++
 4 files changed, 33 insertions(+), 11 deletions(-)

diff --git a/extension/popup.css b/extension/popup.css
index 62bcc57..33eb84f 100644
--- a/extension/popup.css
+++ b/extension/popup.css
@@ -45,11 +45,6 @@ body {
   padding: 16px;
 }
 
-.bf-popup.auto-mode {
-  border: 2px dotted var(--bf-accent);
-  border-radius: 10px;
-}
-
 header {
   display: flex;
   align-items: center;
@@ -412,3 +407,12 @@ textarea {
 textarea:focus {
   border-color: var(--bf-accent);
 }
+
+.auto-mode-note {
+  margin-top: 10px;
+  padding-top: 8px;
+  border-top: 1px solid var(--bf-border-soft);
+  font-size: 11px;
+  color: var(--bf-text-subtle);
+  line-height: 1.35;
+}
diff --git a/extension/popup.html b/extension/popup.html
index 703efd6..c10f1d9 100644
--- a/extension/popup.html
+++ b/extension/popup.html
@@ -123,6 +123,8 @@ <h1>BrowserForce</h1>
         <textarea id="bf-instructions" rows="4" placeholder="Custom instructions for the AI agent..."></textarea>
       </section>
     </div>
+
+    <p id="bf-auto-mode-note" class="auto-mode-note" hidden>Auto mode is on. The agent can automatically create tabs.</p>
   </div>
   <script src="popup.js"></script>
 </body>
diff --git a/extension/popup.js b/extension/popup.js
index 76d9905..1bb23c4 100644
--- a/extension/popup.js
+++ b/extension/popup.js
@@ -14,7 +14,7 @@ const RESTRICTION_LINES = {
 const statusEl = document.getElementById('bf-status');
 const statusTextEl = document.getElementById('bf-status-text');
 const mcpClientsEl = document.getElementById('bf-mcp-clients');
-const popupEl = document.querySelector('.bf-popup');
+const autoModeNoteEl = document.getElementById('bf-auto-mode-note');
 const relayUrlInput = document.getElementById('bf-relay-url');
 const saveUrlBtn = document.getElementById('bf-save-url');
 const tabCountEl = document.getElementById('bf-tab-count');
@@ -63,7 +63,7 @@ chrome.storage.local.get(SETTINGS_KEYS, (s) => {
   noNewTabsCb.checked = !!s.noNewTabs;
   readOnlyCb.checked = !!s.readOnly;
   instructionsEl.value = s.userInstructions || '';
-  setAutoModeBorder(s.mode || 'auto');
+  setAutoModeState(s.mode || 'auto');
 });
 
 // --- Save Handlers ---
@@ -108,7 +108,7 @@ saveUrlBtn.addEventListener('click', () => {
 
 modeSelect.addEventListener('change', () => {
   chrome.storage.local.set({ mode: modeSelect.value });
-  setAutoModeBorder(modeSelect.value);
+  setAutoModeState(modeSelect.value);
 });
 
 executionModeSelect.addEventListener('change', () => {
@@ -199,6 +199,7 @@ openAgentBtn.addEventListener('click', async () => {
   try {
     const [tab] = await chrome.tabs.query({ active: true, currentWindow: true });
     await chrome.sidePanel.open({ windowId: tab?.windowId });
+    window.close();
   } catch {
     openAgentBtn.textContent = 'Failed to open';
     setTimeout(() => { openAgentBtn.textContent = 'Open BrowserForce Agent'; }, 1500);
@@ -223,7 +224,7 @@ function refreshStatus() {
     setTabs(response.tabs || []);
     setAutoTimer(response.nextAutoActionSecs);
     setMcpClientCount(response.mcpClientCount);
-    setAutoModeBorder(response.mode || modeSelect.value || 'auto');
+    setAutoModeState(response.mode || modeSelect.value || 'auto');
   });
 }
 
@@ -279,8 +280,9 @@ function setMcpClientCount(count) {
   mcpClientsEl.textContent = `MCP ${safeCount}`;
 }
 
-function setAutoModeBorder(mode) {
-  popupEl.classList.toggle('auto-mode', mode === 'auto');
+function setAutoModeState(mode) {
+  if (!autoModeNoteEl) return;
+  autoModeNoteEl.hidden = mode !== 'auto';
 }
 
 function escapeHtml(str) {
diff --git a/test/agent/popup-contract.test.js b/test/agent/popup-contract.test.js
index 7530b38..dd5e075 100644
--- a/test/agent/popup-contract.test.js
+++ b/test/agent/popup-contract.test.js
@@ -4,6 +4,8 @@ import assert from 'node:assert/strict';
 
 const html = fs.readFileSync('extension/popup.html', 'utf8');
 const optionsJs = fs.readFileSync('extension/options.js', 'utf8');
+const popupJs = fs.readFileSync('extension/popup.js', 'utf8');
+const popupCss = fs.readFileSync('extension/popup.css', 'utf8');
 
 test('popup includes Open BrowserForce Agent button', () => {
   assert.match(html, /Open BrowserForce Agent/);
@@ -13,3 +15,15 @@ test('logs viewer requests include extension identity header', () => {
   assert.match(optionsJs, /chrome\?\.runtime\?\.id/);
   assert.match(optionsJs, /'x-browserforce-extension-id'/);
 });
+
+test('open agent action opens side panel and closes popup', () => {
+  assert.match(popupJs, /chrome\.sidePanel\.open\(/);
+  assert.match(popupJs, /window\.close\(\)/);
+});
+
+test('auto mode uses bottom note instead of dotted popup border', () => {
+  assert.match(html, /id="bf-auto-mode-note"/);
+  assert.match(html, /Auto mode is on\. The agent can automatically create tabs\./);
+  assert.match(popupCss, /\.auto-mode-note\s*\{/);
+  assert.equal(/\.bf-popup\.auto-mode\s*\{[\s\S]*dotted/.test(popupCss), false);
+});

From a600deeb67b3574640de35963a1ba24b9cdfa332 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 16:00:59 +0530
Subject: [PATCH 159/192] popup: make auto-mode note single-line with NOTE
 prefix and full-width bottom bars

---
 extension/popup.css               | 32 +++++++++++++++++++++++++++----
 extension/popup.html              |  2 +-
 test/agent/popup-contract.test.js |  6 +++++-
 3 files changed, 34 insertions(+), 6 deletions(-)

diff --git a/extension/popup.css b/extension/popup.css
index 33eb84f..582cbe3 100644
--- a/extension/popup.css
+++ b/extension/popup.css
@@ -409,10 +409,34 @@ textarea:focus {
 }
 
 .auto-mode-note {
-  margin-top: 10px;
-  padding-top: 8px;
-  border-top: 1px solid var(--bf-border-soft);
+  margin: 10px -16px -16px;
+  padding: 8px 16px 12px;
   font-size: 11px;
   color: var(--bf-text-subtle);
-  line-height: 1.35;
+  line-height: 1.2;
+  white-space: nowrap;
+  overflow: hidden;
+  text-overflow: ellipsis;
+  position: relative;
+  background: var(--bf-surface-soft);
+}
+
+.auto-mode-note::before,
+.auto-mode-note::after {
+  content: '';
+  position: absolute;
+  left: 0;
+  right: 0;
+}
+
+.auto-mode-note::before {
+  bottom: 2px;
+  height: 1px;
+  background: var(--bf-danger-fg);
+}
+
+.auto-mode-note::after {
+  bottom: 0;
+  height: 2px;
+  background: var(--bf-accent);
 }
diff --git a/extension/popup.html b/extension/popup.html
index c10f1d9..bd02cc3 100644
--- a/extension/popup.html
+++ b/extension/popup.html
@@ -124,7 +124,7 @@ <h1>BrowserForce</h1>
       </section>
     </div>
 
-    <p id="bf-auto-mode-note" class="auto-mode-note" hidden>Auto mode is on. The agent can automatically create tabs.</p>
+    <p id="bf-auto-mode-note" class="auto-mode-note" hidden>NOTE: Auto mode is on. Agent can create tabs automatically.</p>
   </div>
   <script src="popup.js"></script>
 </body>
diff --git a/test/agent/popup-contract.test.js b/test/agent/popup-contract.test.js
index dd5e075..21376f8 100644
--- a/test/agent/popup-contract.test.js
+++ b/test/agent/popup-contract.test.js
@@ -23,7 +23,11 @@ test('open agent action opens side panel and closes popup', () => {
 
 test('auto mode uses bottom note instead of dotted popup border', () => {
   assert.match(html, /id="bf-auto-mode-note"/);
-  assert.match(html, /Auto mode is on\. The agent can automatically create tabs\./);
+  assert.match(html, /NOTE:\s*Auto mode is on\./);
   assert.match(popupCss, /\.auto-mode-note\s*\{/);
+  assert.match(popupCss, /white-space:\s*nowrap/);
+  assert.match(popupCss, /margin:\s*10px\s+-16px\s+-16px/);
+  assert.match(popupCss, /\.auto-mode-note::before[\s\S]*background:\s*var\(--bf-danger-fg\)/);
+  assert.match(popupCss, /\.auto-mode-note::after[\s\S]*background:\s*var\(--bf-accent\)/);
   assert.equal(/\.bf-popup\.auto-mode\s*\{[\s\S]*dotted/.test(popupCss), false);
 });

From a935a80484353a6ca8d6a850a5738376b49a416e Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 16:04:30 +0530
Subject: [PATCH 160/192] popup: constrain auto-mode note text width with inner
 ellipsis

---
 extension/popup.css               | 10 ++++++++--
 extension/popup.html              |  4 +++-
 test/agent/popup-contract.test.js |  5 ++++-
 3 files changed, 15 insertions(+), 4 deletions(-)

diff --git a/extension/popup.css b/extension/popup.css
index 582cbe3..ad09547 100644
--- a/extension/popup.css
+++ b/extension/popup.css
@@ -43,6 +43,7 @@ body {
 .bf-popup {
   width: 320px;
   padding: 16px;
+  overflow-x: hidden;
 }
 
 header {
@@ -414,11 +415,16 @@ textarea:focus {
   font-size: 11px;
   color: var(--bf-text-subtle);
   line-height: 1.2;
+  position: relative;
+  background: var(--bf-surface-soft);
+}
+
+.auto-mode-note-text {
+  display: block;
+  max-width: 100%;
   white-space: nowrap;
   overflow: hidden;
   text-overflow: ellipsis;
-  position: relative;
-  background: var(--bf-surface-soft);
 }
 
 .auto-mode-note::before,
diff --git a/extension/popup.html b/extension/popup.html
index bd02cc3..93dddad 100644
--- a/extension/popup.html
+++ b/extension/popup.html
@@ -124,7 +124,9 @@ <h1>BrowserForce</h1>
       </section>
     </div>
 
-    <p id="bf-auto-mode-note" class="auto-mode-note" hidden>NOTE: Auto mode is on. Agent can create tabs automatically.</p>
+    <p id="bf-auto-mode-note" class="auto-mode-note" hidden>
+      <span class="auto-mode-note-text">NOTE: Auto mode is on. Agent can create tabs automatically.</span>
+    </p>
   </div>
   <script src="popup.js"></script>
 </body>
diff --git a/test/agent/popup-contract.test.js b/test/agent/popup-contract.test.js
index 21376f8..5bd9711 100644
--- a/test/agent/popup-contract.test.js
+++ b/test/agent/popup-contract.test.js
@@ -23,9 +23,12 @@ test('open agent action opens side panel and closes popup', () => {
 
 test('auto mode uses bottom note instead of dotted popup border', () => {
   assert.match(html, /id="bf-auto-mode-note"/);
+  assert.match(html, /class="auto-mode-note-text"/);
   assert.match(html, /NOTE:\s*Auto mode is on\./);
   assert.match(popupCss, /\.auto-mode-note\s*\{/);
-  assert.match(popupCss, /white-space:\s*nowrap/);
+  assert.match(popupCss, /\.auto-mode-note-text\s*\{/);
+  assert.match(popupCss, /\.auto-mode-note-text[\s\S]*max-width:\s*100%/);
+  assert.match(popupCss, /\.auto-mode-note-text[\s\S]*white-space:\s*nowrap/);
   assert.match(popupCss, /margin:\s*10px\s+-16px\s+-16px/);
   assert.match(popupCss, /\.auto-mode-note::before[\s\S]*background:\s*var\(--bf-danger-fg\)/);
   assert.match(popupCss, /\.auto-mode-note::after[\s\S]*background:\s*var\(--bf-accent\)/);

From 9a1a3a52e57a184b5e2b2d6d8437d0f96da36597 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 16:07:26 +0530
Subject: [PATCH 161/192] popup: fit auto-mode note text to one line without
 ellipsis

---
 extension/popup.css               |  6 +++---
 extension/popup.js                | 34 ++++++++++++++++++++++++++++++-
 test/agent/popup-contract.test.js |  6 +++++-
 3 files changed, 41 insertions(+), 5 deletions(-)

diff --git a/extension/popup.css b/extension/popup.css
index ad09547..ab57010 100644
--- a/extension/popup.css
+++ b/extension/popup.css
@@ -421,10 +421,10 @@ textarea:focus {
 
 .auto-mode-note-text {
   display: block;
-  max-width: 100%;
+  width: 100%;
+  font-size: 11px;
+  line-height: 1.2;
   white-space: nowrap;
-  overflow: hidden;
-  text-overflow: ellipsis;
 }
 
 .auto-mode-note::before,
diff --git a/extension/popup.js b/extension/popup.js
index 1bb23c4..a0d4bee 100644
--- a/extension/popup.js
+++ b/extension/popup.js
@@ -15,6 +15,7 @@ const statusEl = document.getElementById('bf-status');
 const statusTextEl = document.getElementById('bf-status-text');
 const mcpClientsEl = document.getElementById('bf-mcp-clients');
 const autoModeNoteEl = document.getElementById('bf-auto-mode-note');
+const autoModeNoteTextEl = autoModeNoteEl?.querySelector('.auto-mode-note-text') || null;
 const relayUrlInput = document.getElementById('bf-relay-url');
 const saveUrlBtn = document.getElementById('bf-save-url');
 const tabCountEl = document.getElementById('bf-tab-count');
@@ -280,9 +281,37 @@ function setMcpClientCount(count) {
   mcpClientsEl.textContent = `MCP ${safeCount}`;
 }
 
+function fitAutoModeNoteText() {
+  if (!autoModeNoteEl || !autoModeNoteTextEl || autoModeNoteEl.hidden) return;
+  const maxSizePx = 11;
+  const minSizePx = 8;
+  let size = maxSizePx;
+  autoModeNoteTextEl.style.fontSize = `${size}px`;
+  autoModeNoteTextEl.style.letterSpacing = '';
+
+  let safety = 0;
+  while (
+    size > minSizePx
+    && autoModeNoteTextEl.scrollWidth > autoModeNoteTextEl.clientWidth
+    && safety < 24
+  ) {
+    size -= 0.25;
+    autoModeNoteTextEl.style.fontSize = `${size}px`;
+    safety += 1;
+  }
+
+  if (autoModeNoteTextEl.scrollWidth > autoModeNoteTextEl.clientWidth) {
+    autoModeNoteTextEl.style.letterSpacing = '-0.02em';
+  }
+}
+
 function setAutoModeState(mode) {
   if (!autoModeNoteEl) return;
-  autoModeNoteEl.hidden = mode !== 'auto';
+  const showAutoModeNote = mode === 'auto';
+  autoModeNoteEl.hidden = !showAutoModeNote;
+  if (showAutoModeNote) {
+    window.requestAnimationFrame(fitAutoModeNoteText);
+  }
 }
 
 function escapeHtml(str) {
@@ -293,3 +322,6 @@ function escapeHtml(str) {
 
 refreshStatus();
 setInterval(refreshStatus, 1000);
+window.addEventListener('resize', () => {
+  window.requestAnimationFrame(fitAutoModeNoteText);
+});
diff --git a/test/agent/popup-contract.test.js b/test/agent/popup-contract.test.js
index 5bd9711..72cd6f5 100644
--- a/test/agent/popup-contract.test.js
+++ b/test/agent/popup-contract.test.js
@@ -27,8 +27,12 @@ test('auto mode uses bottom note instead of dotted popup border', () => {
   assert.match(html, /NOTE:\s*Auto mode is on\./);
   assert.match(popupCss, /\.auto-mode-note\s*\{/);
   assert.match(popupCss, /\.auto-mode-note-text\s*\{/);
-  assert.match(popupCss, /\.auto-mode-note-text[\s\S]*max-width:\s*100%/);
+  assert.match(popupCss, /\.auto-mode-note-text[\s\S]*width:\s*100%/);
   assert.match(popupCss, /\.auto-mode-note-text[\s\S]*white-space:\s*nowrap/);
+  assert.equal(/\.auto-mode-note-text[\s\S]*text-overflow:\s*ellipsis/.test(popupCss), false);
+  assert.match(popupJs, /function\s+fitAutoModeNoteText\(/);
+  assert.match(popupJs, /scrollWidth\s*>\s*autoModeNoteTextEl\.clientWidth/);
+  assert.match(popupJs, /requestAnimationFrame\(fitAutoModeNoteText\)/);
   assert.match(popupCss, /margin:\s*10px\s+-16px\s+-16px/);
   assert.match(popupCss, /\.auto-mode-note::before[\s\S]*background:\s*var\(--bf-danger-fg\)/);
   assert.match(popupCss, /\.auto-mode-note::after[\s\S]*background:\s*var\(--bf-accent\)/);

From d2f2b1321a05c9badbf390383bd62c9be8cf25d8 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 16:41:01 +0530
Subject: [PATCH 162/192] agent: label BrowserForce execute/reset tool steps

---
 agent/src/chatd.js             | 101 +++++++++++++++++++++++++++------
 extension/agent-panel-state.js | 101 +++++++++++++++++++++++++++------
 test/agent/chatd-api.test.js   |   1 +
 test/agent/sse-events.test.js  |   3 +-
 4 files changed, 169 insertions(+), 37 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 3696876..2cfdea4 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -294,6 +294,63 @@ function firstString(values) {
   return '';
 }
 
+function isBrowserForceExecutePayload(payload = {}) {
+  const name = String(firstString([
+    payload.name,
+    payload.toolName,
+    payload.tool,
+  ]) || '').trim().toLowerCase();
+
+  if (name === 'browserforce:execute' || name === 'mcp__browserforce__execute') return true;
+  if (name !== 'execute') return false;
+
+  const args = payload?.args && typeof payload.args === 'object' ? payload.args : null;
+  if (args && typeof args.code === 'string') return true;
+  if (typeof payload.code === 'string') return true;
+
+  const rawArgs = String(firstString([payload.arguments, payload.input]) || '').trim();
+  return /"code"\s*:/.test(rawArgs);
+}
+
+function isBrowserForceResetPayload(payload = {}) {
+  const name = String(firstString([
+    payload.name,
+    payload.toolName,
+    payload.tool,
+  ]) || '').trim().toLowerCase();
+
+  if (name === 'browserforce:reset' || name === 'mcp__browserforce__reset') return true;
+  if (name !== 'reset') return false;
+
+  const args = payload?.args && typeof payload.args === 'object' ? payload.args : null;
+  if (args && Object.keys(args).length > 0) return false;
+
+  const rawArgs = String(firstString([payload.arguments, payload.input]) || '').trim();
+  return !rawArgs || rawArgs === '{}' || rawArgs === 'null';
+}
+
+function normalizeToolLabel(label, payload = {}) {
+  const raw = String(label || '').trim();
+  if (!raw) return '';
+  const normalized = raw.toLowerCase();
+
+  if (
+    isBrowserForceExecutePayload(payload)
+    && (normalized === 'execute' || normalized === 'mcp__browserforce__execute' || normalized === 'browserforce:execute')
+  ) {
+    return 'BrowserForce:execute';
+  }
+
+  if (
+    isBrowserForceResetPayload(payload)
+    && (normalized === 'reset' || normalized === 'mcp__browserforce__reset' || normalized === 'browserforce:reset')
+  ) {
+    return 'BrowserForce:reset';
+  }
+
+  return raw;
+}
+
 const SHELL_LC_WRAPPER_RE = /^(?:\/usr\/bin\/env\s+)?(?:\/bin\/)?(?:zsh|bash|sh)\s+-lc\s+([\s\S]+)$/i;
 
 function unwrapShellLcCommand(value) {
@@ -578,26 +635,21 @@ function syncFinalTextToRunTimeline(run, finalText) {
 
 function stepLabelForToolEvent(evt) {
   const payload = evt?.payload || {};
+  const toolLabel = normalizeToolLabel(firstString([
+    payload.command,
+    payload.title,
+    payload.name,
+    payload.tool,
+    payload.toolName,
+  ]), payload);
   if (evt.event === 'tool.started') {
-    return firstString([
-      payload.command,
-      payload.title,
-      payload.name,
-      payload.tool,
-      payload.toolName,
-    ]) || 'Tool call started';
+    return toolLabel || 'Tool call started';
   }
   if (evt.event === 'tool.final') {
-    return firstString([
-      payload.command,
-      payload.title,
-      payload.name,
-      payload.tool,
-      payload.toolName,
-    ]) || 'Tool call completed';
+    return toolLabel || 'Tool call completed';
   }
   if (evt.event === 'tool.delta') {
-    return firstString([
+    return normalizeToolLabel(firstString([
       payload.text,
       payload.message,
       payload.delta,
@@ -606,7 +658,7 @@ function stepLabelForToolEvent(evt) {
       payload.tool,
       payload.toolName,
       payload.type === 'reasoning' ? 'Reasoning' : '',
-    ]) || 'Working...';
+    ]), payload) || 'Working...';
   }
   return '';
 }
@@ -675,7 +727,7 @@ function stepKindForRunEvent(evt) {
 function stepLabelForRunEvent(evt) {
   const payload = evt?.payload || {};
   const item = payload?.item && typeof payload.item === 'object' ? payload.item : {};
-  return firstString([
+  const label = firstString([
     payload.title,
     payload.message,
     payload.text,
@@ -689,7 +741,20 @@ function stepLabelForRunEvent(evt) {
     item.command,
     item.type ? humanizeToken(item.type) : '',
     payload.type ? humanizeToken(payload.type) : '',
-  ]) || 'Working...';
+  ]);
+
+  const normalized = normalizeToolLabel(label, {
+    ...payload,
+    ...item,
+    name: firstString([item.name, payload.name]),
+    toolName: firstString([item.toolName, payload.toolName]),
+    tool: firstString([item.tool, payload.tool]),
+    args: item.args || payload.args,
+    arguments: firstString([item.arguments, payload.arguments]),
+    input: item.input || payload.input,
+    code: firstString([item.code, payload.code]),
+  });
+  return normalized || 'Working...';
 }
 
 function stepKeyForRunEvent(evt) {
diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index c13c0f5..50d376e 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -13,6 +13,63 @@ function firstString(values) {
   return '';
 }
 
+function isBrowserForceExecutePayload(payload = {}) {
+  const name = String(firstString([
+    payload.name,
+    payload.toolName,
+    payload.tool,
+  ]) || '').trim().toLowerCase();
+
+  if (name === 'browserforce:execute' || name === 'mcp__browserforce__execute') return true;
+  if (name !== 'execute') return false;
+
+  const args = payload?.args && typeof payload.args === 'object' ? payload.args : null;
+  if (args && typeof args.code === 'string') return true;
+  if (typeof payload.code === 'string') return true;
+
+  const rawArgs = String(firstString([payload.arguments, payload.input]) || '').trim();
+  return /"code"\s*:/.test(rawArgs);
+}
+
+function isBrowserForceResetPayload(payload = {}) {
+  const name = String(firstString([
+    payload.name,
+    payload.toolName,
+    payload.tool,
+  ]) || '').trim().toLowerCase();
+
+  if (name === 'browserforce:reset' || name === 'mcp__browserforce__reset') return true;
+  if (name !== 'reset') return false;
+
+  const args = payload?.args && typeof payload.args === 'object' ? payload.args : null;
+  if (args && Object.keys(args).length > 0) return false;
+
+  const rawArgs = String(firstString([payload.arguments, payload.input]) || '').trim();
+  return !rawArgs || rawArgs === '{}' || rawArgs === 'null';
+}
+
+function normalizeToolLabel(label, payload = {}) {
+  const raw = String(label || '').trim();
+  if (!raw) return '';
+  const normalized = raw.toLowerCase();
+
+  if (
+    isBrowserForceExecutePayload(payload)
+    && (normalized === 'execute' || normalized === 'mcp__browserforce__execute' || normalized === 'browserforce:execute')
+  ) {
+    return 'BrowserForce:execute';
+  }
+
+  if (
+    isBrowserForceResetPayload(payload)
+    && (normalized === 'reset' || normalized === 'mcp__browserforce__reset' || normalized === 'browserforce:reset')
+  ) {
+    return 'BrowserForce:reset';
+  }
+
+  return raw;
+}
+
 const SHELL_LC_WRAPPER_RE = /^(?:\/usr\/bin\/env\s+)?(?:\/bin\/)?(?:zsh|bash|sh)\s+-lc\s+([\s\S]+)$/i;
 
 function unwrapShellLcCommand(value) {
@@ -335,26 +392,21 @@ function applyFinalTextToTimeline(run, finalText) {
 
 function stepLabelForToolEvent(evt) {
   const payload = evt?.payload || {};
+  const toolLabel = normalizeToolLabel(firstString([
+    payload.command,
+    payload.title,
+    payload.name,
+    payload.tool,
+    payload.toolName,
+  ]), payload);
   if (evt.event === 'tool.started') {
-    return firstString([
-      payload.command,
-      payload.title,
-      payload.name,
-      payload.tool,
-      payload.toolName,
-    ]) || 'Tool call started';
+    return toolLabel || 'Tool call started';
   }
   if (evt.event === 'tool.final') {
-    return firstString([
-      payload.command,
-      payload.title,
-      payload.name,
-      payload.tool,
-      payload.toolName,
-    ]) || 'Tool call completed';
+    return toolLabel || 'Tool call completed';
   }
   if (evt.event === 'tool.delta') {
-    return firstString([
+    return normalizeToolLabel(firstString([
       payload.text,
       payload.message,
       payload.delta,
@@ -363,7 +415,7 @@ function stepLabelForToolEvent(evt) {
       payload.tool,
       payload.toolName,
       payload.type === 'reasoning' ? 'Reasoning' : '',
-    ]) || 'Working...';
+    ]), payload) || 'Working...';
   }
   return '';
 }
@@ -432,7 +484,7 @@ function stepKindForRunEvent(evt) {
 function stepLabelForRunEvent(evt) {
   const payload = evt?.payload || {};
   const item = payload?.item && typeof payload.item === 'object' ? payload.item : {};
-  return firstString([
+  const label = firstString([
     payload.title,
     payload.message,
     payload.text,
@@ -446,7 +498,20 @@ function stepLabelForRunEvent(evt) {
     item.command,
     item.type ? humanizeToken(item.type) : '',
     payload.type ? humanizeToken(payload.type) : '',
-  ]) || 'Working...';
+  ]);
+
+  const normalized = normalizeToolLabel(label, {
+    ...payload,
+    ...item,
+    name: firstString([item.name, payload.name]),
+    toolName: firstString([item.toolName, payload.toolName]),
+    tool: firstString([item.tool, payload.tool]),
+    args: item.args || payload.args,
+    arguments: firstString([item.arguments, payload.arguments]),
+    input: item.input || payload.input,
+    code: firstString([item.code, payload.code]),
+  });
+  return normalized || 'Working...';
 }
 
 function stepKeyForRunEvent(evt) {
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 27d56cc..7221b1b 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -338,6 +338,7 @@ test('POST /v1/runs persists execute tool details for collapsible timeline rows'
     const assistant = (messagesBody.messages || []).at(-1);
     const executeStep = (assistant?.timeline || []).find((item) => item?.type === 'step' && /execute/i.test(item?.label || ''));
 
+    assert.equal(executeStep?.label, 'BrowserForce:execute');
     assert.equal(Array.isArray(executeStep?.details), true);
     assert.equal(executeStep.details.some((line) => /snapshot/.test(line)), true);
   } finally {
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index e2596aa..1e52e2b 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -183,6 +183,7 @@ test('execute tool step captures code details for collapsible rendering', () =>
   });
 
   const step = s2.runs.r1.steps.find((item) => /execute/i.test(item?.label || ''));
+  assert.equal(step?.label, 'BrowserForce:execute');
   assert.deepEqual(step?.details, [
     'const rows = await snapshot();',
     'return rows;',
@@ -293,7 +294,7 @@ test('run.event extracts execute code from nested item input for collapsible det
   });
 
   const step = (s2.runs.r1.steps || []).find((item) => item?.key === 'tool:item_2');
-  assert.equal(step?.label, 'execute');
+  assert.equal(step?.label, 'BrowserForce:execute');
   assert.deepEqual(step?.details, [
     'const rows = await snapshot();',
     'return rows;',

From 8924417cac62abc78a47f66ae0da976855d06c45 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 16:41:05 +0530
Subject: [PATCH 163/192] google-sheets: add summary-first helper and workflow
 guidance

---
 README.md                               |   2 +-
 docs/google-sheets-issues.md            |   6 +
 plugins/official/google-sheets/SKILL.md |  40 ++++--
 plugins/official/google-sheets/index.js | 156 ++++++++++++++++++------
 4 files changed, 152 insertions(+), 52 deletions(-)

diff --git a/README.md b/README.md
index 9948bc9..7e5539d 100644
--- a/README.md
+++ b/README.md
@@ -445,7 +445,7 @@ That's it. Restart MCP (or Claude Desktop) and `highlight()` is available in eve
 | Plugin      | What it adds                                                                                   | Install                                 |
 | ----------- | ---------------------------------------------------------------------------------------------- | --------------------------------------- |
 | `highlight` | `highlight(selector, color?)` — outlines matching elements; `clearHighlights()` — removes them | `browserforce plugin install highlight` |
-| `google-sheets` | `gsReadContiguousRows()`; `gsFormatBulletsInRange()`; `gsSplitBulletsInRange()`; `gsRebalanceBoldInRange()`; `gsLogIssue()` | `browserforce plugin install google-sheets` |
+| `google-sheets` | `gsSummarizeSheet()`; `gsReadContiguousRows()`; `gsFormatBulletsInRange()`; `gsSplitBulletsInRange()`; `gsRebalanceBoldInRange()`; `gsLogIssue()` | `browserforce plugin install google-sheets` |
 | `openclaw`  | OpenClaw-specific BrowserForce tab policy (skill text only, no helper functions)              | Auto-installed by `browserforce setup openclaw` |
 
 
diff --git a/docs/google-sheets-issues.md b/docs/google-sheets-issues.md
index 64dcf12..1a0325c 100644
--- a/docs/google-sheets-issues.md
+++ b/docs/google-sheets-issues.md
@@ -35,3 +35,9 @@ Use this format for each new entry:
 - Root cause: Feature work started before surveying existing Claude and MCP Google Sheets solutions.
 - Fix: Added a mandatory pre-build lookup step against official docs + known MCP repositories.
 - Rule: Before expanding Sheets automation behavior, check official support and existing MCP implementations.
+
+## 2026-03-04 — [SUMMARY] Export Drift During Simple Read Requests
+- Symptom: Agent attempted gviz/CSV export and extra-tab fetch flows when the user only asked for a page summary.
+- Root cause: Skill guidance did not enforce a summary-first path for Google Sheets and lacked anti-export guardrails.
+- Fix: Added `gsSummarizeSheet()` helper plus strict skill rules to summarize directly from active-sheet helpers first.
+- Rule: For "summarize/read this sheet" requests, use helper-driven page reads and answer directly before any export path.
diff --git a/plugins/official/google-sheets/SKILL.md b/plugins/official/google-sheets/SKILL.md
index e96f2c9..82a0fca 100644
--- a/plugins/official/google-sheets/SKILL.md
+++ b/plugins/official/google-sheets/SKILL.md
@@ -1,46 +1,64 @@
 ## google-sheets plugin
 
-Use Google Sheets helpers when work involves reading or structuring sheet content reliably without guesswork.
+Use Google Sheets helpers when work involves reading, summarizing, or structuring sheet content from the active page without guesswork.
+
+Tool naming note:
+- The same browser tool may appear as `execute` or `BrowserForce:execute`.
+- Treat both labels as the same BrowserForce execution path.
 
 Available helpers:
 - `gsGetMeta()` → current spreadsheet id + gid + title + URL
 - `gsGotoCell(cellRef)` → jump to a cell using the Sheets name box
 - `gsReadCell(cellRef, options?)` → read cell text through the in-cell editor
 - `gsReadContiguousRows(options?)` → detect used rows without hard-scanning arbitrary ranges
+- `gsSummarizeSheet(options?)` → one-call summary payload (sheet meta + scan stats + preview rows)
 - `gsSplitBulletsInRange(rangeRef, options?)` → replace in-cell bullet separators with real new lines
 - `gsRebalanceBoldInRange(rangeRef, options?)` → sparse bolding (default: max 1 bold segment per line)
 - `gsFormatBulletsInRange(rangeRef, options?)` → split bullets + rebalance bold in one pass
 - `gsLogIssue(summary, details?, options?)` → append a JSONL issue entry
 - `gsIssueLogPath()` → return default issue log path
 
+## Summary-First Workflow (Default)
+
+When the user says "summarize this page/sheet", "read this sheet", or equivalent:
+- Use `gsSummarizeSheet()` first.
+- Answer directly from returned `preview` rows.
+- Include `scannedRows`, `usedRowCount`, and `stopReason` in the summary.
+- Ask a focused follow-up only when `usedRowCount === 0` or the user asks for a wider range.
+
 ## Reliability Rules
 
 - Never hardcode long row scans (`1..80`, `1..200`) when structure is contiguous.
 - Use `gsReadContiguousRows({ columns: ['A','B'], startRow: 1, maxRows: 30, emptyStreakStop: 2 })`.
 - Always report `scannedRows`, `usedRowCount`, and `stopReason` when summarizing extraction.
+- For summary requests, prefer `gsSummarizeSheet()` over ad-hoc DOM probing loops.
 - Prefer `gsFormatBulletsInRange()` for multi-cell content cleanup tasks.
 - Use `dryRun: true` first for formatting helpers when changing many cells.
 - Log every process failure or unexpected behavior with `gsLogIssue(...)`.
 
-## Example: Read Guidelines Table
+## Guardrails (Google Sheets)
+
+- Do not switch to `/export`, `/gviz`, CSV downloads, or out-of-page fetch flows unless the user explicitly asks for export data.
+- Do not open extra tabs for summary-only requests.
+- Do not infer cell content from toolbar/status text when table rows are available via helpers.
+
+## Example: One-Shot Summary
 
 ```js
-const meta = await gsGetMeta();
-const result = await gsReadContiguousRows({
-  columns: ['A', 'B'],
+const result = await gsSummarizeSheet({
   startRow: 1,
   maxRows: 30,
-  emptyStreakStop: 2
+  previewRows: 8
 });
 
 return {
-  sheet: meta,
+  sheet: result.sheet,
   scan: {
-    scannedRows: result.scannedRows,
-    usedRowCount: result.usedRowCount,
-    stopReason: result.stopReason
+    scannedRows: result.scan.scannedRows,
+    usedRowCount: result.scan.usedRowCount,
+    stopReason: result.scan.stopReason
   },
-  rows: result.rows
+  preview: result.preview
 };
 ```
 
diff --git a/plugins/official/google-sheets/index.js b/plugins/official/google-sheets/index.js
index 750173a..1ede7cd 100644
--- a/plugins/official/google-sheets/index.js
+++ b/plugins/official/google-sheets/index.js
@@ -421,6 +421,96 @@ function hasData(cells) {
   return Object.values(cells).some((value) => Boolean(String(value || '').trim()));
 }
 
+async function scanContiguousRows(page, options = {}) {
+  const columns = normalizeColumns(options.columns || ['A', 'B']);
+  const startRow = Number.isInteger(options.startRow) && options.startRow > 0 ? options.startRow : 1;
+  const maxRows = Number.isInteger(options.maxRows) && options.maxRows > 0
+    ? options.maxRows
+    : DEFAULT_SCAN_MAX_ROWS;
+  const emptyStreakStop = Number.isInteger(options.emptyStreakStop) && options.emptyStreakStop > 0
+    ? options.emptyStreakStop
+    : DEFAULT_EMPTY_STREAK_STOP;
+
+  const rows = [];
+  let scannedRows = 0;
+  let seenData = false;
+  let emptyStreak = 0;
+  let stopReason = 'max_rows_reached';
+
+  for (let i = 0; i < maxRows; i += 1) {
+    const row = startRow + i;
+    const cells = await readRow(page, row, columns, options);
+    scannedRows += 1;
+
+    if (hasData(cells)) {
+      rows.push({ row, cells });
+      seenData = true;
+      emptyStreak = 0;
+      continue;
+    }
+
+    if (seenData) {
+      emptyStreak += 1;
+      if (emptyStreak >= emptyStreakStop) {
+        stopReason = 'empty_streak_stop';
+        break;
+      }
+    }
+  }
+
+  return {
+    rows,
+    scannedRows,
+    usedRowCount: rows.length,
+    stopReason,
+    config: { columns, startRow, maxRows, emptyStreakStop },
+  };
+}
+
+async function inferColumnsFromHeaderRow(page, options = {}) {
+  const startRow = Number.isInteger(options.startRow) && options.startRow > 0 ? options.startRow : 1;
+  const maxColumns = Number.isInteger(options.maxColumns) && options.maxColumns > 0
+    ? options.maxColumns
+    : 8;
+  const emptyColumnStreakStop = Number.isInteger(options.emptyColumnStreakStop) && options.emptyColumnStreakStop > 0
+    ? options.emptyColumnStreakStop
+    : 1;
+  const fallbackColumnsCount = Number.isInteger(options.fallbackColumnsCount) && options.fallbackColumnsCount > 0
+    ? options.fallbackColumnsCount
+    : 2;
+  const startColumn = normalizeColumns([options.startColumn || 'A'])[0];
+  const startColIdx = columnToIndex(startColumn);
+
+  const columns = [];
+  let seenData = false;
+  let emptyStreak = 0;
+
+  for (let i = 0; i < maxColumns; i += 1) {
+    const col = indexToColumn(startColIdx + i);
+    const { value } = await readCell(page, `${col}${startRow}`, options);
+    const nonEmpty = Boolean(String(value || '').trim());
+    if (nonEmpty) {
+      columns.push(col);
+      seenData = true;
+      emptyStreak = 0;
+      continue;
+    }
+    if (seenData) {
+      emptyStreak += 1;
+      if (emptyStreak >= emptyColumnStreakStop) break;
+    }
+  }
+
+  if (columns.length > 0) return columns;
+
+  const fallback = [];
+  const count = Math.min(Math.max(fallbackColumnsCount, 1), maxColumns);
+  for (let i = 0; i < count; i += 1) {
+    fallback.push(indexToColumn(startColIdx + i));
+  }
+  return fallback;
+}
+
 export default {
   name: 'google-sheets',
   description: 'Google Sheets helpers for reliable row scanning, cell reads, and issue logging',
@@ -447,49 +537,35 @@ export default {
 
     gsReadContiguousRows: async (page, ctx, state, options = {}) => {
       assertGoogleSheet(page, 'gsReadContiguousRows');
+      return scanContiguousRows(page, options);
+    },
 
-      const columns = normalizeColumns(options.columns || ['A', 'B']);
-      const startRow = Number.isInteger(options.startRow) && options.startRow > 0 ? options.startRow : 1;
-      const maxRows = Number.isInteger(options.maxRows) && options.maxRows > 0
-        ? options.maxRows
-        : DEFAULT_SCAN_MAX_ROWS;
-      const emptyStreakStop = Number.isInteger(options.emptyStreakStop) && options.emptyStreakStop > 0
-        ? options.emptyStreakStop
-        : DEFAULT_EMPTY_STREAK_STOP;
-
-      const rows = [];
-      let scannedRows = 0;
-      let seenData = false;
-      let emptyStreak = 0;
-      let stopReason = 'max_rows_reached';
-
-      for (let i = 0; i < maxRows; i += 1) {
-        const row = startRow + i;
-        const cells = await readRow(page, row, columns, options);
-        scannedRows += 1;
-
-        if (hasData(cells)) {
-          rows.push({ row, cells });
-          seenData = true;
-          emptyStreak = 0;
-          continue;
-        }
-
-        if (seenData) {
-          emptyStreak += 1;
-          if (emptyStreak >= emptyStreakStop) {
-            stopReason = 'empty_streak_stop';
-            break;
-          }
-        }
-      }
+    gsSummarizeSheet: async (page, ctx, state, options = {}) => {
+      assertGoogleSheet(page, 'gsSummarizeSheet');
+      const title = await page.title();
+      const sheet = { ...parseSheetMeta(page.url()), title };
+      const includeRows = options.includeRows === true;
+      const previewRows = Number.isInteger(options.previewRows) && options.previewRows > 0 ? options.previewRows : 8;
+      const columns = options.columns
+        ? normalizeColumns(options.columns)
+        : await inferColumnsFromHeaderRow(page, options);
+      const scanResult = await scanContiguousRows(page, { ...options, columns });
+      const preview = scanResult.rows.slice(0, previewRows).map((entry) => ({ row: entry.row, cells: entry.cells }));
+      const firstDataRow = scanResult.rows[0] || null;
+      const headerCandidate = scanResult.rows.find((entry) => entry.row === scanResult.config.startRow) || null;
 
       return {
-        rows,
-        scannedRows,
-        usedRowCount: rows.length,
-        stopReason,
-        config: { columns, startRow, maxRows, emptyStreakStop },
+        sheet,
+        columns,
+        scan: {
+          scannedRows: scanResult.scannedRows,
+          usedRowCount: scanResult.usedRowCount,
+          stopReason: scanResult.stopReason,
+        },
+        firstDataRow: firstDataRow ? { row: firstDataRow.row, cells: firstDataRow.cells } : null,
+        headerCandidate: headerCandidate ? { row: headerCandidate.row, cells: headerCandidate.cells } : null,
+        preview,
+        ...(includeRows ? { rows: scanResult.rows } : {}),
       };
     },
 

From a223e0d95c9085faf755099ac7b822c56fdd2ff0 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 20:11:40 +0530
Subject: [PATCH 164/192] agent-panel: animate active reasoning titles with
 shimmer and enter transition

---
 extension/agent-panel.css               | 68 +++++++++++++++++++++++++
 extension/agent-panel.js                | 31 ++++++++++-
 test/agent/agent-panel-contract.test.js | 14 +++++
 3 files changed, 111 insertions(+), 2 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index a7074d7..4d1e19b 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -399,6 +399,45 @@ body {
   white-space: pre-wrap;
 }
 
+.step-label.title-label {
+  font-size: 13px;
+  font-weight: 600;
+  color: var(--text);
+  line-height: 1.3;
+  letter-spacing: 0.01em;
+  transition: opacity 0.18s ease, transform 0.18s ease;
+}
+
+.step-item:not(.latest) .step-label.title-label {
+  opacity: 0.84;
+}
+
+.step-label.title-label.shimmer-text {
+  color: transparent;
+  -webkit-text-fill-color: transparent;
+  background-image: linear-gradient(
+    95deg,
+    rgba(61, 48, 40, 0.45) 0%,
+    rgba(61, 48, 40, 0.45) 35%,
+    rgba(193, 95, 60, 0.96) 50%,
+    rgba(61, 48, 40, 0.45) 65%,
+    rgba(61, 48, 40, 0.45) 100%
+  );
+  background-size: 220% 100%;
+  background-position: 110% 0;
+  -webkit-background-clip: text;
+  background-clip: text;
+  animation: reasoning-shimmer 2.3s ease-in-out infinite;
+}
+
+.step-label.title-label.title-transition-in {
+  animation: reasoning-title-in 210ms ease-out;
+}
+
+.step-label.title-label.shimmer-text.title-transition-in {
+  animation: reasoning-title-in 210ms ease-out, reasoning-shimmer 2.3s ease-in-out 150ms infinite;
+}
+
 .step-label strong {
   font-weight: 600;
   color: var(--text);
@@ -989,3 +1028,32 @@ body {
     opacity: 0.65;
   }
 }
+
+@keyframes reasoning-shimmer {
+  0% {
+    background-position: 110% 0;
+  }
+  100% {
+    background-position: -110% 0;
+  }
+}
+
+@keyframes reasoning-title-in {
+  from {
+    opacity: 0;
+    transform: translateY(4px);
+  }
+  to {
+    opacity: 1;
+    transform: translateY(0);
+  }
+}
+
+@media (prefers-reduced-motion: reduce) {
+  .step-item.pulse .run-step-icon,
+  .step-label.title-label.shimmer-text,
+  .step-label.title-label.title-transition-in,
+  .step-label.title-label.shimmer-text.title-transition-in {
+    animation: none;
+  }
+}
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 89c9401..30cb247 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -15,6 +15,7 @@ const state = {
   modelPresets: [{ value: null, label: 'Default' }],
   currentRunBySession: {},
   expandedTimelineEntries: {},
+  latestReasoningTitleByRun: {},
   transcriptHandlersBound: false,
   initialTabAttachInFlight: false,
   initialTabAttachStarted: false,
@@ -430,6 +431,21 @@ function getLatestInFlightTimelineStepIndex(run, timeline) {
   return -1;
 }
 
+function shouldAnimateLatestReasoningTitle({ run, entry, isLatest, isRunningReasoning }) {
+  if (!isLatest || !isRunningReasoning) return false;
+  const runId = String(run?.runId || '').trim();
+  if (!runId) return false;
+  const signature = `${String(entry?.key || '').trim()}::${String(entry?.label || '').trim()}`;
+  if (!signature || signature === '::') return false;
+  const previous = state.latestReasoningTitleByRun[runId];
+  if (previous === signature) return false;
+  state.latestReasoningTitleByRun = {
+    ...state.latestReasoningTitleByRun,
+    [runId]: signature,
+  };
+  return true;
+}
+
 function renderRunTimeline(run, fallbackText = '') {
   const timeline = normalizeRunTimeline(run, fallbackText);
   if (!timeline.length) return '';
@@ -450,16 +466,27 @@ function renderRunTimeline(run, fallbackText = '') {
       return `<div class="bubble-assistant"><p>${renderContent(entry.text || '')}</p></div>`;
     }
     const status = entry?.status || 'running';
+    const normalizedStatus = String(status || '').toLowerCase();
     const icon = classifyRunStepIcon(entry);
     const isLatest = index === latestStepIndex;
     const shouldPulse = isLatest && status === 'running';
+    const isReasoningTitle = String(entry?.kind || '').toLowerCase() === 'reasoning';
+    const isRunningReasoning = isReasoningTitle && normalizedStatus === 'running';
+    const labelClasses = ['step-label'];
+    if (isReasoningTitle) labelClasses.push('title-label');
+    if (isRunningReasoning && isLatest) {
+      labelClasses.push('shimmer-text');
+      if (shouldAnimateLatestReasoningTitle({ run, entry, isLatest, isRunningReasoning })) {
+        labelClasses.push('title-transition-in');
+      }
+    }
     const details = Array.isArray(entry?.details) ? entry.details.filter(Boolean) : [];
     const isCollapsible = details.length > 0;
     const classes = ['step-item', 'timeline-step', escapeHtml(status)];
     if (isLatest) classes.push('latest');
     if (shouldPulse) classes.push('pulse');
     if (!isCollapsible) {
-      return `<div class="${classes.join(' ')}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="step-label">${renderInlineContent(entry.label || 'Step')}</span></div>`;
+      return `<div class="${classes.join(' ')}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="${labelClasses.join(' ')}">${renderInlineContent(entry.label || 'Step')}</span></div>`;
     }
     classes.push('collapsible');
     const key = getTimelineEntryKey(entry, index);
@@ -473,7 +500,7 @@ function renderRunTimeline(run, fallbackText = '') {
         <span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span>
         <div class="step-body">
           <button type="button" class="step-toggle" data-step-key="${escapeHtml(key)}" aria-expanded="${expanded ? 'true' : 'false'}">
-            <span class="step-label">${renderInlineContent(entry.label || 'Step')}</span>
+            <span class="${labelClasses.join(' ')}">${renderInlineContent(entry.label || 'Step')}</span>
             <span class="step-caret" aria-hidden="true"></span>
           </button>
           ${expanded ? `<ul class="step-details">${detailsHtml}</ul>` : ''}
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index f0d8a64..45a65f2 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -4,6 +4,7 @@ import assert from 'node:assert/strict';
 
 const html = fs.readFileSync('extension/agent-panel.html', 'utf8');
 const css = fs.readFileSync('extension/agent-panel.css', 'utf8');
+const panelJs = fs.readFileSync('extension/agent-panel.js', 'utf8');
 
 test('agent panel has inline model and session selectors with popovers', () => {
   assert.match(html, /id="bf-model-trigger"/);
@@ -61,3 +62,16 @@ test('agent panel composer matches compact/expanded shell structure', () => {
 test('composer action buttons respect hidden attribute for send/stop swapping', () => {
   assert.match(css, /\.composer-actions button\[hidden\][\s\S]*display:\s*none/);
 });
+
+test('reasoning title rows use shimmer and enter transition treatment', () => {
+  assert.match(panelJs, /shouldAnimateLatestReasoningTitle/);
+  assert.match(panelJs, /title-label/);
+  assert.match(panelJs, /shimmer-text/);
+  assert.match(panelJs, /title-transition-in/);
+  assert.match(css, /\.step-label\.title-label/);
+  assert.match(css, /\.step-label\.title-label\.shimmer-text/);
+  assert.match(css, /\.step-label\.title-label\.title-transition-in/);
+  assert.match(css, /@keyframes reasoning-shimmer/);
+  assert.match(css, /@keyframes reasoning-title-in/);
+  assert.match(css, /@media\s*\(prefers-reduced-motion:\s*reduce\)/);
+});

From 7dde1fa0a20b7872ef308474ce56c2b5e4a9fd80 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 21:45:44 +0530
Subject: [PATCH 165/192] test: cover sheets cache invalidation helpers

---
 mcp/src/plugin-loader.js             | 283 +++++++++++++++++++++++-
 mcp/test/exec-engine-plugins.test.js | 316 +++++++++++++++++++++++++++
 2 files changed, 593 insertions(+), 6 deletions(-)

diff --git a/mcp/src/plugin-loader.js b/mcp/src/plugin-loader.js
index 69e417d..130029d 100644
--- a/mcp/src/plugin-loader.js
+++ b/mcp/src/plugin-loader.js
@@ -5,6 +5,208 @@ import { homedir } from 'node:os';
 
 export const PLUGINS_DIR = join(homedir(), '.browserforce', 'plugins');
 
+function stripWrappingQuotes(value) {
+  if (value.length >= 2) {
+    const first = value[0];
+    const last = value[value.length - 1];
+    if ((first === '"' && last === '"') || (first === '\'' && last === '\'')) {
+      return value.slice(1, -1);
+    }
+  }
+  return value;
+}
+
+const CANONICAL_SKILL_META_KEYS = new Set([
+  'name',
+  'description',
+  'helpers',
+  'tools',
+  'when_to_use',
+]);
+const CANONICAL_SKILL_LIST_KEYS = new Set([
+  'helpers',
+  'tools',
+  'when_to_use',
+]);
+
+function parseBlockScalarValue(lines, style) {
+  if (style === '|') {
+    return lines.join('\n').trimEnd();
+  }
+
+  // Minimal folded-scalar support for `>`: fold newlines into spaces.
+  return lines
+    .map((line) => line.trim())
+    .join(' ')
+    .replace(/\s+/g, ' ')
+    .trim();
+}
+
+function normalizeListItem(value) {
+  return stripWrappingQuotes(String(value || '').trim());
+}
+
+function parseInlineList(rawValue) {
+  const trimmed = String(rawValue || '').trim();
+  if (!trimmed.startsWith('[') || !trimmed.endsWith(']')) {
+    return null;
+  }
+  try {
+    const parsed = JSON.parse(trimmed);
+    if (Array.isArray(parsed)) {
+      return parsed.map(normalizeListItem).filter(Boolean);
+    }
+  } catch { /* fall back to scalar parsing */ }
+  return null;
+}
+
+function normalizeMetaValue(key, value) {
+  const normalizedValue = typeof value === 'string' ? value.trim() : value;
+  if (!CANONICAL_SKILL_LIST_KEYS.has(key)) {
+    return normalizedValue;
+  }
+
+  const inline = parseInlineList(normalizedValue);
+  if (inline) return inline;
+  if (typeof normalizedValue !== 'string') return [];
+  if (!normalizedValue) return [];
+  if (normalizedValue.includes(',')) {
+    return normalizedValue.split(',').map(normalizeListItem).filter(Boolean);
+  }
+  return [normalizeListItem(normalizedValue)].filter(Boolean);
+}
+
+function parseSkillFrontmatter(rawSkill = '') {
+  const skillText = typeof rawSkill === 'string' ? rawSkill : '';
+
+  try {
+    if (!skillText.startsWith('---')) {
+      return { meta: {}, body: skillText };
+    }
+
+    const match = skillText.match(/^---\s*\r?\n([\s\S]*?)\r?\n---\s*(?:\r?\n)?([\s\S]*)$/);
+    if (!match) {
+      return { meta: {}, body: skillText };
+    }
+
+    const [, rawMeta, rawBody] = match;
+    const meta = {};
+    const lines = rawMeta.split(/\r?\n/);
+    for (let i = 0; i < lines.length; i++) {
+      const line = lines[i];
+      const trimmed = line.trim();
+      if (!trimmed || trimmed.startsWith('#')) continue;
+
+      const keyMatch = line.match(/^\s*([^:]+?)\s*:\s*(.*)$/);
+      if (!keyMatch) continue;
+
+      const key = keyMatch[1].trim().toLowerCase();
+      const rawValue = keyMatch[2].trim();
+
+      if (rawValue === '|' || rawValue === '>') {
+        const blockLines = [];
+        let j = i + 1;
+        for (; j < lines.length; j++) {
+          const blockLine = lines[j];
+          if (blockLine.trim() === '') {
+            blockLines.push('');
+            continue;
+          }
+          if (!/^\s+/.test(blockLine)) {
+            break;
+          }
+          blockLines.push(blockLine.replace(/^\s+/, ''));
+        }
+
+        i = j - 1;
+        if (!CANONICAL_SKILL_META_KEYS.has(key)) continue;
+        meta[key] = normalizeMetaValue(key, parseBlockScalarValue(blockLines, rawValue));
+        continue;
+      }
+
+      if (rawValue === '' && CANONICAL_SKILL_LIST_KEYS.has(key)) {
+        const listItems = [];
+        let j = i + 1;
+        for (; j < lines.length; j++) {
+          const listLine = lines[j];
+          if (!listLine.trim()) continue;
+          if (!/^\s+/.test(listLine)) break;
+          const listMatch = listLine.match(/^\s*-\s+(.+)$/);
+          if (!listMatch) break;
+          listItems.push(normalizeListItem(listMatch[1]));
+        }
+        i = j - 1;
+        meta[key] = listItems.filter(Boolean);
+        continue;
+      }
+
+      if (!CANONICAL_SKILL_META_KEYS.has(key)) continue;
+      meta[key] = normalizeMetaValue(key, stripWrappingQuotes(rawValue));
+    }
+
+    return { meta, body: rawBody };
+  } catch {
+    return { meta: {}, body: skillText };
+  }
+}
+
+function normalizeMarkdownHeading(heading) {
+  return heading
+    .toLowerCase()
+    .trim()
+    .replace(/^[\d.)\s-]+/, '')
+    .replace(/[^\p{L}\p{N}\s-]/gu, '')
+    .replace(/\s+/g, ' ')
+    .trim();
+}
+
+function extractSkillSections(skillBody = '') {
+  const normalizedBody = typeof skillBody === 'string' ? skillBody.replace(/\r\n/g, '\n') : '';
+  const sections = {};
+  const headingEntries = [];
+  const lines = normalizedBody.split('\n');
+  let offset = 0;
+  let activeFence = null;
+
+  for (const line of lines) {
+    const fenceMatch = line.match(/^\s*(`{3,}|~{3,})/);
+    if (fenceMatch) {
+      const fence = fenceMatch[1];
+      if (!activeFence) {
+        activeFence = { char: fence[0], len: fence.length };
+      } else if (fence[0] === activeFence.char && fence.length >= activeFence.len) {
+        activeFence = null;
+      }
+    } else if (!activeFence) {
+      const headingMatch = line.match(/^\s*##\s+(.+?)\s*$/);
+      if (headingMatch) {
+        headingEntries.push({
+          headingText: String(headingMatch[1] || '').trim(),
+          lineStart: offset,
+          contentStart: offset + line.length + 1,
+        });
+      }
+    }
+    offset += line.length + 1;
+  }
+
+  if (headingEntries.length === 0) return sections;
+
+  for (let i = 0; i < headingEntries.length; i++) {
+    const { headingText, contentStart } = headingEntries[i];
+    const key = normalizeMarkdownHeading(headingText);
+    if (!key) continue;
+    const contentEnd = i + 1 < headingEntries.length ? headingEntries[i + 1].lineStart : normalizedBody.length;
+    const safeContentStart = Math.min(contentStart, normalizedBody.length);
+    const sectionBody = normalizedBody.slice(safeContentStart, contentEnd).trim();
+    if (sectionBody) {
+      sections[key] = sectionBody;
+    }
+  }
+
+  return sections;
+}
+
 /**
  * Scan pluginsDir for subfolders with index.js. Loads each as an ESM module.
  * @param {string} [pluginsDir]
@@ -47,8 +249,15 @@ export async function loadPlugins(pluginsDir = PLUGINS_DIR) {
     try {
       skill = await readFile(join(pluginDir, 'SKILL.md'), 'utf8');
     } catch { /* SKILL.md is optional */ }
+    const { meta: skillMeta, body: skillBody } = parseSkillFrontmatter(skill);
 
-    plugins.push({ ...plugin, _skill: skill, _dir: pluginDir });
+    plugins.push({
+      ...plugin,
+      _skill: skill,
+      _skillMeta: skillMeta,
+      _skillBody: skillBody,
+      _dir: pluginDir,
+    });
     process.stderr.write(`[bf-plugins] Loaded plugin: ${plugin.name}\n`);
   }
 
@@ -75,11 +284,73 @@ export function buildPluginHelpers(plugins) {
 
 /**
  * Build the SKILL.md appendix to append to the execute tool prompt.
- * Only includes plugins that have non-empty SKILL.md content.
+ * Includes plugins that provide either non-empty SKILL.md content or parsed
+ * frontmatter metadata.
  */
 export function buildPluginSkillAppendix(plugins) {
-  const sections = plugins
-    .filter(p => p._skill && p._skill.trim())
-    .map(p => `\n\n═══ PLUGIN: ${p.name} ═══\n\n${p._skill.trim()}`);
-  return sections.join('');
+  const lines = [];
+  lines.push('\n\n═══ PLUGINS (METADATA-ONLY) ═══');
+  lines.push('Use pluginCatalog() for plugin metadata, then pluginHelp(name, section?) for details on demand.');
+
+  let included = 0;
+  for (const plugin of plugins) {
+    const skillBody = typeof plugin._skillBody === 'string' ? plugin._skillBody : plugin._skill;
+    const hasSkill = typeof skillBody === 'string' && skillBody.trim().length > 0;
+    const meta = plugin._skillMeta && typeof plugin._skillMeta === 'object' ? plugin._skillMeta : {};
+    const hasMeta = Object.keys(meta).length > 0;
+    if (!hasSkill && !hasMeta) continue;
+    included += 1;
+
+    const helperNames = Object.keys(plugin.helpers || {});
+    const description = String(meta.description || '').trim() || 'No description provided';
+    lines.push('');
+    lines.push(`PLUGIN: ${plugin.name}`);
+    lines.push(`description: ${description}`);
+    lines.push(`helpers: ${helperNames.length ? helperNames.join(', ') : '(none)'}`);
+  }
+
+  if (included === 0) {
+    lines.push('No plugin skills currently advertise metadata.');
+  }
+
+  return lines.join('\n');
+}
+
+export function buildPluginSkillRuntime(plugins) {
+  const catalog = [];
+  const byName = {};
+
+  for (const plugin of plugins) {
+    const normalizedName = String(plugin.name).toLowerCase();
+    if (Object.prototype.hasOwnProperty.call(byName, normalizedName)) {
+      process.stderr.write(
+        `[bf-plugins] Duplicate plugin skill name after normalization: "${plugin.name}" conflicts with "${byName[normalizedName].name}" (key "${normalizedName}"). Keeping first.\n`
+      );
+      continue;
+    }
+
+    const helperNames = Object.keys(plugin.helpers || {});
+    const meta = plugin._skillMeta && typeof plugin._skillMeta === 'object' ? plugin._skillMeta : {};
+    const skillBody = (typeof plugin._skillBody === 'string' ? plugin._skillBody : plugin._skill || '').trim();
+    const description = String(meta.description || '').trim() || '';
+    const sections = extractSkillSections(skillBody);
+    const sectionNames = Object.keys(sections);
+
+    catalog.push({
+      name: plugin.name,
+      description: description || 'No description provided',
+      helpers: helperNames,
+      sections: sectionNames,
+    });
+
+    byName[normalizedName] = {
+      name: plugin.name,
+      description,
+      text: skillBody,
+      sections,
+      helpers: helperNames,
+    };
+  }
+
+  return { catalog, byName };
 }
diff --git a/mcp/test/exec-engine-plugins.test.js b/mcp/test/exec-engine-plugins.test.js
index 71c3659..50fe705 100644
--- a/mcp/test/exec-engine-plugins.test.js
+++ b/mcp/test/exec-engine-plugins.test.js
@@ -80,6 +80,60 @@ function createPageMarkdownPage(content = 'Markdown content line', options = {})
   };
 }
 
+function createGoogleSheetsMockPage(cellValues = {}) {
+  let activeRef = 'A1';
+  let editorReadCount = 0;
+
+  const page = {
+    isClosed: () => false,
+    url: () => 'https://docs.google.com/spreadsheets/d/test-sheet-id/edit#gid=1',
+    title: async () => 'Mock Sheet',
+    locator: (selector) => {
+      assert.equal(selector, '#t-name-box');
+      return {
+        click: async () => {},
+        fill: async (value) => {
+          activeRef = String(value || '').toUpperCase();
+        },
+      };
+    },
+    keyboard: {
+      press: async () => {},
+    },
+    waitForTimeout: async () => {},
+    evaluate: async (fn, arg) => {
+      const source = String(fn);
+      if (arg && typeof arg === 'object' && typeof arg.textValue === 'string') {
+        cellValues[activeRef] = arg.textValue;
+        return { after: arg.textValue, lineCount: arg.textValue.split('\n').length };
+      }
+      if (source.includes('createTreeWalker(editor, NodeFilter.SHOW_TEXT)')) {
+        const text = Object.prototype.hasOwnProperty.call(cellValues, activeRef)
+          ? String(cellValues[activeRef])
+          : '';
+        return {
+          text,
+          baseStyle: '',
+          boldRanges: [],
+          lineCount: text.split('\n').length,
+        };
+      }
+      if (source.includes('#waffle-rich-text-editor')) {
+        editorReadCount += 1;
+        return Object.prototype.hasOwnProperty.call(cellValues, activeRef)
+          ? String(cellValues[activeRef])
+          : '';
+      }
+      throw new Error('Unexpected evaluate call in google-sheets mock');
+    },
+  };
+
+  return {
+    page,
+    getEditorReadCount: () => editorReadCount,
+  };
+}
+
 test('plugin helpers are available in execute scope', async () => {
   const pluginHelpers = {
     myHelper: async (page, ctx, state, arg) => `result:${arg}`,
@@ -90,6 +144,33 @@ test('plugin helpers are available in execute scope', async () => {
   assert.equal(result, 'result:hello');
 });
 
+test('pluginCatalog and pluginHelp built-ins are available in execute scope', async () => {
+  const pluginSkillRuntime = {
+    catalog: [{
+      name: 'tagger',
+      description: 'Tags elements quickly',
+      helpers: ['tagger'],
+      sections: ['examples'],
+    }],
+    byName: {
+      tagger: {
+        text: 'Use tagger() to tag.',
+        sections: { examples: '- tagger("hero")' },
+      },
+    },
+  };
+
+  const ctx = buildExecContext(mockPage, mockCtx, {}, {}, {}, {}, {}, pluginSkillRuntime);
+  const catalog = await runCode('return pluginCatalog()', ctx, 5000);
+  assert.deepEqual(catalog, pluginSkillRuntime.catalog);
+
+  const defaultHelp = await runCode('return pluginHelp("tagger")', ctx, 5000);
+  assert.equal(defaultHelp, 'Use tagger() to tag.');
+
+  const sectionHelp = await runCode('return pluginHelp("tagger", "examples")', ctx, 5000);
+  assert.equal(sectionHelp, '- tagger("hero")');
+});
+
 test('built-in helpers always win over plugin helpers with same name', async () => {
   const pluginHelpers = {
     snapshot: async () => 'fake-snapshot-string', // attempt to override
@@ -102,6 +183,34 @@ test('built-in helpers always win over plugin helpers with same name', async ()
   assert.notEqual(result, 'fake-snapshot-string');
 });
 
+test('plugin helpers cannot override pluginCatalog/pluginHelp built-ins', async () => {
+  const pluginHelpers = {
+    pluginCatalog: async () => ['evil'],
+    pluginHelp: async () => 'evil-help',
+  };
+  const pluginSkillRuntime = {
+    catalog: [{ name: 'safe', helpers: [], sections: [] }],
+    byName: { safe: { text: 'safe-help', sections: {} } },
+  };
+
+  const ctx = buildExecContext(
+    mockPage,
+    mockCtx,
+    {},
+    {},
+    pluginHelpers,
+    {},
+    {},
+    pluginSkillRuntime,
+  );
+
+  const catalog = await runCode('return pluginCatalog()', ctx, 5000);
+  assert.deepEqual(catalog, pluginSkillRuntime.catalog);
+
+  const help = await runCode('return pluginHelp("safe")', ctx, 5000);
+  assert.equal(help, 'safe-help');
+});
+
 test('plugin helper receives null page gracefully when no page open', async () => {
   const pluginHelpers = {
     safeHelper: async (page, ctx, state) => page === null ? 'no-page' : 'has-page',
@@ -113,6 +222,213 @@ test('plugin helper receives null page gracefully when no page open', async () =
   assert.equal(result, 'no-page');
 });
 
+test('gsSummarizeSheet reuses cached rows on repeated calls with same options', async () => {
+  const { default: googleSheetsPlugin } = await import('../../plugins/official/google-sheets/index.js');
+  const summarize = googleSheetsPlugin.helpers.gsSummarizeSheet;
+  const { page, getEditorReadCount } = createGoogleSheetsMockPage({
+    A1: 'Level',
+    B1: 'Expectation',
+    A2: 'Junior',
+    B2: 'Owns scoped tasks',
+    A3: '',
+    B3: '',
+  });
+  const state = {};
+  const options = {
+    columns: ['A', 'B'],
+    startRow: 1,
+    maxRows: 6,
+    emptyStreakStop: 1,
+    previewRows: 2,
+  };
+
+  const first = await summarize(page, null, state, options);
+  const readsAfterFirst = getEditorReadCount();
+  assert.equal(first.scan.usedRowCount, 2);
+  assert.ok(readsAfterFirst > 0);
+
+  const second = await summarize(page, null, state, options);
+  const readsAfterSecond = getEditorReadCount();
+  assert.equal(second.scan.usedRowCount, 2);
+  assert.equal(readsAfterSecond, readsAfterFirst);
+});
+
+test('gsSummarizeSheet forceRefresh bypasses cache', async () => {
+  const { default: googleSheetsPlugin } = await import('../../plugins/official/google-sheets/index.js');
+  const summarize = googleSheetsPlugin.helpers.gsSummarizeSheet;
+  const { page, getEditorReadCount } = createGoogleSheetsMockPage({
+    A1: 'Level',
+    B1: 'Expectation',
+    A2: 'Junior',
+    B2: 'Owns scoped tasks',
+    A3: '',
+    B3: '',
+  });
+  const state = {};
+  const options = {
+    columns: ['A', 'B'],
+    startRow: 1,
+    maxRows: 6,
+    emptyStreakStop: 1,
+    previewRows: 2,
+  };
+
+  await summarize(page, null, state, options);
+  const readsAfterFirst = getEditorReadCount();
+  await summarize(page, null, state, { ...options, forceRefresh: true });
+  const readsAfterForceRefresh = getEditorReadCount();
+  assert.ok(readsAfterForceRefresh > readsAfterFirst);
+});
+
+test('gsSummarizeSheet useCache false bypasses cache reads and writes', async () => {
+  const { default: googleSheetsPlugin } = await import('../../plugins/official/google-sheets/index.js');
+  const summarize = googleSheetsPlugin.helpers.gsSummarizeSheet;
+  const { page, getEditorReadCount } = createGoogleSheetsMockPage({
+    A1: 'Level',
+    B1: 'Expectation',
+    A2: 'Junior',
+    B2: 'Owns scoped tasks',
+    A3: '',
+    B3: '',
+  });
+  const state = {};
+  const options = {
+    columns: ['A', 'B'],
+    startRow: 1,
+    maxRows: 6,
+    emptyStreakStop: 1,
+    previewRows: 2,
+    useCache: false,
+  };
+
+  await summarize(page, null, state, options);
+  const readsAfterFirst = getEditorReadCount();
+  await summarize(page, null, state, options);
+  const readsAfterSecond = getEditorReadCount();
+  assert.ok(readsAfterSecond > readsAfterFirst);
+});
+
+test('gsSplitBulletsInRange invalidates gsSummarizeSheet cache after real write', async () => {
+  const { default: googleSheetsPlugin } = await import('../../plugins/official/google-sheets/index.js');
+  const summarize = googleSheetsPlugin.helpers.gsSummarizeSheet;
+  const splitBullets = googleSheetsPlugin.helpers.gsSplitBulletsInRange;
+  const { page, getEditorReadCount } = createGoogleSheetsMockPage({
+    A1: 'Level',
+    B1: 'Expectation',
+    A2: 'Junior',
+    B2: 'Owns scoped tasks',
+    A3: '',
+    B3: '',
+    D2: 'Alpha - Beta',
+  });
+  const state = {};
+  const summarizeOptions = {
+    columns: ['A', 'B'],
+    startRow: 1,
+    maxRows: 6,
+    emptyStreakStop: 1,
+    previewRows: 2,
+  };
+
+  await summarize(page, null, state, summarizeOptions);
+  const readsAfterFirst = getEditorReadCount();
+  await summarize(page, null, state, summarizeOptions);
+  const readsAfterSecond = getEditorReadCount();
+  assert.equal(readsAfterSecond, readsAfterFirst);
+
+  const splitResult = await splitBullets(page, null, state, 'D2:D2', {
+    verify: false,
+    dryRun: false,
+  });
+  assert.equal(splitResult.changed, 1);
+
+  const readsAfterWrite = getEditorReadCount();
+  await summarize(page, null, state, summarizeOptions);
+  const readsAfterThird = getEditorReadCount();
+  assert.ok(readsAfterThird > readsAfterWrite);
+});
+
+test('gsRebalanceBoldInRange invalidates gsSummarizeSheet cache after real write', async () => {
+  const { default: googleSheetsPlugin } = await import('../../plugins/official/google-sheets/index.js');
+  const summarize = googleSheetsPlugin.helpers.gsSummarizeSheet;
+  const rebalanceBold = googleSheetsPlugin.helpers.gsRebalanceBoldInRange;
+  const { page, getEditorReadCount } = createGoogleSheetsMockPage({
+    A1: 'Level',
+    B1: 'Expectation',
+    A2: 'Junior',
+    B2: 'Owns scoped tasks',
+    A3: '',
+    B3: '',
+    D2: 'Alpha Beta',
+  });
+  const state = {};
+  const summarizeOptions = {
+    columns: ['A', 'B'],
+    startRow: 1,
+    maxRows: 6,
+    emptyStreakStop: 1,
+    previewRows: 2,
+  };
+
+  await summarize(page, null, state, summarizeOptions);
+  const readsAfterFirst = getEditorReadCount();
+  await summarize(page, null, state, summarizeOptions);
+  const readsAfterSecond = getEditorReadCount();
+  assert.equal(readsAfterSecond, readsAfterFirst);
+
+  const rebalanceResult = await rebalanceBold(page, null, state, 'D2:D2', {
+    verify: false,
+    dryRun: false,
+    preferredPhrases: ['Alpha'],
+  });
+  assert.equal(rebalanceResult.changed, 1);
+
+  const readsAfterWrite = getEditorReadCount();
+  await summarize(page, null, state, summarizeOptions);
+  const readsAfterThird = getEditorReadCount();
+  assert.ok(readsAfterThird > readsAfterWrite);
+});
+
+test('gsFormatBulletsInRange invalidates gsSummarizeSheet cache after real write', async () => {
+  const { default: googleSheetsPlugin } = await import('../../plugins/official/google-sheets/index.js');
+  const summarize = googleSheetsPlugin.helpers.gsSummarizeSheet;
+  const formatBullets = googleSheetsPlugin.helpers.gsFormatBulletsInRange;
+  const { page, getEditorReadCount } = createGoogleSheetsMockPage({
+    A1: 'Level',
+    B1: 'Expectation',
+    A2: 'Junior',
+    B2: 'Owns scoped tasks',
+    A3: '',
+    B3: '',
+    D2: 'Alpha - Beta',
+  });
+  const state = {};
+  const summarizeOptions = {
+    columns: ['A', 'B'],
+    startRow: 1,
+    maxRows: 6,
+    emptyStreakStop: 1,
+    previewRows: 2,
+  };
+
+  await summarize(page, null, state, summarizeOptions);
+  const readsAfterFirst = getEditorReadCount();
+  await summarize(page, null, state, summarizeOptions);
+  const readsAfterSecond = getEditorReadCount();
+  assert.equal(readsAfterSecond, readsAfterFirst);
+
+  const formatResult = await formatBullets(page, null, state, 'D2:D2', {
+    verify: false,
+    dryRun: false,
+  });
+  assert.equal(formatResult.changed, 1);
+
+  const readsAfterWrite = getEditorReadCount();
+  await summarize(page, null, state, summarizeOptions);
+  const readsAfterThird = getEditorReadCount();
+  assert.ok(readsAfterThird > readsAfterWrite);
+});
+
 test('buildExecContext exposes screenshot and content helpers in execute scope', () => {
   const ctx = buildExecContext(mockPage, mockCtx, {}, {}, {});
   assert.equal(typeof ctx.screenshotWithAccessibilityLabels, 'function');

From 99935a31e5bee8f8f37c68d83c27feb529d1ba88 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 21:59:56 +0530
Subject: [PATCH 166/192] agent-panel: show explicit relay/agent startup
 failure states

- map /chatd-url bootstrap failures to specific startup issue codes (relay unreachable, extension disconnected, agent not running)

- render a visible empty-state error card in the sidebar with actionable guidance and command hints

- keep status-dot text in sync with mapped startup state while preserving existing ready/thinking flow

- add contract tests for startup error mapping and error-state UI styling
---
 extension/agent-panel.css                    |  28 +++++
 extension/agent-panel.js                     | 102 +++++++++++++++++--
 test/agent/agent-panel-contract.test.js      |   9 ++
 test/agent/agent-panel-send-contract.test.js |  17 ++++
 4 files changed, 150 insertions(+), 6 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 4d1e19b..808bca6 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -213,6 +213,12 @@ body {
   padding: 40px 20px;
 }
 
+.empty-state.error-state {
+  align-items: flex-start;
+  text-align: left;
+  gap: 10px;
+}
+
 .empty-icon {
   width: 40px;
   height: 40px;
@@ -226,6 +232,10 @@ body {
   font-size: 18px;
 }
 
+.empty-icon.error {
+  background: var(--error);
+}
+
 .empty-title {
   font-size: 14px;
   font-weight: 500;
@@ -238,6 +248,24 @@ body {
   line-height: 1.5;
 }
 
+.empty-command {
+  margin-top: 8px;
+}
+
+.empty-command code {
+  display: inline-block;
+  background: var(--linen);
+  border: 1px solid var(--line);
+  border-radius: 8px;
+  color: var(--crail-dark);
+  font-family: 'SF Mono', 'Fira Code', 'Cascadia Code', monospace;
+  font-size: 11px;
+  padding: 4px 8px;
+  white-space: pre-wrap;
+  overflow-wrap: anywhere;
+  word-break: break-word;
+}
+
 .message {
   display: flex;
   flex-direction: column;
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 30cb247..066498b 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -25,6 +25,7 @@ const state = {
   eventLoopToken: 0,
   sessionSelectionToken: 0,
   popover: 'none',
+  startupIssue: null,
   status: {
     kind: 'info',
     text: 'Starting...',
@@ -157,6 +158,44 @@ function setStatus(kind, text) {
   syncStatusIndicator();
 }
 
+function normalizeStartupError(code = '', fallbackMessage = 'Unable to connect to BrowserForce Agent') {
+  const normalized = String(code || '').trim().toLowerCase();
+  if (normalized === 'agent_not_running') {
+    return {
+      code: 'agent_not_running',
+      statusText: 'Agent not running',
+      title: 'BrowserForce Agent is not running',
+      detail: 'Relay is reachable, but the local agent daemon (chatd) is offline.',
+      command: 'browserforce agent start',
+    };
+  }
+  if (normalized === 'extension_not_connected') {
+    return {
+      code: 'extension_not_connected',
+      statusText: 'Extension not connected',
+      title: 'Extension is not connected to relay',
+      detail: 'Open the BrowserForce extension popup and reconnect it to the relay.',
+      command: null,
+    };
+  }
+  if (normalized === 'relay_unreachable') {
+    return {
+      code: 'relay_unreachable',
+      statusText: 'Relay unreachable',
+      title: 'Relay is not reachable',
+      detail: 'Start relay first, then retry opening this side panel.',
+      command: 'browserforce serve',
+    };
+  }
+  return {
+    code: 'unknown',
+    statusText: 'Connection failed',
+    title: 'Unable to connect to BrowserForce Agent',
+    detail: fallbackMessage || 'Check relay and agent daemon status, then try again.',
+    command: null,
+  };
+}
+
 function setComposerEnabled(enabled) {
   chatInputEl.disabled = !enabled;
   autoResizeInput();
@@ -579,6 +618,31 @@ function renderTranscript({ preserveScrollTop = null } = {}) {
   }
 
   if (!chunks.length) {
+    const startupIssue = state.startupIssue;
+    if (startupIssue) {
+      const commandHtml = startupIssue.command
+        ? `<p class="empty-command"><code>${escapeHtml(startupIssue.command)}</code></p>`
+        : '';
+      transcriptEl.innerHTML = `
+        <div class="empty-state error-state">
+          <div class="empty-icon error">!</div>
+          <div>
+            <p class="empty-title">${escapeHtml(startupIssue.title || 'Unable to connect')}</p>
+            <p class="empty-sub">${escapeHtml(startupIssue.detail || '')}</p>
+            ${commandHtml}
+          </div>
+        </div>
+      `;
+      bindTranscriptHandlers();
+      if (Number.isFinite(preserveScrollTop)) {
+        transcriptEl.scrollTop = preserveScrollTop;
+      } else {
+        transcriptEl.scrollTop = transcriptEl.scrollHeight;
+      }
+      syncStatusIndicator();
+      syncComposerState();
+      return;
+    }
     transcriptEl.innerHTML = `
       <div class="empty-state">
         <div class="empty-icon">B</div>
@@ -814,10 +878,33 @@ async function getRelayHttpUrl() {
 async function loadAuth() {
   const relayHttpUrl = await getRelayHttpUrl();
   const extensionId = chrome?.runtime?.id;
-  const res = await fetch(`${relayHttpUrl}/chatd-url`, {
-    headers: extensionId ? { 'x-browserforce-extension-id': extensionId } : {},
-  });
-  if (!res.ok) throw new Error('daemon_unavailable');
+  let res;
+  try {
+    res = await fetch(`${relayHttpUrl}/chatd-url`, {
+      headers: extensionId ? { 'x-browserforce-extension-id': extensionId } : {},
+    });
+  } catch {
+    const error = new Error('relay_unreachable');
+    error.code = 'relay_unreachable';
+    throw error;
+  }
+  if (!res.ok) {
+    const body = await readJsonOrEmpty(res);
+    const relayError = String(body?.error || '').toLowerCase();
+    if (res.status === 404 && relayError.includes('chatd not running')) {
+      const error = new Error('agent_not_running');
+      error.code = 'agent_not_running';
+      throw error;
+    }
+    if (res.status === 503 && relayError.includes('extension not connected')) {
+      const error = new Error('extension_not_connected');
+      error.code = 'extension_not_connected';
+      throw error;
+    }
+    const error = new Error(body?.error || `chatd-url failed (${res.status})`);
+    error.code = 'daemon_unavailable';
+    throw error;
+  }
   const body = await res.json();
   state.auth = {
     baseUrl: `http://127.0.0.1:${body.port}`,
@@ -1188,6 +1275,7 @@ popoverBackdropEl.addEventListener('click', () => {
 
 (async function init() {
   try {
+    state.startupIssue = null;
     setComposerEnabled(false);
     setStatus('info', 'Connecting...');
     render();
@@ -1209,9 +1297,11 @@ popoverBackdropEl.addEventListener('click', () => {
     scheduleTabAttachRefresh(0);
     setStatus('ready', 'Ready');
     render();
-  } catch {
+  } catch (error) {
+    state.startupIssue = normalizeStartupError(error?.code, error?.message);
     setComposerEnabled(false);
     setTabAttachBannerState({ hidden: true });
-    setStatus('error', 'Daemon unavailable');
+    setStatus('error', state.startupIssue.statusText || 'Daemon unavailable');
+    render();
   }
 })();
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index 45a65f2..4dd1518 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -75,3 +75,12 @@ test('reasoning title rows use shimmer and enter transition treatment', () => {
   assert.match(css, /@keyframes reasoning-title-in/);
   assert.match(css, /@media\s*\(prefers-reduced-motion:\s*reduce\)/);
 });
+
+test('agent panel includes visible startup error empty-state treatment', () => {
+  assert.match(panelJs, /state\.startupIssue = null/);
+  assert.match(panelJs, /class="empty-state error-state"/);
+  assert.match(panelJs, /empty-command/);
+  assert.match(css, /\.empty-state\.error-state/);
+  assert.match(css, /\.empty-icon\.error/);
+  assert.match(css, /\.empty-command code/);
+});
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 2c062db..3ad524f 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -130,3 +130,20 @@ test('stale run pointer is reconciled from loaded messages so stop does not stay
   assert.match(js, /state\.currentRunBySession = clearSessionRunId\(state\.currentRunBySession, sessionId, runId\)/);
   assert.match(js, /async function loadMessages\(sessionId\)[\s\S]*reconcileSessionRunState\(sessionId\)/);
 });
+
+test('init maps relay/chatd boot failures into explicit startup issues', () => {
+  assert.match(js, /function normalizeStartupError\(code = '', fallbackMessage = 'Unable to connect to BrowserForce Agent'\)/);
+  assert.match(js, /agent_not_running/);
+  assert.match(js, /extension_not_connected/);
+  assert.match(js, /relay_unreachable/);
+  assert.match(js, /browserforce agent start/);
+  assert.match(js, /browserforce serve/);
+  assert.match(js, /state\.startupIssue = normalizeStartupError\(error\?\.code, error\?\.message\)/);
+});
+
+test('chatd-url auth bootstrap reports specific failure codes before generic daemon unavailable', () => {
+  assert.match(js, /async function loadAuth\(\)/);
+  assert.match(js, /if \(res\.status === 404 && relayError\.includes\('chatd not running'\)\)/);
+  assert.match(js, /if \(res\.status === 503 && relayError\.includes\('extension not connected'\)\)/);
+  assert.match(js, /error\.code = 'daemon_unavailable'/);
+});

From 5a3aea461a988e243e5a95609ee6d3f50d2406c1 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 22:18:05 +0530
Subject: [PATCH 167/192] agent-panel: add collapsed execute helper tree
 preview

- infer likely plugin/helper function calls from BrowserForce:execute detail lines

- render a tree-like branch preview under collapsed execute rows while keeping expanded raw details unchanged

- add lightweight branch styling and status-aware colors for running/done execute states

- extend panel contract tests for helper preview hooks and tree CSS
---
 extension/agent-panel.css                    | 49 +++++++++-
 extension/agent-panel.js                     | 98 +++++++++++++++++++-
 test/agent/agent-panel-contract.test.js      |  7 ++
 test/agent/agent-panel-send-contract.test.js |  9 ++
 4 files changed, 160 insertions(+), 3 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 808bca6..d21c972 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -377,11 +377,15 @@ body {
   padding: 0;
   color: inherit;
   cursor: pointer;
+  display: block;
+  text-align: left;
+}
+
+.step-toggle-main {
   display: flex;
   align-items: flex-start;
   justify-content: space-between;
   gap: 10px;
-  text-align: left;
 }
 
 .step-item.collapsible .step-label {
@@ -390,6 +394,49 @@ body {
   text-overflow: ellipsis;
 }
 
+.step-branch-preview {
+  list-style: none;
+  margin: 6px 0 0;
+  padding: 0 0 0 12px;
+  border-left: 1px solid var(--line);
+  display: flex;
+  flex-direction: column;
+  gap: 4px;
+}
+
+.step-branch-node {
+  position: relative;
+  padding-left: 10px;
+}
+
+.step-branch-node::before {
+  content: '';
+  position: absolute;
+  left: 0;
+  top: 7px;
+  width: 8px;
+  border-top: 1px solid var(--line);
+}
+
+.step-branch-call {
+  display: block;
+  font-size: 11px;
+  line-height: 1.35;
+  color: var(--text-muted);
+  font-family: 'SF Mono', 'Fira Code', 'Cascadia Code', monospace;
+  white-space: nowrap;
+  overflow: hidden;
+  text-overflow: ellipsis;
+}
+
+.step-branch-preview.done .step-branch-call {
+  color: var(--ok);
+}
+
+.step-item.latest .step-branch-preview.running .step-branch-call {
+  color: var(--crail-dark);
+}
+
 .step-caret::before {
   content: '›';
   display: inline-block;
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 066498b..2600482 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -459,6 +459,96 @@ function normalizeRunTimeline(run, fallbackText = '') {
   return timeline;
 }
 
+const EXECUTE_HELPER_EXCLUDE_CALLS = new Set([
+  'if',
+  'for',
+  'while',
+  'switch',
+  'catch',
+  'snapshot',
+  'reftolocator',
+  'waitforpageload',
+  'getlogs',
+  'clearlogs',
+  'screenshotwithaccessibilitylabels',
+  'cleanhtml',
+  'pagemarkdown',
+  'getcdpsession',
+  'plugincatalog',
+  'pluginhelp',
+  'fetch',
+  'settimeout',
+  'cleartimeout',
+  'promise',
+  'array',
+  'object',
+  'number',
+  'string',
+  'boolean',
+  'date',
+  'math',
+  'json',
+  'parseint',
+  'parsefloat',
+  'isnan',
+  'isfinite',
+  'encodeuri',
+  'decodeuri',
+]);
+
+function isBrowserForceExecuteStep(entry) {
+  const label = String(entry?.label || '').trim().toLowerCase();
+  return (
+    label === 'browserforce:execute'
+    || label === 'browserforce execute'
+    || label === 'mcp__browserforce__execute'
+    || label === 'execute'
+  );
+}
+
+function extractExecuteHelperCalls(details) {
+  if (!Array.isArray(details) || details.length === 0) return [];
+  const helperCalls = [];
+  const seen = new Set();
+  const callPattern = /(^|[^.\w$])([A-Za-z_$][\w$]{2,})\s*\(/g;
+
+  for (const line of details) {
+    const text = String(line || '');
+    if (!text) continue;
+    callPattern.lastIndex = 0;
+    for (const match of text.matchAll(callPattern)) {
+      const callName = String(match[2] || '').trim();
+      if (!callName) continue;
+      const normalized = callName.toLowerCase();
+      if (EXECUTE_HELPER_EXCLUDE_CALLS.has(normalized)) continue;
+      if (seen.has(normalized)) continue;
+      seen.add(normalized);
+      helperCalls.push(callName);
+      if (helperCalls.length >= 3) return helperCalls;
+    }
+  }
+
+  return helperCalls;
+}
+
+function renderExecuteHelperTreePreview(entry, expanded) {
+  if (expanded) return '';
+  if (!isBrowserForceExecuteStep(entry)) return '';
+  const details = Array.isArray(entry?.details) ? entry.details : [];
+  const helperCalls = extractExecuteHelperCalls(details);
+  if (!helperCalls.length) return '';
+  const status = String(entry?.status || '').toLowerCase() === 'done' ? 'done' : 'running';
+  return `
+    <ul class="step-branch-preview ${status}">
+      ${helperCalls.map((callName) => `
+        <li class="step-branch-node">
+          <span class="step-branch-call">${escapeHtml(callName)}()</span>
+        </li>
+      `).join('')}
+    </ul>
+  `;
+}
+
 function getLatestInFlightTimelineStepIndex(run, timeline) {
   if (!run || run.done) return -1;
   for (let index = timeline.length - 1; index >= 0; index -= 1) {
@@ -531,6 +621,7 @@ function renderRunTimeline(run, fallbackText = '') {
     const key = getTimelineEntryKey(entry, index);
     const expanded = !!state.expandedTimelineEntries[key];
     if (expanded) classes.push('expanded');
+    const helperTreePreviewHtml = renderExecuteHelperTreePreview(entry, expanded);
     const detailsHtml = details
       .map((line) => `<li>${renderInlineContent(line)}</li>`)
       .join('');
@@ -539,8 +630,11 @@ function renderRunTimeline(run, fallbackText = '') {
         <span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span>
         <div class="step-body">
           <button type="button" class="step-toggle" data-step-key="${escapeHtml(key)}" aria-expanded="${expanded ? 'true' : 'false'}">
-            <span class="${labelClasses.join(' ')}">${renderInlineContent(entry.label || 'Step')}</span>
-            <span class="step-caret" aria-hidden="true"></span>
+            <span class="step-toggle-main">
+              <span class="${labelClasses.join(' ')}">${renderInlineContent(entry.label || 'Step')}</span>
+              <span class="step-caret" aria-hidden="true"></span>
+            </span>
+            ${helperTreePreviewHtml}
           </button>
           ${expanded ? `<ul class="step-details">${detailsHtml}</ul>` : ''}
         </div>
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index 4dd1518..fbb9bca 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -84,3 +84,10 @@ test('agent panel includes visible startup error empty-state treatment', () => {
   assert.match(css, /\.empty-icon\.error/);
   assert.match(css, /\.empty-command code/);
 });
+
+test('collapsed execute helper preview has tree-like branch styling', () => {
+  assert.match(css, /\.step-branch-preview/);
+  assert.match(css, /\.step-branch-node/);
+  assert.match(css, /\.step-branch-node::before/);
+  assert.match(css, /\.step-branch-call/);
+});
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 3ad524f..90feeeb 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -147,3 +147,12 @@ test('chatd-url auth bootstrap reports specific failure codes before generic dae
   assert.match(js, /if \(res\.status === 503 && relayError\.includes\('extension not connected'\)\)/);
   assert.match(js, /error\.code = 'daemon_unavailable'/);
 });
+
+test('collapsed BrowserForce execute rows infer helper calls and render branch preview', () => {
+  assert.match(js, /function extractExecuteHelperCalls\(/);
+  assert.match(js, /function renderExecuteHelperTreePreview\(/);
+  assert.match(js, /isBrowserForceExecuteStep/);
+  assert.match(js, /step-branch-preview/);
+  assert.match(js, /class="step-branch-node"/);
+  assert.match(js, /class="step-branch-call"/);
+});

From c637698d284523a5bfc5b414b3dee668e65d9a69 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 22:41:17 +0530
Subject: [PATCH 168/192] agent-panel: prioritize top attach-progress state
 during auto-connect

- move initial auto-attach progress from bottom context note into top tab-attach banner

- suppress "Current tab is not connected" prompt while initial attach is still in flight

- keep context usage note focused on usage-only text and update contract tests
---
 extension/agent-panel.js                     | 20 +++++++++++++++++---
 test/agent/agent-panel-send-contract.test.js | 13 ++++++++-----
 2 files changed, 25 insertions(+), 8 deletions(-)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 2600482..67a7722 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -140,9 +140,7 @@ function renderContextUsageChip() {
   const sessionId = state.value.activeSessionId;
   const usage = sessionId ? state.value.latestUsageBySession?.[sessionId] : null;
   const formatted = formatContextUsage(usage || {});
-  const note = state.initialTabAttachInFlight
-    ? 'Attaching active tab...'
-    : (formatted ? `Context: ${formatted}` : '');
+  const note = formatted ? `Context: ${formatted}` : '';
   contextUsageEl.classList.toggle('hidden', !note);
   if (!note) {
     contextUsageEl.textContent = '';
@@ -217,6 +215,16 @@ function setTabAttachBannerState({
   attachCurrentTabBtn.textContent = busy ? 'Attaching...' : 'Attach current tab';
 }
 
+function getTabAttachInProgressState() {
+  if (!state.initialTabAttachInFlight) return null;
+  return {
+    hidden: false,
+    text: 'Currently attaching active tab...',
+    canAttach: false,
+    busy: true,
+  };
+}
+
 function dispatch(action) {
   state.value = reduceState(state.value, action);
   render();
@@ -895,6 +903,11 @@ async function getCurrentTabAttachmentState() {
 
 async function refreshTabAttachBanner() {
   const token = ++tabAttachRefreshToken;
+  const inProgressState = getTabAttachInProgressState();
+  if (inProgressState) {
+    setTabAttachBannerState(inProgressState);
+    return;
+  }
   const next = await getCurrentTabAttachmentState();
   if (token !== tabAttachRefreshToken) return;
   setTabAttachBannerState(next);
@@ -931,6 +944,7 @@ function startInitialTabAttach() {
   if (state.initialTabAttachStarted) return;
   state.initialTabAttachStarted = true;
   state.initialTabAttachInFlight = true;
+  setTabAttachBannerState(getTabAttachInProgressState() || undefined);
   renderContextUsageChip();
   window.setTimeout(() => {
     ensureCurrentTabAttached()
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 90feeeb..ef01071 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -73,7 +73,7 @@ test('assistant transcript prefers ordered run timeline over grouped run steps',
 test('context usage renderer hides element when unavailable and only shows formatted values', () => {
   assert.match(js, /function renderContextUsageChip\(\)/);
   assert.match(js, /latestUsageBySession/);
-  assert.match(js, /const note = state\.initialTabAttachInFlight[\s\S]*formatted[\s\S]*Context: \$\{formatted\}/);
+  assert.match(js, /const note = formatted \? `Context: \$\{formatted\}` : '';/);
   assert.match(js, /contextUsageEl\.classList\.toggle\('hidden', !note\)/);
   assert.match(js, /contextUsageEl\.textContent = note/);
   assert.doesNotMatch(js, /Context:\s*unavailable/);
@@ -85,10 +85,13 @@ test('init opens smoothly by starting tab attach asynchronously', () => {
   assert.doesNotMatch(js, /\(async function init\(\)[\s\S]*await ensureCurrentTabAttached\(\);/);
 });
 
-test('bottom note can show async attach status and still hides when no note is available', () => {
-  assert.match(js, /initialTabAttachInFlight:\s*false/);
-  assert.match(js, /state\.initialTabAttachInFlight\s*\?\s*'Attaching active tab\.\.\.'/);
-  assert.match(js, /contextUsageEl\.classList\.toggle\('hidden', !note\)/);
+test('tab-attach banner shows progress during initial auto-attach and suppresses not-connected state', () => {
+  assert.match(js, /function getTabAttachInProgressState\(\)/);
+  assert.match(js, /text:\s*'Currently attaching active tab\.\.\.'/);
+  assert.match(js, /busy:\s*true/);
+  assert.match(js, /async function refreshTabAttachBanner\(\)[\s\S]*getTabAttachInProgressState\(\)/);
+  assert.match(js, /setTabAttachBannerState\(inProgressState\);/);
+  assert.match(js, /function startInitialTabAttach\(\)[\s\S]*setTabAttachBannerState\(getTabAttachInProgressState\(\) \|\| undefined\);/);
 });
 
 test('initial tab attach waits 2 seconds before attaching', () => {

From 7671b3a8e0b102e98a217e67000dbaf8440cb5a5 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 22:51:39 +0530
Subject: [PATCH 169/192] docs(plugins): document internal/private plugin
 workflow

- add internal plugin workflow guidance for ~/.browserforce/plugins/<name>

- clarify no .gitignore change is needed for home-directory plugins

- recommend private Git repos for versioning internal plugins

- document local-dev -> plugins/community promotion path after validation
---
 docs/BUILDING_PLUGINS.md | 117 +++++++++++++++++++++++++++++----------
 docs/PLUGINS.md          |  43 ++++++++++----
 2 files changed, 121 insertions(+), 39 deletions(-)

diff --git a/docs/BUILDING_PLUGINS.md b/docs/BUILDING_PLUGINS.md
index 34e6928..5a0fa11 100644
--- a/docs/BUILDING_PLUGINS.md
+++ b/docs/BUILDING_PLUGINS.md
@@ -1,11 +1,42 @@
 # Building BrowserForce Plugins
 
-Adding a plugin extends BrowserForce for yourself or the whole community. Personal plugins stay in `~/.browserforce/plugins/` and are never shared unless you choose to. Public plugins get reviewed and merged into the repo, appearing in the plugin directory for anyone to install.
+Adding a plugin extends BrowserForce for yourself or the whole community. Personal plugins stay in `~/.browserforce/plugins/<name>/` and are never shared unless you choose to. Public plugins get reviewed and merged into the repo, appearing in the plugin directory for anyone to install.
+
+Repo plugin layout is:
+- Official: `plugins/official/<name>/SKILL.md`
+- Community: `plugins/community/<name>/SKILL.md`
+
+No migration to `plugin/skills/<name>/` is required.
 
 This guide walks through everything: building, testing, and submitting a plugin.
 
 ---
 
+## Internal Plugins (Private Workflow)
+
+If your plugin is for internal QA or company-only automation, keep it local:
+
+- Create it at `~/.browserforce/plugins/<plugin-name>/`.
+- Keep plugin folders flat under `~/.browserforce/plugins/` (the loader scans one level deep).
+- Do not place internal plugins in this repo unless you intend to publish them.
+
+Because `~/.browserforce/plugins` is outside this repository, you do not need to change this repo's `.gitignore` for internal plugins.
+
+If you want version history, track internal plugins in a private Git repository:
+
+- Option 1: Run `git init` directly inside `~/.browserforce/plugins/<plugin-name>/` and push to a private remote.
+- Option 2: Keep plugins in a separate private repo directory and copy/sync them into `~/.browserforce/plugins/<plugin-name>/` when developing.
+
+Recommended development flow:
+
+1. Build and iterate in `~/.browserforce/plugins/<plugin-name>/` for fast local testing.
+2. Validate behavior end to end until checks pass.
+3. Promote the plugin to the BrowserForce repo at `plugins/community/<plugin-name>/` only when it is ready to publish.
+
+This keeps internal plugins private while still making them reproducible for your team.
+
+---
+
 ## 1. Build Your First Plugin
 
 ### Step 1 — Create the folder
@@ -18,7 +49,7 @@ touch ~/.browserforce/plugins/highlight/SKILL.md
 
 ### Step 2 — Write the export
 
-Start with just `name` and one helper. Here is a complete `highlight.js` plugin that visually highlights any element on the page:
+Start with just `name` and one helper. Here is a complete `highlight` plugin that visually highlights any element on the page:
 
 ```js
 // ~/.browserforce/plugins/highlight/index.js
@@ -108,11 +139,11 @@ await highlight(page, '.price', '#f0f', 0);       // permanent magenta on price
 
 ### Step 5 — Write a SKILL.md companion
 
-See [Section 4](#4-the-skillmd-companion) for what to include.
+See [Section 3](#3-the-skillmd-companion) for what to include.
 
 ### Step 6 — Submit as a PR (optional)
 
-See [Section 8](#8-submitting-a-plugin-pr-checklist) for the full checklist.
+See [Section 7](#7-submitting-a-plugin-pr-checklist) for the full checklist.
 
 ---
 
@@ -210,39 +241,69 @@ export default {
 
 ## 3. The SKILL.md Companion
 
-Every plugin should ship a `SKILL.md` alongside the `.js` file. This file is read by the AI agent at startup. It tells the agent when to use the plugin, when not to, and how to call it correctly. Without it, the agent has no context for the plugin's capabilities.
+Every plugin should ship a `SKILL.md` alongside `index.js`. BrowserForce now uses a metadata-first prompt model:
+
+- Default prompt includes plugin metadata only.
+- Agents call `pluginCatalog()` to discover plugins and helpers.
+- Agents call `pluginHelp(name, section?)` when full or sectioned detail is needed.
 
-**Required sections:**
+### Metadata source of truth
+
+For plugin prompt metadata, `SKILL.md` frontmatter is the source of truth. Do not treat `index.js` fields as metadata authority for prompt docs.
+
+### Frontmatter contract
+
+Frontmatter is expected at the top of `SKILL.md`:
 
 ```markdown
-# highlight plugin
+---
+name: highlight
+description: Visual outlining helpers for matching elements.
+when_to_use: ["Debugging selectors", "Previewing click targets"]
+helpers: ["highlight", "clearHighlights"]
+tools: []
+---
+```
 
-Use `highlight(page, selector, color, duration)` / `clearHighlights(page)` when you need to:
-- Visually mark an element for debugging or demonstration
-- Show a user which element the agent is about to interact with
-- Annotate a screenshot for reporting
+Supported canonical keys:
 
-## When NOT to use this
-- Don't highlight before taking a screenshot if you need the original unmodified view
-- Don't leave permanent highlights (duration: 0) unless intentional — they persist across agent turns
+| Key | Status | Type | Notes |
+| --- | --- | --- | --- |
+| `name` | Required | string | Plugin metadata name shown in catalog. |
+| `description` | Required | string | Short summary shown in metadata-only prompt and `pluginCatalog()`. |
+| `helpers` | Optional | JSON array string | Helper names this plugin exposes. |
+| `tools` | Optional | JSON array string | MCP tool names this plugin exposes. |
+| `when_to_use` | Optional | JSON array string or block scalar | Guidance for agent selection behavior. |
 
-## Parameters
-- `selector` — any valid CSS selector
-- `color` — any CSS color value: `'#f90'`, `'red'`, `'rgba(255,0,0,0.3)'`
-- `duration` — milliseconds to hold the highlight; `0` = permanent until `clearHighlights()`
+Notes:
+- Unknown frontmatter keys are ignored.
+- Keep arrays as JSON (`["item-a", "item-b"]`) for reliable parsing.
+- Block scalars (`|` or `>`) are supported for multiline text fields.
 
-## Example
-\`\`\`js
-// Highlight the submit button in orange for 3 seconds
-const { found } = await highlight(page, 'button[type="submit"]', '#f90', 3000);
-if (!found) return 'Submit button not found on this page';
-\`\`\`
+### SKILL body guidance (on-demand help)
+
+The markdown body after frontmatter powers `pluginHelp(...)`. Keep it structured with `##` sections so section lookup is useful.
 
-## Common mistakes
-- Calling `highlight` on a selector that matches zero elements — always check `result.found`
-- Forgetting to `clearHighlights()` before capturing a clean screenshot
+Recommended sections:
+
+```markdown
+## when to use
+## when not to use
+## parameters
+## examples
+## common mistakes
 ```
 
+### Legacy SKILL migration (no frontmatter)
+
+Legacy `SKILL.md` files without frontmatter still load, but they only provide on-demand body help and no structured metadata.
+
+Migration steps:
+1. Add a top `--- ... ---` frontmatter block with `name` and `description`.
+2. Add optional canonical keys (`helpers`, `tools`, `when_to_use`) as needed.
+3. Keep existing markdown body content below frontmatter.
+4. Restart MCP after install/update to refresh loaded metadata and help text.
+
 ---
 
 ## 4. Rules — What's Not Allowed
@@ -433,7 +494,7 @@ const { found } = await highlight(page, 'h1', '#f90');
 ## Full Plugin Shape Reference
 
 ```js
-// ~/.browserforce/plugins/my-plugin.js
+// ~/.browserforce/plugins/my-plugin/index.js
 
 export default {
   // Required. Unique across all plugins.
diff --git a/docs/PLUGINS.md b/docs/PLUGINS.md
index 0a0c362..bd3f754 100644
--- a/docs/PLUGINS.md
+++ b/docs/PLUGINS.md
@@ -2,12 +2,12 @@
 
 Extend BrowserForce with local JS files — no framework, no build step, no registry.
 
-Plugins live in `~/.browserforce/plugins/`. Each file exports a plain object. The MCP server loads them at startup and merges their helpers, tools, and hooks into the runtime.
+Plugins live in `~/.browserforce/plugins/<name>/`. Each plugin folder exports a plain object from `index.js`. The MCP server loads plugins at startup and merges their helpers, tools, and hooks into the runtime.
 
 **Minimal plugin — 10 lines:**
 
 ```js
-// ~/.browserforce/plugins/hello.js
+// ~/.browserforce/plugins/hello/index.js
 export default {
   name: 'hello',
   helpers: {
@@ -25,12 +25,30 @@ After installing, `greet(page)` is available as a global inside every `execute()
 
 ## How to Install a Plugin
 
-1. Drop a `.js` file in `~/.browserforce/plugins/`
-2. Restart the MCP server
+1. Drop a plugin folder at `~/.browserforce/plugins/<name>/` with at least `index.js`
+2. Restart the MCP server after every plugin install or update
 3. Done — helpers are injected, tools are registered
 
 No config changes. No manifest edits. The directory is auto-scanned on startup.
 
+### Internal and private plugins
+
+For company-internal plugins, use local folders under `~/.browserforce/plugins/<name>/`.
+
+- No `.gitignore` update is needed in this repo (that directory is outside repo git tracking).
+- Keep plugin folders one level deep (for example `~/.browserforce/plugins/ufe-qa/`).
+- If you need collaboration/versioning, track the plugin in a private Git repo and push there instead of the public BrowserForce repo.
+- Recommended flow: develop and test locally in `~/.browserforce/plugins/<name>/`, then move to `plugins/community/<name>/` in BrowserForce when all checks pass and you want to publish.
+
+### Prompt behavior (metadata-first)
+
+`SKILL.md` is no longer fully appended to the default `execute()` prompt. BrowserForce provides metadata first, then on-demand help:
+
+- `pluginCatalog()` returns installed plugin metadata (`name`, `description`, `helpers`, `sections`)
+- `pluginHelp(name, section?)` returns full `SKILL.md` text or just one section when requested
+
+Use `pluginCatalog()` before calling `pluginHelp(...)`; do not fetch every plugin's full help by default.
+
 ---
 
 ## For Developers
@@ -335,7 +353,7 @@ A single JSON file at `plugins/registry.json` in the repo is the source of truth
 | `audience`     | `"developer"`, `"headless"`, or both                                |
 | `capabilities` | Which plugin surfaces it uses: `helpers`, `tools`, `hooks`, `setup` |
 | `file`         | Path to `index.js` in the repo — fetched on install                 |
-| `skill`        | Path to `SKILL.md` — fetched on install, injected into AI context   |
+| `skill`        | Path to `SKILL.md` — fetched on install; metadata is exposed by default, full text via `pluginHelp(...)` |
 
 
 ---
@@ -354,7 +372,7 @@ Chrome extensions have no filesystem access. The relay runs at `127.0.0.1:19222`
 
 ```
 Extension UI
-    │  POST /plugins/install { name: "network" }
+    │  POST /v1/plugins/install { name: "network" }
     ▼
 Relay (127.0.0.1:19222)
     │  fetches index.js + SKILL.md from GitHub
@@ -368,9 +386,11 @@ Relay (127.0.0.1:19222)
 
 | Method   | Path               | Action                                       |
 | -------- | ------------------ | -------------------------------------------- |
-| `GET`    | `/plugins`         | List installed plugins + their metadata      |
-| `POST`   | `/plugins/install` | Download plugin from registry, write to disk |
-| `DELETE` | `/plugins/:name`   | Remove plugin file from disk                 |
+| `GET`    | `/v1/plugins`         | List installed plugins + their metadata      |
+| `POST`   | `/v1/plugins/install` | Download plugin from registry, write to disk |
+| `DELETE` | `/v1/plugins/:name`   | Remove plugin file from disk                 |
+
+Legacy non-versioned paths (`/plugins*`) remain accepted for backward compatibility.
 
 
 Plugins take effect on next MCP server restart (the extension shows a restart prompt).
@@ -398,7 +418,7 @@ browserforce plugin remove network
 browserforce plugin status
 ```
 
-`plugin install` fetches the JS directly from GitHub's raw content URL and writes it to `~/.browserforce/plugins/`. Same outcome as the extension UI, different path.
+`plugin install` fetches `index.js` (and `SKILL.md` when available) from GitHub and writes them to `~/.browserforce/plugins/<name>/`. Same outcome as the extension UI, different path.
 
 ---
 
@@ -427,6 +447,7 @@ plugins/
 ```
 
 Official plugins are maintained by the BrowserForce team. Community plugins are reviewed for safety (no `eval`, no network calls to external servers, no credential exfiltration) before merge.
+This layout is current and supported: `plugins/official/<name>/SKILL.md` and `plugins/community/<name>/SKILL.md`. No migration to `plugin/skills/<name>/` is required.
 
 ---
 
@@ -436,7 +457,7 @@ Plugins are arbitrary JS running in Node.js — they have full filesystem and ne
 
 - **Official plugins**: reviewed and maintained by BrowserForce
 - **Community plugins**: reviewed before merge (same bar as official)
-- **Local plugins**: `~/.browserforce/plugins/*.js` — user's own files, not from the registry, fully trusted
+- **Local plugins**: `~/.browserforce/plugins/<name>/` — user's own plugin folders, not from the registry, fully trusted
 
 The relay install endpoint only fetches from the known GitHub repo URL — no arbitrary URLs. The extension UI only shows registry plugins. Users who want to run untrusted code drop files manually into the plugins folder.
 

From 20bd89077446bc8c35c10e7ab7722ee8755cb203 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 22:37:05 +0530
Subject: [PATCH 170/192] agent: add per-session reasoning effort support

- persist reasoningEffort in session metadata with strict low/medium/high/xhigh validation

- resolve run effort from session value, then config/environment defaults, with medium fallback

- pass effort to codex exec via -c model_reasoning_effort=... in runner args

- expose configured defaultReasoningEffort in /v1/models for panel hydration

- extend agent tests for session-store, codex-runner, and chatd API coverage
---
 agent/src/chatd.js               | 50 +++++++++++++++++--
 agent/src/codex-runner.js        | 18 ++++++-
 agent/src/session-store.js       | 21 +++++++-
 test/agent/chatd-api.test.js     | 82 +++++++++++++++++++++++++++++++-
 test/agent/codex-runner.test.js  |  5 ++
 test/agent/session-store.test.js | 14 +++++-
 6 files changed, 181 insertions(+), 9 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 2cfdea4..a67b965 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -14,6 +14,7 @@ import {
   createSession,
   getSession,
   isValidModelId,
+  isValidReasoningEffort,
   isValidSessionId,
   listSessions,
   readMessages,
@@ -24,6 +25,7 @@ const BF_DIR = join(homedir(), '.browserforce');
 const CHATD_URL_PATH = join(BF_DIR, 'chatd-url.json');
 const CODEX_CONFIG_PATH = join(homedir(), '.codex', 'config.toml');
 const MODEL_LIST_TIMEOUT_MS = 5000;
+const DEFAULT_REASONING_EFFORT = 'medium';
 
 function parseTopLevelTomlString(raw, key) {
   const lines = String(raw || '').split(/\r?\n/);
@@ -55,6 +57,20 @@ async function resolveConfiguredModel() {
   return null;
 }
 
+async function resolveConfiguredReasoningEffort() {
+  const envEffort = String(process.env.BF_CHATD_DEFAULT_REASONING_EFFORT || '').trim().toLowerCase();
+  if (envEffort && isValidReasoningEffort(envEffort)) return envEffort;
+
+  try {
+    const raw = await fs.readFile(CODEX_CONFIG_PATH, 'utf8');
+    const effort = String(parseTopLevelTomlString(raw, 'model_reasoning_effort') || '').trim().toLowerCase();
+    if (effort && isValidReasoningEffort(effort)) return effort;
+  } catch {
+    // no local codex config is fine
+  }
+  return DEFAULT_REASONING_EFFORT;
+}
+
 function dedupeModelRows(rows) {
   const seen = new Set();
   const out = [{ value: null, label: 'Default' }];
@@ -214,6 +230,16 @@ async function listModelPresets({ storageRoot, modelFetcher } = {}) {
   return dedupeModelRows([...liveRows, ...configuredRow, ...sessionRows]);
 }
 
+function resolveEffectiveReasoningEffort(sessionReasoningEffort, fallbackReasoningEffort = DEFAULT_REASONING_EFFORT) {
+  const sessionValue = String(sessionReasoningEffort || '').trim().toLowerCase();
+  if (sessionValue && isValidReasoningEffort(sessionValue)) return sessionValue;
+
+  const fallbackValue = String(fallbackReasoningEffort || '').trim().toLowerCase();
+  if (fallbackValue && isValidReasoningEffort(fallbackValue)) return fallbackValue;
+
+  return DEFAULT_REASONING_EFFORT;
+}
+
 function nowIso() {
   return new Date().toISOString();
 }
@@ -923,11 +949,12 @@ async function clearChatdUrlFile({ writeChatdUrl = true, urlPath = CHATD_URL_PAT
 }
 
 function createDefaultRunExecutor({ codexCwd } = {}) {
-  return ({ runId, sessionId, message, model, resumeSessionId, onEvent, onExit, onError }) => startCodexRun({
+  return ({ runId, sessionId, message, model, reasoningEffort, resumeSessionId, onEvent, onExit, onError }) => startCodexRun({
     runId,
     sessionId,
     prompt: message,
     model,
+    reasoningEffort,
     resumeSessionId,
     cwd: codexCwd,
     onEvent,
@@ -949,6 +976,10 @@ export async function startChatd(opts = {}) {
     command: opts.codexCommand || process.env.BF_CHATD_CODEX_COMMAND || 'codex',
     timeoutMs: Number(process.env.BF_CHATD_MODEL_LIST_TIMEOUT_MS || MODEL_LIST_TIMEOUT_MS),
   }));
+  const configuredReasoningEffort = resolveEffectiveReasoningEffort(
+    opts.defaultReasoningEffort,
+    await resolveConfiguredReasoningEffort(),
+  );
 
   let desiredPort = Number.isFinite(opts.port) ? Number(opts.port) : Number(process.env.BF_CHATD_PORT || 0);
   if (!Number.isInteger(desiredPort) || desiredPort < 0) desiredPort = 0;
@@ -1063,7 +1094,7 @@ export async function startChatd(opts = {}) {
 
       if (url.pathname === '/v1/models' && req.method === 'GET') {
         const models = await listModelPresets({ storageRoot, modelFetcher });
-        json(res, 200, { models });
+        json(res, 200, { models, defaultReasoningEffort: configuredReasoningEffort });
         return;
       }
 
@@ -1079,6 +1110,7 @@ export async function startChatd(opts = {}) {
           const session = await createSession({
             title: body.title || 'New chat',
             model: body.model ?? null,
+            reasoningEffort: body.reasoningEffort ?? null,
             storageRoot,
           });
           json(res, 201, session);
@@ -1125,6 +1157,7 @@ export async function startChatd(opts = {}) {
             patch: {
               ...(Object.prototype.hasOwnProperty.call(body, 'title') ? { title: body.title } : {}),
               ...(Object.prototype.hasOwnProperty.call(body, 'model') ? { model: body.model } : {}),
+              ...(Object.prototype.hasOwnProperty.call(body, 'reasoningEffort') ? { reasoningEffort: body.reasoningEffort } : {}),
             },
             storageRoot,
           });
@@ -1215,6 +1248,10 @@ export async function startChatd(opts = {}) {
         }
         const browserContext = normalizeBrowserContext(body?.browserContext);
         const promptMessage = buildRunPrompt({ message, browserContext });
+        const runReasoningEffort = resolveEffectiveReasoningEffort(
+          session.reasoningEffort,
+          configuredReasoningEffort,
+        );
 
         const runId = randomBytes(12).toString('base64url');
         const run = {
@@ -1232,6 +1269,7 @@ export async function startChatd(opts = {}) {
           resumeSessionId: isValidSessionId(session?.providerState?.codex?.sessionId || '')
             ? session.providerState.codex.sessionId
             : null,
+          reasoningEffort: runReasoningEffort,
         };
 
         const enqueue = (fn) => {
@@ -1247,6 +1285,7 @@ export async function startChatd(opts = {}) {
             sessionId,
             message: promptMessage,
             model: session.model || null,
+            reasoningEffort: runReasoningEffort,
             resumeSessionId,
             onEvent: (evt) => {
               enqueue(async () => {
@@ -1377,7 +1416,12 @@ export async function startChatd(opts = {}) {
             event: 'run.started',
             runId,
             sessionId,
-            payload: { message, model: session.model || null, browserContext },
+            payload: {
+              message,
+              model: session.model || null,
+              reasoningEffort: runReasoningEffort,
+              browserContext,
+            },
           }));
           json(res, 202, { ok: true, runId, sessionId });
         } catch (error) {
diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
index 828fb61..3b57efe 100644
--- a/agent/src/codex-runner.js
+++ b/agent/src/codex-runner.js
@@ -465,7 +465,16 @@ export function normalizeCodexLine({ runId, sessionId, line }) {
   return envelope({ event: 'run.event', runId, sessionId, payload: parsed });
 }
 
-export function buildCodexExecArgs({ prompt, model, args, resumeSessionId } = {}) {
+function normalizeReasoningEffort(reasoningEffort) {
+  const normalized = String(reasoningEffort || '').trim().toLowerCase();
+  if (!normalized) return null;
+  if (normalized === 'low' || normalized === 'medium' || normalized === 'high' || normalized === 'xhigh') {
+    return normalized;
+  }
+  return null;
+}
+
+export function buildCodexExecArgs({ prompt, model, reasoningEffort, args, resumeSessionId } = {}) {
   if (Array.isArray(args) && args.length > 0) return args;
   const resumeId = typeof resumeSessionId === 'string' ? resumeSessionId.trim() : '';
   const resolved = resumeId
@@ -474,6 +483,10 @@ export function buildCodexExecArgs({ prompt, model, args, resumeSessionId } = {}
   if (typeof model === 'string' && model.trim()) {
     resolved.push('--model', model.trim());
   }
+  const normalizedReasoningEffort = normalizeReasoningEffort(reasoningEffort);
+  if (normalizedReasoningEffort) {
+    resolved.push('-c', `model_reasoning_effort="${normalizedReasoningEffort}"`);
+  }
   resolved.push(prompt || '');
   return resolved;
 }
@@ -489,10 +502,11 @@ export function startCodexRun({
   command,
   args,
   model,
+  reasoningEffort,
   resumeSessionId,
 } = {}) {
   const cmd = command || process.env.BF_CHATD_CODEX_COMMAND || 'codex';
-  const argv = buildCodexExecArgs({ prompt, model, args, resumeSessionId });
+  const argv = buildCodexExecArgs({ prompt, model, reasoningEffort, args, resumeSessionId });
 
   const child = spawn(cmd, argv, {
     cwd,
diff --git a/agent/src/session-store.js b/agent/src/session-store.js
index eb58851..12194dd 100644
--- a/agent/src/session-store.js
+++ b/agent/src/session-store.js
@@ -8,6 +8,7 @@ const INDEX_FILE = 'index.json';
 const SESSION_ID_RE = /^[A-Za-z0-9_-]{1,128}$/;
 const RUN_ID_RE = /^[A-Za-z0-9_-]{1,256}$/;
 const MODEL_ID_RE = /^[A-Za-z0-9._:/-]{1,128}$/;
+const REASONING_EFFORT_VALUES = new Set(['low', 'medium', 'high', 'xhigh']);
 const indexWriteQueues = new Map();
 
 function isObject(value) {
@@ -34,6 +35,10 @@ export function isValidModelId(model) {
   return typeof model === 'string' && MODEL_ID_RE.test(model);
 }
 
+export function isValidReasoningEffort(value) {
+  return typeof value === 'string' && REASONING_EFFORT_VALUES.has(value.trim().toLowerCase());
+}
+
 function assertValidSessionId(sessionId, fnName) {
   if (!isValidSessionId(sessionId)) {
     throw new Error(`${fnName} requires a safe sessionId`);
@@ -252,6 +257,16 @@ function normalizeModel(model) {
   return trimmed;
 }
 
+function normalizeReasoningEffort(reasoningEffort) {
+  if (reasoningEffort == null) return null;
+  const trimmed = String(reasoningEffort).trim().toLowerCase();
+  if (!trimmed) return null;
+  if (!isValidReasoningEffort(trimmed)) {
+    throw new Error('reasoningEffort must be one of: low, medium, high, xhigh');
+  }
+  return trimmed;
+}
+
 function normalizeUsageNumber(value, fieldName) {
   if (value == null) return null;
   const parsed = Number(value);
@@ -335,7 +350,7 @@ function sortSessionsNewestFirst(a, b) {
   return bTs - aTs;
 }
 
-export async function createSession({ title = 'New chat', model = null, storageRoot } = {}) {
+export async function createSession({ title = 'New chat', model = null, reasoningEffort = null, storageRoot } = {}) {
   const root = resolveStorageRoot(storageRoot);
   await ensureStorageRoot(root);
 
@@ -345,6 +360,7 @@ export async function createSession({ title = 'New chat', model = null, storageR
     sessionId,
     title,
     model: normalizeModel(model),
+    reasoningEffort: normalizeReasoningEffort(reasoningEffort),
     createdAt: now,
     updatedAt: now,
   };
@@ -399,6 +415,9 @@ export async function updateSession({ sessionId, patch = {}, storageRoot } = {})
     if (Object.prototype.hasOwnProperty.call(patch, 'model')) {
       next.model = normalizeModel(patch.model);
     }
+    if (Object.prototype.hasOwnProperty.call(patch, 'reasoningEffort')) {
+      next.reasoningEffort = normalizeReasoningEffort(patch.reasoningEffort);
+    }
     if (Object.prototype.hasOwnProperty.call(patch, 'providerState')) {
       const providerState = normalizeProviderState(patch.providerState, current.providerState);
       if (providerState == null) delete next.providerState;
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 7221b1b..eff31e1 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -155,8 +155,9 @@ test('POST /v1/runs uses injected run executor and persists assistant output', a
   const daemon = await startChatd({
     port: 0,
     writeChatdUrl: false,
-    runExecutor: ({ runId, sessionId, model, onEvent, onExit }) => {
-      seenRuns.push({ runId, sessionId, model });
+    defaultReasoningEffort: 'medium',
+    runExecutor: ({ runId, sessionId, model, reasoningEffort, onEvent, onExit }) => {
+      seenRuns.push({ runId, sessionId, model, reasoningEffort });
       setTimeout(() => {
         onEvent({ event: 'chat.delta', runId, sessionId, payload: { delta: 'hel' } });
       }, 10);
@@ -199,6 +200,7 @@ test('POST /v1/runs uses injected run executor and persists assistant output', a
 
     await new Promise((resolve) => setTimeout(resolve, 60));
     assert.equal(seenRuns.at(-1)?.model, 'gpt-5');
+    assert.equal(seenRuns.at(-1)?.reasoningEffort, 'medium');
 
     const messagesBody = await fetch(
       `${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}/messages`,
@@ -211,6 +213,82 @@ test('POST /v1/runs uses injected run executor and persists assistant output', a
   }
 });
 
+test('POST /v1/runs uses per-session reasoning effort when configured', async () => {
+  const seenRuns = [];
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    defaultReasoningEffort: 'medium',
+    runExecutor: ({ runId, sessionId, reasoningEffort, onEvent, onExit }) => {
+      seenRuns.push({ runId, sessionId, reasoningEffort });
+      setTimeout(() => onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'ok' } }), 10);
+      setTimeout(() => onExit({ code: 0 }), 15);
+      return { abort() {} };
+    },
+  });
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Effort' }),
+    }).then((res) => res.json());
+
+    const patched = await fetch(`${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}`, {
+      method: 'PATCH',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ reasoningEffort: 'high' }),
+    });
+    assert.equal(patched.status, 200);
+
+    const runRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'hi' }),
+    });
+    assert.equal(runRes.status, 202);
+
+    await new Promise((resolve) => setTimeout(resolve, 60));
+    assert.equal(seenRuns.at(-1)?.reasoningEffort, 'high');
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('PATCH /v1/sessions rejects invalid reasoning effort values', async () => {
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Invalid effort' }),
+    }).then((res) => res.json());
+
+    const patched = await fetch(`${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}`, {
+      method: 'PATCH',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ reasoningEffort: 'turbo' }),
+    });
+    assert.equal(patched.status, 400);
+  } finally {
+    await daemon.stop();
+  }
+});
+
 test('POST /v1/runs persists run steps so reopened sessions can render them', async () => {
   const daemon = await startChatd({
     port: 0,
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
index 3ec4166..15280a4 100644
--- a/test/agent/codex-runner.test.js
+++ b/test/agent/codex-runner.test.js
@@ -44,6 +44,11 @@ test('buildCodexExecArgs includes --model when session model is set', () => {
   assert.deepEqual(args, ['exec', '--json', '--model', 'gpt-5', 'hi']);
 });
 
+test('buildCodexExecArgs includes reasoning effort override when set', () => {
+  const args = buildCodexExecArgs({ prompt: 'hi', reasoningEffort: 'medium' });
+  assert.deepEqual(args, ['exec', '--json', '-c', 'model_reasoning_effort="medium"', 'hi']);
+});
+
 test('buildCodexExecArgs emits resume invocation when codex session id is provided', () => {
   const args = buildCodexExecArgs({
     prompt: 'hi',
diff --git a/test/agent/session-store.test.js b/test/agent/session-store.test.js
index e17fccf..39d3df5 100644
--- a/test/agent/session-store.test.js
+++ b/test/agent/session-store.test.js
@@ -104,17 +104,29 @@ test('updateSession persists per-session model and title', async () => {
   const created = await createSession({ title: 'Before', storageRoot });
   const updated = await updateSession({
     sessionId: created.sessionId,
-    patch: { title: 'After', model: 'gpt-5' },
+    patch: { title: 'After', model: 'gpt-5', reasoningEffort: 'high' },
     storageRoot,
   });
 
   assert.equal(updated?.title, 'After');
   assert.equal(updated?.model, 'gpt-5');
+  assert.equal(updated?.reasoningEffort, 'high');
 
   const rows = await listSessions({ limit: 10, storageRoot });
   const row = rows.find((item) => item.sessionId === created.sessionId);
   assert.equal(row?.title, 'After');
   assert.equal(row?.model, 'gpt-5');
+  assert.equal(row?.reasoningEffort, 'high');
+});
+
+test('updateSession supports clearing reasoning effort back to config default', async () => {
+  const created = await createSession({ title: 'Before', storageRoot });
+  const updated = await updateSession({
+    sessionId: created.sessionId,
+    patch: { reasoningEffort: null },
+    storageRoot,
+  });
+  assert.equal(updated?.reasoningEffort, null);
 });
 
 test('updateSession persists codex provider session mapping', async () => {

From 5ad5718131742f8b00df6fdc2dae80f6c389b15c Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 22:37:16 +0530
Subject: [PATCH 171/192] panel: add thinking selector to model popup

- add a Thinking Level list in the existing model popover with Default/Low/Medium/High/Extra High choices

- map Default to config-derived defaultReasoningEffort and persist per-session overrides through PATCH /v1/sessions

- hydrate panel state from /v1/models defaultReasoningEffort and keep medium as local fallback

- extend panel contract coverage for the new thinking list surface
---
 extension/agent-panel.html              |   2 +
 extension/agent-panel.js                | 181 +++++++++++++++++++++---
 test/agent/agent-panel-contract.test.js |   7 +
 3 files changed, 167 insertions(+), 23 deletions(-)

diff --git a/extension/agent-panel.html b/extension/agent-panel.html
index 53a2344..13d2f1a 100644
--- a/extension/agent-panel.html
+++ b/extension/agent-panel.html
@@ -69,6 +69,8 @@
     <section id="bf-model-panel" class="popover-panel hidden" role="listbox" aria-label="Available models">
       <p class="popover-label">Available Models</p>
       <ul id="bf-model-list" class="popover-list"></ul>
+      <p class="popover-label">Thinking Level</p>
+      <ul id="bf-thinking-list" class="popover-list"></ul>
     </section>
 
     <section id="bf-session-panel" class="popover-panel hidden" role="listbox" aria-label="Sessions">
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 67a7722..209f761 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -9,14 +9,24 @@ import {
   shouldApplySessionSelection,
 } from './agent-panel-runtime.js';
 
+const REASONING_PRESETS = [
+  { value: null, label: 'Default (Config)' },
+  { value: 'low', label: 'Low' },
+  { value: 'medium', label: 'Medium' },
+  { value: 'high', label: 'High' },
+  { value: 'xhigh', label: 'Extra High' },
+];
+
 const state = {
   value: initialState,
   auth: null,
   modelPresets: [{ value: null, label: 'Default' }],
+  defaultReasoningEffort: 'medium',
   currentRunBySession: {},
   expandedTimelineEntries: {},
   latestReasoningTitleByRun: {},
   transcriptHandlersBound: false,
+  tabAttachWatchersBound: false,
   initialTabAttachInFlight: false,
   initialTabAttachStarted: false,
   editingSessionId: null,
@@ -45,6 +55,7 @@ const popoverBackdropEl = document.getElementById('bf-popover-backdrop');
 const modelPanelEl = document.getElementById('bf-model-panel');
 const sessionPanelEl = document.getElementById('bf-session-panel');
 const modelListEl = document.getElementById('bf-model-list');
+const thinkingListEl = document.getElementById('bf-thinking-list');
 const switchSessionListEl = document.getElementById('bf-switch-session-list');
 const transcriptEl = document.getElementById('bf-transcript');
 const chatFormEl = document.getElementById('bf-chat-form');
@@ -194,6 +205,15 @@ function normalizeStartupError(code = '', fallbackMessage = 'Unable to connect t
   };
 }
 
+function startupActionsForIssue(startupIssue) {
+  const code = String(startupIssue?.code || '').trim().toLowerCase();
+  const actions = [{ key: 'retry', label: 'Retry' }];
+  if (code === 'extension_not_connected' || code === 'relay_unreachable') {
+    actions.push({ key: 'refresh-connection', label: 'Refresh connection' });
+  }
+  return actions;
+}
+
 function setComposerEnabled(enabled) {
   chatInputEl.disabled = !enabled;
   autoResizeInput();
@@ -245,6 +265,23 @@ function formatModelLabel(model) {
   return model && String(model).trim() ? model : 'Default';
 }
 
+function normalizeReasoningEffort(value) {
+  const normalized = String(value || '').trim().toLowerCase();
+  if (normalized === 'low' || normalized === 'medium' || normalized === 'high' || normalized === 'xhigh') {
+    return normalized;
+  }
+  return null;
+}
+
+function formatReasoningEffortLabel(value) {
+  const normalized = normalizeReasoningEffort(value);
+  if (normalized === 'low') return 'Low';
+  if (normalized === 'medium') return 'Medium';
+  if (normalized === 'high') return 'High';
+  if (normalized === 'xhigh') return 'Extra High';
+  return 'Medium';
+}
+
 function isDefaultSessionTitle(title) {
   const lowered = String(title || '').trim().toLowerCase();
   return !lowered || lowered === 'new session' || lowered === 'new chat';
@@ -302,8 +339,10 @@ function renderSelectors() {
 }
 
 function renderModelList() {
+  if (!modelListEl || !thinkingListEl) return;
   const activeSession = getActiveSession();
   const activeModel = activeSession?.model || null;
+  const activeReasoningEffort = normalizeReasoningEffort(activeSession?.reasoningEffort);
 
   const rows = state.modelPresets.map((preset) => {
     const active = (preset.value || null) === activeModel ? 'active' : '';
@@ -312,6 +351,14 @@ function renderModelList() {
   rows.push('<li><button type="button" data-model-custom="1" class="popover-item custom-item"><span>Custom...</span></button></li>');
 
   modelListEl.innerHTML = rows.join('');
+  thinkingListEl.innerHTML = REASONING_PRESETS.map((preset) => {
+    const active = (preset.value || null) === (activeReasoningEffort || null) ? 'active' : '';
+    let label = preset.label;
+    if (preset.value == null) {
+      label = `Default (Config: ${formatReasoningEffortLabel(state.defaultReasoningEffort)})`;
+    }
+    return `<li><button type="button" data-reasoning-effort="${escapeHtml(preset.value || '')}" class="popover-item ${active}"><span>${escapeHtml(label)}</span></button></li>`;
+  }).join('');
 
   modelListEl.querySelectorAll('button[data-model]').forEach((button) => {
     button.addEventListener('click', () => {
@@ -336,6 +383,16 @@ function renderModelList() {
       }
     });
   }
+
+  thinkingListEl.querySelectorAll('button[data-reasoning-effort]').forEach((button) => {
+    button.addEventListener('click', () => {
+      const value = button.dataset.reasoningEffort || null;
+      const reasoningEffort = normalizeReasoningEffort(value);
+      updateActiveSessionReasoningEffort(reasoningEffort).catch((error) => {
+        setStatus('error', error.message || 'Unable to update thinking level');
+      });
+    });
+  });
 }
 
 function renderSessions() {
@@ -659,7 +716,20 @@ function renderContent(value) {
 
 function bindTranscriptHandlers() {
   if (state.transcriptHandlersBound) return;
-  transcriptEl.addEventListener('click', (event) => {
+  transcriptEl.addEventListener('click', async (event) => {
+    const startupActionBtn = event.target.closest('button[data-startup-action]');
+    if (startupActionBtn && transcriptEl.contains(startupActionBtn)) {
+      event.preventDefault();
+      const msgAction = startupActionBtn.getAttribute('data-startup-action');
+      if (msgAction === 'retry') {
+        await retryStartup();
+        return;
+      }
+      if (msgAction === 'refresh-connection') {
+        await retryStartup({ refreshConnection: true });
+        return;
+      }
+    }
     const toggleBtn = event.target.closest('button[data-step-key]');
     if (!toggleBtn || !transcriptEl.contains(toggleBtn)) return;
     const stepKey = toggleBtn.getAttribute('data-step-key');
@@ -725,6 +795,20 @@ function renderTranscript({ preserveScrollTop = null } = {}) {
       const commandHtml = startupIssue.command
         ? `<p class="empty-command"><code>${escapeHtml(startupIssue.command)}</code></p>`
         : '';
+      const actions = startupActionsForIssue(startupIssue);
+      const actionsHtml = actions.length > 0
+        ? `
+          <div class="empty-actions">
+            ${actions.map((action) => `
+              <button
+                type="button"
+                class="empty-action-btn${action.key === 'refresh-connection' ? ' secondary' : ''}"
+                data-startup-action="${escapeHtml(action.key)}"
+              >${escapeHtml(action.label)}</button>
+            `).join('')}
+          </div>
+        `
+        : '';
       transcriptEl.innerHTML = `
         <div class="empty-state error-state">
           <div class="empty-icon error">!</div>
@@ -732,6 +816,7 @@ function renderTranscript({ preserveScrollTop = null } = {}) {
             <p class="empty-title">${escapeHtml(startupIssue.title || 'Unable to connect')}</p>
             <p class="empty-sub">${escapeHtml(startupIssue.detail || '')}</p>
             ${commandHtml}
+            ${actionsHtml}
           </div>
         </div>
       `;
@@ -921,6 +1006,8 @@ function scheduleTabAttachRefresh(delayMs = 0) {
 }
 
 function bindTabAttachWatchers() {
+  if (state.tabAttachWatchersBound) return;
+  state.tabAttachWatchersBound = true;
   if (chrome?.tabs?.onActivated?.addListener) {
     chrome.tabs.onActivated.addListener(() => {
       scheduleTabAttachRefresh(40);
@@ -983,6 +1070,13 @@ async function getRelayHttpUrl() {
   return 'http://127.0.0.1:19222';
 }
 
+async function refreshExtensionConnection() {
+  const stored = await chrome.storage.local.get(['relayUrl']);
+  const relayUrl = stored.relayUrl || 'ws://127.0.0.1:19222/extension';
+  const response = await runtimeMessage({ type: 'updateRelayUrl', relayUrl });
+  if (response?.error) throw new Error(response.error);
+}
+
 async function loadAuth() {
   const relayHttpUrl = await getRelayHttpUrl();
   const extensionId = chrome?.runtime?.id;
@@ -1081,6 +1175,7 @@ async function loadModelPresets() {
   await ensureOk(res, 'Failed to load models');
   const body = await readJsonOrEmpty(res);
   state.modelPresets = normalizeModelRows(body.models);
+  state.defaultReasoningEffort = normalizeReasoningEffort(body.defaultReasoningEffort) || 'medium';
 }
 
 async function loadMessages(sessionId) {
@@ -1209,6 +1304,24 @@ async function updateActiveSessionModel(model) {
   setStatus('ready', 'Ready');
 }
 
+async function updateActiveSessionReasoningEffort(reasoningEffort) {
+  const sessionId = state.value.activeSessionId;
+  if (!sessionId) return;
+
+  const res = await api(`/v1/sessions/${encodeURIComponent(sessionId)}`, {
+    method: 'PATCH',
+    body: JSON.stringify({ reasoningEffort }),
+  });
+  if (!res.ok) {
+    const body = await res.json().catch(() => ({}));
+    throw new Error(body.error || 'Unable to update thinking level');
+  }
+
+  await loadSessions(sessionId);
+  setPopover('none');
+  setStatus('ready', 'Ready');
+}
+
 async function consumeEventStream(body, loopToken) {
   if (!body) return;
   const reader = body.getReader();
@@ -1311,6 +1424,49 @@ async function stopRun() {
   });
 }
 
+async function initializePanel() {
+  state.startupIssue = null;
+  setComposerEnabled(false);
+  setStatus('info', 'Connecting...');
+  render();
+  startInitialTabAttach();
+  await loadAuth();
+  bindTabAttachWatchers();
+  try {
+    await loadModelPresets();
+  } catch {
+    state.modelPresets = [{ value: null, label: 'Default' }];
+    state.defaultReasoningEffort = 'medium';
+  }
+  await loadSessions();
+  if (!state.value.activeSessionId) {
+    await createSession();
+  } else {
+    await selectSession(state.value.activeSessionId);
+  }
+  setComposerEnabled(true);
+  scheduleTabAttachRefresh(0);
+  setStatus('ready', 'Ready');
+  render();
+}
+
+async function retryStartup({ refreshConnection = false } = {}) {
+  try {
+    setStatus('info', refreshConnection ? 'Refreshing connection...' : 'Retrying...');
+    render();
+    if (refreshConnection) {
+      await refreshExtensionConnection();
+    }
+    await initializePanel();
+  } catch (error) {
+    state.startupIssue = normalizeStartupError(error?.code, error?.message);
+    setComposerEnabled(false);
+    setTabAttachBannerState({ hidden: true });
+    setStatus('error', state.startupIssue.statusText || 'Daemon unavailable');
+    render();
+  }
+}
+
 chatFormEl.addEventListener('submit', async (event) => {
   event.preventDefault();
   const text = chatInputEl.value;
@@ -1383,28 +1539,7 @@ popoverBackdropEl.addEventListener('click', () => {
 
 (async function init() {
   try {
-    state.startupIssue = null;
-    setComposerEnabled(false);
-    setStatus('info', 'Connecting...');
-    render();
-    startInitialTabAttach();
-    await loadAuth();
-    bindTabAttachWatchers();
-    try {
-      await loadModelPresets();
-    } catch {
-      state.modelPresets = [{ value: null, label: 'Default' }];
-    }
-    await loadSessions();
-    if (!state.value.activeSessionId) {
-      await createSession();
-    } else {
-      await selectSession(state.value.activeSessionId);
-    }
-    setComposerEnabled(true);
-    scheduleTabAttachRefresh(0);
-    setStatus('ready', 'Ready');
-    render();
+    await initializePanel();
   } catch (error) {
     state.startupIssue = normalizeStartupError(error?.code, error?.message);
     setComposerEnabled(false);
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index fbb9bca..e923306 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -14,6 +14,7 @@ test('agent panel has inline model and session selectors with popovers', () => {
   assert.match(html, /id="bf-model-panel"/);
   assert.match(html, /id="bf-session-panel"/);
   assert.match(html, /id="bf-model-list"/);
+  assert.match(html, /id="bf-thinking-list"/);
   assert.match(html, /id="bf-switch-session-list"/);
   assert.match(html, /id="bf-tab-attach-banner"/);
   assert.match(html, /id="bf-tab-attach-text"/);
@@ -91,3 +92,9 @@ test('collapsed execute helper preview has tree-like branch styling', () => {
   assert.match(css, /\.step-branch-node::before/);
   assert.match(css, /\.step-branch-call/);
 });
+
+test('startup error card action buttons have dedicated styling hooks', () => {
+  assert.match(css, /\.empty-actions/);
+  assert.match(css, /\.empty-action-btn/);
+  assert.match(css, /\.empty-action-btn\.secondary/);
+});

From e18c59088509f7ec71233b1b7e69111f0fc107fa Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 22:37:18 +0530
Subject: [PATCH 172/192] agent-panel: add startup retry and refresh-connection
 actions

- add action buttons to startup error card: Retry and Refresh connection

- wire transcript action handling to retry panel bootstrap and trigger extension relay reconnect via updateRelayUrl

- refactor bootstrap flow into initializePanel/retryStartup and guard duplicate tab watcher bindings

- add contract coverage for startup action hooks and styles
---
 extension/agent-panel.css                    | 35 ++++++++++++++++++++
 test/agent/agent-panel-send-contract.test.js | 18 ++++++++--
 2 files changed, 51 insertions(+), 2 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index d21c972..0ab0a28 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -266,6 +266,41 @@ body {
   word-break: break-word;
 }
 
+.empty-actions {
+  margin-top: 10px;
+  display: flex;
+  flex-wrap: wrap;
+  gap: 8px;
+}
+
+.empty-action-btn {
+  height: 28px;
+  border-radius: 8px;
+  border: 1px solid var(--crail);
+  background: var(--crail);
+  color: #fff;
+  padding: 0 10px;
+  font-size: 11px;
+  line-height: 1;
+  cursor: pointer;
+}
+
+.empty-action-btn.secondary {
+  background: var(--linen);
+  border-color: var(--line);
+  color: var(--crail-dark);
+}
+
+.empty-action-btn:hover {
+  background: var(--crail-dark);
+  border-color: var(--crail-dark);
+}
+
+.empty-action-btn.secondary:hover {
+  background: var(--sand);
+  border-color: var(--line);
+}
+
 .message {
   display: flex;
   flex-direction: column;
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index ef01071..a79b92c 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -81,8 +81,11 @@ test('context usage renderer hides element when unavailable and only shows forma
 
 test('init opens smoothly by starting tab attach asynchronously', () => {
   assert.match(js, /function startInitialTabAttach\(\)/);
-  assert.match(js, /\(async function init\(\)[\s\S]*startInitialTabAttach\(\);/);
-  assert.doesNotMatch(js, /\(async function init\(\)[\s\S]*await ensureCurrentTabAttached\(\);/);
+  assert.match(js, /async function initializePanel\(\)[\s\S]*startInitialTabAttach\(\);/);
+  const initMatch = js.match(/\(async function init\(\)[\s\S]*?\n}\)\(\);/);
+  assert.ok(initMatch, 'init block should be present');
+  const initBlock = initMatch[0];
+  assert.doesNotMatch(initBlock, /await ensureCurrentTabAttached\(\);/);
 });
 
 test('tab-attach banner shows progress during initial auto-attach and suppresses not-connected state', () => {
@@ -159,3 +162,14 @@ test('collapsed BrowserForce execute rows infer helper calls and render branch p
   assert.match(js, /class="step-branch-node"/);
   assert.match(js, /class="step-branch-call"/);
 });
+
+test('startup error card supports retry and refresh connection actions', () => {
+  assert.match(js, /function refreshExtensionConnection\(/);
+  assert.match(js, /function retryStartup\(/);
+  assert.match(js, /data-startup-action=/);
+  assert.match(js, /key:\s*'retry'/);
+  assert.match(js, /key:\s*'refresh-connection'/);
+  assert.match(js, /msgAction === 'retry'/);
+  assert.match(js, /msgAction === 'refresh-connection'/);
+  assert.match(js, /runtimeMessage\(\{\s*type:\s*'updateRelayUrl'/);
+});

From 4deb317eb136c1fda9293a4243f2581db650f2f8 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 23:05:27 +0530
Subject: [PATCH 173/192] feat(plugins): metadata-first skill loading and v1
 plugin API

---
 README.frontpage.md                     |  19 +-
 README.md                               |  21 +-
 bin.js                                  |   6 +-
 mcp/src/exec-engine.js                  |  80 +++++++
 mcp/src/index.js                        |  18 +-
 mcp/test/mcp-plugin-integration.test.js |  49 ++++-
 mcp/test/mcp-tools.test.js              |   3 +
 mcp/test/plugin-loader.test.js          | 271 +++++++++++++++++++++++-
 plugins/official/google-sheets/SKILL.md |   9 +
 plugins/official/google-sheets/index.js | 210 +++++++++++++++---
 plugins/official/highlight/SKILL.md     |   8 +
 plugins/official/openclaw/SKILL.md      |   8 +
 relay/src/index.js                      |   8 +-
 13 files changed, 650 insertions(+), 60 deletions(-)

diff --git a/README.frontpage.md b/README.frontpage.md
index f11e4f5..0c67331 100644
--- a/README.frontpage.md
+++ b/README.frontpage.md
@@ -410,7 +410,14 @@ Plugins add custom helpers directly into the `execute` tool scope. Install once
 browserforce plugin install highlight
 ```
 
-That's it. Restart MCP (or Claude Desktop) and `highlight()` is available in every `execute` call.
+That's it. Restart MCP (or Claude Desktop) after every plugin install or update, then `highlight()` is available in every `execute` call.
+
+### Prompt behavior (metadata-first)
+
+Plugin `SKILL.md` content is no longer fully inlined into the default `execute` prompt. BrowserForce now exposes plugin metadata first (name, description, helpers), then loads details on demand:
+
+- Call `pluginCatalog()` to discover installed plugins, helper names, and available sections.
+- Call `pluginHelp(name, section?)` only when you need plugin-specific instructions.
 
 ### Official plugins
 
@@ -442,7 +449,13 @@ browserforce plugin list        # See what's installed
 browserforce plugin remove highlight   # Uninstall
 ```
 
-Plugins are stored at `~/.browserforce/plugins/`. Each one is a folder with an `index.js`.
+Plugins are stored at `~/.browserforce/plugins/<name>/`. Each plugin folder contains an `index.js` and can include a `SKILL.md`.
+
+Repo layout remains:
+- Official plugins: `plugins/official/<name>/SKILL.md`
+- Community plugins: `plugins/community/<name>/SKILL.md`
+
+No migration to `plugin/skills/<name>/` is required.
 
 ### Write your own
 
@@ -463,7 +476,7 @@ export default {
 
 Drop it in `~/.browserforce/plugins/my-plugin/`, restart MCP, and call `await scrollToBottom()` or `await countLinks()` from any `execute` call.
 
-Add a `SKILL.md` file alongside `index.js` and its content is automatically appended to the `execute` tool's description — so your agent knows the helpers exist without you having to explain them every time.
+Add a `SKILL.md` file alongside `index.js` to publish plugin metadata and help text. The default prompt includes only metadata; fetch full or sectioned guidance on demand with `pluginHelp('my-plugin')` or `pluginHelp('my-plugin', 'examples')`.
 
 ### Any Playwright Script
 
diff --git a/README.md b/README.md
index 7e5539d..de5b348 100644
--- a/README.md
+++ b/README.md
@@ -161,7 +161,7 @@ flowchart LR
   MCP --> RELAY["Relay (`127.0.0.1:19222`)"]
   RELAY --> EXT["Chrome Extension (MV3)"]
   EXT --> CHROME["User's Real Chrome Session"]
-  SETUP["`browserforce setup openclaw`"] --> PLUGIN["Auto-install `openclaw` plugin\n(SKILL appended to execute prompt)"]
+  SETUP["`browserforce setup openclaw`"] --> PLUGIN["Auto-install `openclaw` plugin\n(metadata shown in prompt, details via pluginHelp())"]
   PLUGIN --> MCP
 ```
 
@@ -437,7 +437,14 @@ Plugins add custom helpers directly into the `execute` tool scope. Install once
 browserforce plugin install highlight
 ```
 
-That's it. Restart MCP (or Claude Desktop) and `highlight()` is available in every `execute` call.
+That's it. Restart MCP (or Claude Desktop) after every plugin install or update, then `highlight()` is available in every `execute` call.
+
+### Prompt behavior (metadata-first)
+
+Plugin `SKILL.md` content is no longer fully inlined into the default `execute` prompt. BrowserForce now exposes plugin metadata first (name, description, helpers), then loads details on demand:
+
+- Call `pluginCatalog()` to discover installed plugins, helper names, and available sections.
+- Call `pluginHelp(name, section?)` only when you need plugin-specific instructions.
 
 ### Official plugins
 
@@ -471,7 +478,13 @@ browserforce plugin list        # See what's installed
 browserforce plugin remove highlight   # Uninstall
 ```
 
-Plugins are stored at `~/.browserforce/plugins/`. Each one is a folder with an `index.js`.
+Plugins are stored at `~/.browserforce/plugins/<name>/`. Each plugin folder contains an `index.js` and can include a `SKILL.md`.
+
+Repo layout remains:
+- Official plugins: `plugins/official/<name>/SKILL.md`
+- Community plugins: `plugins/community/<name>/SKILL.md`
+
+No migration to `plugin/skills/<name>/` is required.
 
 ### Write your own
 
@@ -492,7 +505,7 @@ export default {
 
 Drop it in `~/.browserforce/plugins/my-plugin/`, restart MCP, and call `await scrollToBottom()` or `await countLinks()` from any `execute` call.
 
-Add a `SKILL.md` file alongside `index.js` and its content is automatically appended to the `execute` tool's description — so your agent knows the helpers exist without you having to explain them every time.
+Add a `SKILL.md` file alongside `index.js` to publish plugin metadata and help text. The default prompt includes only metadata; fetch full or sectioned guidance on demand with `pluginHelp('my-plugin')` or `pluginHelp('my-plugin', 'examples')`.
 
 ### Any Playwright Script
 
diff --git a/bin.js b/bin.js
index 773980c..4f99e26 100644
--- a/bin.js
+++ b/bin.js
@@ -289,7 +289,7 @@ async function cmdPlugin() {
   try { authToken = readFileSync(tokenFile, 'utf8').trim(); } catch { /* no token file */ }
 
   if (sub === 'list') {
-    const data = await httpGet(`${baseUrl}/plugins`);
+    const data = await httpGet(`${baseUrl}/v1/plugins`);
     if (values.json) {
       output(data, true);
     } else {
@@ -306,7 +306,7 @@ async function cmdPlugin() {
   if (sub === 'install') {
     const name = positionals[2];
     if (!name) { console.error('Usage: browserforce plugin install <name>'); process.exit(1); }
-    const { status, body } = await httpFetch('POST', `${baseUrl}/plugins/install`, { name }, authToken);
+    const { status, body } = await httpFetch('POST', `${baseUrl}/v1/plugins/install`, { name }, authToken);
     if (status >= 400) {
       console.error(`Error: ${body.error || JSON.stringify(body)}`);
       process.exit(1);
@@ -318,7 +318,7 @@ async function cmdPlugin() {
   if (sub === 'remove') {
     const name = positionals[2];
     if (!name) { console.error('Usage: browserforce plugin remove <name>'); process.exit(1); }
-    const { status, body } = await httpFetch('DELETE', `${baseUrl}/plugins/${encodeURIComponent(name)}`, null, authToken);
+    const { status, body } = await httpFetch('DELETE', `${baseUrl}/v1/plugins/${encodeURIComponent(name)}`, null, authToken);
     if (status >= 400) {
       console.error(`Error: ${body.error || JSON.stringify(body)}`);
       process.exit(1);
diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index 2169faa..e538aca 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -554,6 +554,7 @@ export function buildExecContext(
   pluginHelpers = {},
   agentPreferences = {},
   runtimeRestrictions = {},
+  pluginSkillRuntime = {},
 ) {
   const { consoleLogs, setupConsoleCapture } = consoleHelpers;
   const lastSnapshots = userState.__lastSnapshots || (userState.__lastSnapshots = new WeakMap());
@@ -669,9 +670,87 @@ export function buildExecContext(
     instructions: typeof runtimeRestrictions?.instructions === 'string' ? runtimeRestrictions.instructions : '',
   };
 
+  const pluginCatalog = () => {
+    const catalog = Array.isArray(pluginSkillRuntime?.catalog) ? pluginSkillRuntime.catalog : [];
+    return catalog.map((entry) => ({
+      ...entry,
+      helpers: Array.isArray(entry?.helpers) ? [...entry.helpers] : [],
+      sections: Array.isArray(entry?.sections) ? [...entry.sections] : [],
+    }));
+  };
+
+  const pluginHelp = (name, section) => {
+    const requestedName = String(name || '').trim().toLowerCase();
+    if (!requestedName) {
+      throw new Error('pluginHelp(name, section?) requires a plugin name');
+    }
+
+    const lookup = pluginSkillRuntime?.byName && typeof pluginSkillRuntime.byName === 'object'
+      ? pluginSkillRuntime.byName
+      : {};
+    const plugin = lookup[requestedName];
+    if (!plugin) {
+      const available = pluginCatalog().map((entry) => entry.name).join(', ') || '(none)';
+      throw new Error(`Unknown plugin "${name}". Available plugins: ${available}`);
+    }
+
+    if (section === undefined || section === null || String(section).trim() === '') {
+      if (plugin.text && plugin.text.trim()) return plugin.text;
+      if (plugin.description && plugin.description.trim()) {
+        return `${plugin.name}: ${plugin.description.trim()}`;
+      }
+      return `${plugin.name} has no SKILL.md help text.`;
+    }
+
+    const normalizedSection = String(section)
+      .toLowerCase()
+      .trim()
+      .replace(/^[\d.)\s-]+/, '')
+      .replace(/[^\p{L}\p{N}\s-]/gu, '')
+      .replace(/\s+/g, ' ')
+      .trim();
+    const sections = plugin.sections && typeof plugin.sections === 'object' ? plugin.sections : {};
+    if (sections[normalizedSection]) return sections[normalizedSection];
+    const availableSections = Object.keys(sections).join(', ') || '(none)';
+    throw new Error(
+      `Unknown section "${section}" for plugin "${plugin.name}". Available sections: ${availableSections}`
+    );
+  };
+
+  const reservedContextNames = new Set([
+    'browserforceSettings',
+    'browserforceRestrictions',
+    'page',
+    'context',
+    'state',
+    'snapshot',
+    'refToLocator',
+    'waitForPageLoad',
+    'getLogs',
+    'clearLogs',
+    'getCDPSession',
+    'screenshotWithAccessibilityLabels',
+    'cleanHTML',
+    'pageMarkdown',
+    'pluginCatalog',
+    'pluginHelp',
+    'fetch',
+    'URL',
+    'URLSearchParams',
+    'Buffer',
+    'setTimeout',
+    'clearTimeout',
+    'TextEncoder',
+    'TextDecoder',
+  ]);
+
   // Wrap plugin helpers to auto-inject (page, ctx, state) as first three args
   const wrappedPluginHelpers = {};
   for (const [name, fn] of Object.entries(pluginHelpers)) {
+    if (reservedContextNames.has(name)) {
+      process.stderr.write(`[bf-plugins] Ignoring helper "${name}" because it conflicts with a built-in\n`);
+      continue;
+    }
     wrappedPluginHelpers[name] = (...args) => {
       let pg = null;
       try { pg = activePage(); } catch { /* no active page */ }
@@ -686,6 +765,7 @@ export function buildExecContext(
     page: defaultPage, context: ctx, state: userState,
     snapshot, refToLocator, waitForPageLoad, getLogs, clearLogs, getCDPSession,
     screenshotWithAccessibilityLabels, cleanHTML, pageMarkdown,
+    pluginCatalog, pluginHelp,
     fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout,
     TextEncoder, TextDecoder,
   };
diff --git a/mcp/src/index.js b/mcp/src/index.js
index 541796c..802dd77 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -11,7 +11,12 @@ import {
   ensureRelay, connectOverCdpWithBusyRetry,
   CodeExecutionTimeoutError, buildExecContext, runCode, formatResult,
 } from './exec-engine.js';
-import { loadPlugins, buildPluginHelpers, buildPluginSkillAppendix } from './plugin-loader.js';
+import {
+  loadPlugins,
+  buildPluginHelpers,
+  buildPluginSkillAppendix,
+  buildPluginSkillRuntime,
+} from './plugin-loader.js';
 import { checkForUpdate } from './update-check.js';
 
 // ─── Console Log Capture ─────────────────────────────────────────────────────
@@ -261,6 +266,7 @@ async function getBrowserforceRestrictionsForSession() {
 
 let plugins = [];
 let pluginHelpers = {};
+let pluginSkillRuntime = { catalog: [], byName: {} };
 
 // ─── Update State ────────────────────────────────────────────────────────────
 // Checked once at startup; notice injected into first execute response only.
@@ -309,9 +315,16 @@ Helpers:
                                      Falls back to raw body text for non-article pages.
   getCDPSession({ page })            Create a relay-safe raw CDP session for a page.
                                      Use this instead of page.context().newCDPSession(page).
+  pluginCatalog()                    Returns installed plugin metadata (metadata-first discovery).
+  pluginHelp(name, section?)         Returns on-demand SKILL help for one plugin from in-memory cache.
 
 Globals: fetch, URL, URLSearchParams, Buffer, setTimeout, clearTimeout, TextEncoder, TextDecoder
 
+Plugin workflow (metadata-first):
+  1) Call pluginCatalog() to discover plugin names, helper names, and available sections.
+  2) Call pluginHelp(name, section?) only when you need plugin-specific instructions.
+  3) Avoid calling pluginHelp blindly for every plugin.
+
 ═══ FIRST CALL — PAGE SETUP ═══
 
 IMPORTANT: Do NOT navigate the user's existing tabs. Always create or reuse a dedicated tab.
@@ -491,7 +504,7 @@ function registerExecuteTool(skillAppendix = '') {
       if (page) setupConsoleCapture(page);
       const execCtx = buildExecContext(page, ctx, userState, {
         consoleLogs, setupConsoleCapture,
-      }, pluginHelpers, agentPreferences, browserforceRestrictions);
+      }, pluginHelpers, agentPreferences, browserforceRestrictions, pluginSkillRuntime);
       try {
         const result = await runCode(code, execCtx, timeout);
         const formatted = formatResult(result);
@@ -550,6 +563,7 @@ async function initPlugins() {
   try {
     plugins = await loadPlugins();
     pluginHelpers = buildPluginHelpers(plugins);
+    pluginSkillRuntime = buildPluginSkillRuntime(plugins);
     if (plugins.length > 0) {
       process.stderr.write(`[bf-mcp] Loaded ${plugins.length} plugin(s): ${plugins.map(p => p.name).join(', ')}\n`);
     }
diff --git a/mcp/test/mcp-plugin-integration.test.js b/mcp/test/mcp-plugin-integration.test.js
index ec128c6..1567fd3 100644
--- a/mcp/test/mcp-plugin-integration.test.js
+++ b/mcp/test/mcp-plugin-integration.test.js
@@ -6,7 +6,12 @@ import { mkdtemp, mkdir, writeFile, rm } from 'node:fs/promises';
 import { join } from 'node:path';
 import { tmpdir } from 'node:os';
 import { buildExecContext, runCode } from '../src/exec-engine.js';
-import { loadPlugins, buildPluginHelpers, buildPluginSkillAppendix } from '../src/plugin-loader.js';
+import {
+  loadPlugins,
+  buildPluginHelpers,
+  buildPluginSkillAppendix,
+  buildPluginSkillRuntime,
+} from '../src/plugin-loader.js';
 
 test('plugin helper is callable in execute scope after loadPlugins', async () => {
   const dir = await mkdtemp(join(tmpdir(), 'bf-mcp-test-'));
@@ -33,18 +38,52 @@ test('plugin helper is callable in execute scope after loadPlugins', async () =>
   await rm(dir, { recursive: true });
 });
 
-test('plugin SKILL.md content is included in plugin appendix', async () => {
+test('plugin appendix is metadata-only and runtime help remains available', async () => {
   const dir = await mkdtemp(join(tmpdir(), 'bf-mcp-test-'));
   const pluginDir = join(dir, 'tagger');
   await mkdir(pluginDir);
-  await writeFile(join(pluginDir, 'index.js'), `export default { name: 'tagger', helpers: {} };`);
-  await writeFile(join(pluginDir, 'SKILL.md'), 'Use tagger() to tag elements.');
+  await writeFile(join(pluginDir, 'index.js'), `
+    export default {
+      name: 'tagger',
+      helpers: { tagger: () => 'ok' },
+    };
+  `);
+  await writeFile(join(pluginDir, 'SKILL.md'), `---
+name: tagger
+description: Tags elements with labels.
+---
+Use tagger() to tag elements.
+
+## examples
+- tagger('hero')`);
 
   const plugins = await loadPlugins(dir);
   const appendix = buildPluginSkillAppendix(plugins);
+  const pluginSkillRuntime = buildPluginSkillRuntime(plugins);
+  const mockPage = { isClosed: () => false, url: () => 'about:blank', title: async () => '' };
+
+  const ctx = buildExecContext(
+    mockPage,
+    { pages: () => [mockPage] },
+    {},
+    {},
+    buildPluginHelpers(plugins),
+    {},
+    {},
+    pluginSkillRuntime,
+  );
 
   assert.ok(appendix.includes('PLUGIN: tagger'));
-  assert.ok(appendix.includes('Use tagger() to tag elements.'));
+  assert.ok(appendix.includes('Tags elements with labels.'));
+  assert.ok(!appendix.includes('Use tagger() to tag elements.'));
+
+  const catalog = await runCode('return pluginCatalog()', ctx, 5000);
+  assert.equal(Array.isArray(catalog), true);
+  assert.equal(catalog[0].name, 'tagger');
+  assert.equal(catalog[0].description, 'Tags elements with labels.');
+
+  const help = await runCode('return pluginHelp("tagger", "examples")', ctx, 5000);
+  assert.ok(help.includes("tagger('hero')"));
 
   await rm(dir, { recursive: true });
 });
diff --git a/mcp/test/mcp-tools.test.js b/mcp/test/mcp-tools.test.js
index cd0b543..c3cb82e 100644
--- a/mcp/test/mcp-tools.test.js
+++ b/mcp/test/mcp-tools.test.js
@@ -123,6 +123,9 @@ describe('Tool Definitions', () => {
     assert.ok(promptBlock.includes('getCDPSession({ page })'), 'should mention relay-safe getCDPSession helper usage');
     assert.ok(promptBlock.includes('cleanHTML'), 'should mention cleanHTML helper');
     assert.ok(promptBlock.includes('pageMarkdown'), 'should mention pageMarkdown helper');
+    assert.ok(promptBlock.includes('pluginCatalog()'), 'should mention pluginCatalog built-in helper');
+    assert.ok(promptBlock.includes('pluginHelp(name, section?)'), 'should mention pluginHelp built-in helper');
+    assert.ok(promptBlock.includes('metadata-first'), 'should guide plugin usage as metadata-first');
     assert.ok(promptBlock.includes('newPage'), 'should mention creating new tabs');
     // Anti-patterns section
     assert.ok(promptBlock.includes('ANTI-PATTERN') || promptBlock.includes('Don\'t') || promptBlock.includes('✗'), 'should include anti-patterns');
diff --git a/mcp/test/plugin-loader.test.js b/mcp/test/plugin-loader.test.js
index 6ed6666..1166ea8 100644
--- a/mcp/test/plugin-loader.test.js
+++ b/mcp/test/plugin-loader.test.js
@@ -3,6 +3,7 @@ import assert from 'node:assert/strict';
 import { mkdtemp, mkdir, writeFile, rm } from 'node:fs/promises';
 import { join } from 'node:path';
 import { tmpdir } from 'node:os';
+import { fileURLToPath } from 'node:url';
 
 test('loadPlugins returns empty array when dir does not exist', async () => {
   const { loadPlugins } = await import('../src/plugin-loader.js');
@@ -17,7 +18,14 @@ test('loadPlugins loads a valid plugin folder', async () => {
   await writeFile(join(pluginDir, 'index.js'),
     `export default { name: 'hello', helpers: { greet: async (page) => 'hi' } };`
   );
-  await writeFile(join(pluginDir, 'SKILL.md'), '# hello\nUse greet() to say hi.');
+  const skillSource = `---
+name: hello-skill
+description: Friendly hello helper
+tags: greeting,starter
+---
+# hello
+Use greet() to say hi.`;
+  await writeFile(join(pluginDir, 'SKILL.md'), skillSource);
 
   const { loadPlugins } = await import('../src/plugin-loader.js');
   const plugins = await loadPlugins(dir);
@@ -25,7 +33,136 @@ test('loadPlugins loads a valid plugin folder', async () => {
   assert.equal(plugins.length, 1);
   assert.equal(plugins[0].name, 'hello');
   assert.equal(typeof plugins[0].helpers.greet, 'function');
-  assert.equal(plugins[0]._skill, '# hello\nUse greet() to say hi.');
+  assert.equal(plugins[0]._skill, skillSource);
+  assert.deepEqual(plugins[0]._skillMeta, {
+    name: 'hello-skill',
+    description: 'Friendly hello helper',
+  });
+  assert.equal(plugins[0]._skillBody, '# hello\nUse greet() to say hi.');
+
+  await rm(dir, { recursive: true });
+});
+
+test('loadPlugins ignores unknown SKILL frontmatter keys', async () => {
+  const dir = await mkdtemp(join(tmpdir(), 'bf-test-'));
+  const pluginDir = join(dir, 'meta-keys');
+  await mkdir(pluginDir);
+  await writeFile(
+    join(pluginDir, 'index.js'),
+    `export default { name: 'meta-keys', helpers: { noop: async () => null } };`
+  );
+  await writeFile(
+    join(pluginDir, 'SKILL.md'),
+    `---
+name: meta-keys
+description: plugin metadata
+helpers: noop
+unknown: should-be-ignored
+tags: also-ignored
+---
+# Meta Keys
+Details`
+  );
+
+  const { loadPlugins } = await import('../src/plugin-loader.js');
+  const plugins = await loadPlugins(dir);
+
+  assert.equal(plugins.length, 1);
+  assert.deepEqual(plugins[0]._skillMeta, {
+    name: 'meta-keys',
+    description: 'plugin metadata',
+    helpers: ['noop'],
+  });
+
+  await rm(dir, { recursive: true });
+});
+
+test('loadPlugins preserves description text after first colon', async () => {
+  const dir = await mkdtemp(join(tmpdir(), 'bf-test-'));
+  const pluginDir = join(dir, 'desc-colons');
+  await mkdir(pluginDir);
+  await writeFile(
+    join(pluginDir, 'index.js'),
+    `export default { name: 'desc-colons', helpers: { noop: async () => null } };`
+  );
+  await writeFile(
+    join(pluginDir, 'SKILL.md'),
+    `---
+name: desc-colons
+description: A: B: C
+---
+# Desc Colons
+Details`
+  );
+
+  const { loadPlugins } = await import('../src/plugin-loader.js');
+  const plugins = await loadPlugins(dir);
+
+  assert.equal(plugins.length, 1);
+  assert.equal(plugins[0]._skillMeta.description, 'A: B: C');
+
+  await rm(dir, { recursive: true });
+});
+
+test('loadPlugins parses YAML list frontmatter values for canonical list keys', async () => {
+  const dir = await mkdtemp(join(tmpdir(), 'bf-test-'));
+  const pluginDir = join(dir, 'yaml-lists');
+  await mkdir(pluginDir);
+  await writeFile(
+    join(pluginDir, 'index.js'),
+    `export default { name: 'yaml-lists', helpers: { alpha: async () => null } };`
+  );
+  await writeFile(
+    join(pluginDir, 'SKILL.md'),
+    `---
+name: yaml-lists
+description: Uses YAML list values
+helpers:
+  - alpha
+  - beta
+tools:
+  - read_sheet
+when_to_use:
+  - First scenario
+  - Second scenario
+---
+# YAML Lists
+Body`
+  );
+
+  const { loadPlugins } = await import('../src/plugin-loader.js');
+  const plugins = await loadPlugins(dir);
+
+  assert.equal(plugins.length, 1);
+  assert.deepEqual(plugins[0]._skillMeta.helpers, ['alpha', 'beta']);
+  assert.deepEqual(plugins[0]._skillMeta.tools, ['read_sheet']);
+  assert.deepEqual(plugins[0]._skillMeta.when_to_use, ['First scenario', 'Second scenario']);
+
+  await rm(dir, { recursive: true });
+});
+
+test('loadPlugins tolerates malformed frontmatter without crashing', async () => {
+  const dir = await mkdtemp(join(tmpdir(), 'bf-test-'));
+  const pluginDir = join(dir, 'malformed');
+  await mkdir(pluginDir);
+  await writeFile(
+    join(pluginDir, 'index.js'),
+    `export default { name: 'malformed', helpers: { noop: async () => null } };`
+  );
+  const malformedSkill = `---
+name malformed
+description missing colon
+# no closing fence
+# Malformed
+Still loads`;
+  await writeFile(join(pluginDir, 'SKILL.md'), malformedSkill);
+
+  const { loadPlugins } = await import('../src/plugin-loader.js');
+  const plugins = await loadPlugins(dir);
+
+  assert.equal(plugins.length, 1);
+  assert.deepEqual(plugins[0]._skillMeta, {});
+  assert.equal(plugins[0]._skillBody, malformedSkill);
 
   await rm(dir, { recursive: true });
 });
@@ -63,13 +200,135 @@ test('buildPluginHelpers merges helpers from multiple plugins', async () => {
 test('buildPluginSkillAppendix skips plugins with empty skill', async () => {
   const { buildPluginSkillAppendix } = await import('../src/plugin-loader.js');
   const plugins = [
-    { name: 'a', _skill: 'Use foo() for X.' },
-    { name: 'b', _skill: '' },
-    { name: 'c', _skill: 'Use bar() for Y.' },
+    {
+      name: 'a',
+      helpers: { foo: () => 'x' },
+      _skillMeta: { description: 'Helper for X' },
+      _skillBody: 'Detailed instructions for X',
+    },
+    { name: 'b', helpers: { noop: () => null }, _skillMeta: {}, _skillBody: '' },
+    {
+      name: 'c',
+      helpers: { bar: () => 'y' },
+      _skillMeta: { description: 'Helper for Y' },
+      _skillBody: 'Detailed instructions for Y',
+    },
   ];
   const appendix = buildPluginSkillAppendix(plugins);
+
+  assert.ok(appendix.includes('pluginCatalog()'));
+  assert.ok(appendix.includes('pluginHelp(name, section?)'));
   assert.ok(appendix.includes('PLUGIN: a'));
-  assert.ok(appendix.includes('Use foo() for X.'));
+  assert.ok(appendix.includes('Helper for X'));
+  assert.ok(appendix.includes('foo'));
   assert.ok(appendix.includes('PLUGIN: c'));
+  assert.ok(appendix.includes('Helper for Y'));
   assert.ok(!appendix.includes('PLUGIN: b'));
+  assert.ok(!appendix.includes('Detailed instructions for X'));
+  assert.ok(!appendix.includes('Detailed instructions for Y'));
+});
+
+test('loadPlugins parses block scalar frontmatter values for canonical keys', async () => {
+  const dir = await mkdtemp(join(tmpdir(), 'bf-test-'));
+  const pluginDir = join(dir, 'block-scalars');
+  await mkdir(pluginDir);
+  await writeFile(
+    join(pluginDir, 'index.js'),
+    `export default { name: 'block-scalars', helpers: { noop: async () => null } };`
+  );
+  await writeFile(
+    join(pluginDir, 'SKILL.md'),
+    `---
+name: block-scalars
+description: |
+  First line.
+  Second line.
+when_to_use: >
+  Use this helper
+  when pages are ready.
+---
+# Block Scalars
+Body`
+  );
+
+  const { loadPlugins } = await import('../src/plugin-loader.js');
+  const plugins = await loadPlugins(dir);
+
+  assert.equal(plugins.length, 1);
+  assert.equal(plugins[0]._skillMeta.description, 'First line.\nSecond line.');
+  assert.deepEqual(plugins[0]._skillMeta.when_to_use, ['Use this helper when pages are ready.']);
+
+  await rm(dir, { recursive: true });
+});
+
+test('buildPluginSkillRuntime ignores section headings inside fenced code blocks', async () => {
+  const { buildPluginSkillRuntime } = await import('../src/plugin-loader.js');
+  const runtime = buildPluginSkillRuntime([
+    {
+      name: 'fences',
+      helpers: {},
+      _skillMeta: {},
+      _skillBody: `Intro
+
+## usage
+Visible section text.
+
+\`\`\`md
+## not-a-section
+\`\`\`
+
+## examples
+Real examples section.`,
+    },
+  ]);
+
+  const usage = runtime.byName.fences.sections.usage;
+  assert.ok(usage.includes('Visible section text.'));
+  assert.ok(usage.includes('## not-a-section'));
+  assert.deepEqual(Object.keys(runtime.byName.fences.sections), ['usage', 'examples']);
+});
+
+test('buildPluginSkillRuntime keeps first plugin for duplicate normalized names and warns', async () => {
+  const { buildPluginSkillRuntime } = await import('../src/plugin-loader.js');
+
+  const originalStderrWrite = process.stderr.write;
+  let stderr = '';
+  process.stderr.write = function patchedWrite(chunk, ...args) {
+    stderr += String(chunk);
+    const maybeCallback = args[args.length - 1];
+    if (typeof maybeCallback === 'function') maybeCallback();
+    return true;
+  };
+
+  try {
+    const runtime = buildPluginSkillRuntime([
+      { name: 'Dupe', helpers: {}, _skillMeta: {}, _skillBody: '## one\nfirst' },
+      { name: 'dupe', helpers: {}, _skillMeta: {}, _skillBody: '## one\nsecond' },
+    ]);
+
+    assert.equal(runtime.catalog.length, 1);
+    assert.equal(runtime.catalog[0].name, 'Dupe');
+    assert.equal(runtime.byName.dupe.name, 'Dupe');
+    assert.equal(runtime.byName.dupe.sections.one, 'first');
+    assert.match(stderr, /Duplicate plugin skill name/i);
+    assert.match(stderr, /Keeping first/i);
+  } finally {
+    process.stderr.write = originalStderrWrite;
+  }
+});
+
+test('loadPlugins parses metadata shape from official google-sheets SKILL fixture', async () => {
+  const officialPluginsDir = fileURLToPath(new URL('../../plugins/official', import.meta.url));
+  const { loadPlugins } = await import('../src/plugin-loader.js');
+  const plugins = await loadPlugins(officialPluginsDir);
+  const googleSheets = plugins.find((plugin) => plugin.name === 'google-sheets');
+
+  assert.ok(googleSheets);
+  assert.equal(googleSheets._skillMeta.name, 'google-sheets');
+  assert.equal(typeof googleSheets._skillMeta.description, 'string');
+  assert.equal(Array.isArray(googleSheets._skillMeta.when_to_use), true);
+  assert.equal(Array.isArray(googleSheets._skillMeta.helpers), true);
+  assert.equal(Array.isArray(googleSheets._skillMeta.tools), true);
+  assert.ok(googleSheets._skillMeta.when_to_use.length > 0);
+  assert.ok(googleSheets._skillMeta.helpers.length > 0);
 });
diff --git a/plugins/official/google-sheets/SKILL.md b/plugins/official/google-sheets/SKILL.md
index 82a0fca..3bae8d5 100644
--- a/plugins/official/google-sheets/SKILL.md
+++ b/plugins/official/google-sheets/SKILL.md
@@ -1,3 +1,11 @@
+---
+name: google-sheets
+description: Google Sheets helpers for reading, summarizing, formatting, and issue logging in the active sheet.
+when_to_use: ["Summarizing an active Google Sheet quickly", "Reading specific cells or contiguous used rows", "Applying bullet splitting and sparse bold formatting across ranges", "Logging extraction or formatting failures for follow-up"]
+helpers: ["gsGetMeta", "gsGotoCell", "gsReadCell", "gsReadContiguousRows", "gsSummarizeSheet", "gsSplitBulletsInRange", "gsRebalanceBoldInRange", "gsFormatBulletsInRange", "gsLogIssue", "gsIssueLogPath"]
+tools: []
+---
+
 ## google-sheets plugin
 
 Use Google Sheets helpers when work involves reading, summarizing, or structuring sheet content from the active page without guesswork.
@@ -32,6 +40,7 @@ When the user says "summarize this page/sheet", "read this sheet", or equivalent
 - Use `gsReadContiguousRows({ columns: ['A','B'], startRow: 1, maxRows: 30, emptyStreakStop: 2 })`.
 - Always report `scannedRows`, `usedRowCount`, and `stopReason` when summarizing extraction.
 - For summary requests, prefer `gsSummarizeSheet()` over ad-hoc DOM probing loops.
+- `gsSummarizeSheet()` reuses a recent in-session scan by default; set `forceRefresh: true` when the user asks for a guaranteed fresh pull.
 - Prefer `gsFormatBulletsInRange()` for multi-cell content cleanup tasks.
 - Use `dryRun: true` first for formatting helpers when changing many cells.
 - Log every process failure or unexpected behavior with `gsLogIssue(...)`.
diff --git a/plugins/official/google-sheets/index.js b/plugins/official/google-sheets/index.js
index 1ede7cd..fa60bb6 100644
--- a/plugins/official/google-sheets/index.js
+++ b/plugins/official/google-sheets/index.js
@@ -5,6 +5,9 @@ import { homedir } from 'node:os';
 const DEFAULT_SCAN_MAX_ROWS = 30;
 const DEFAULT_EMPTY_STREAK_STOP = 2;
 const DEFAULT_EDITOR_WAIT_MS = 35;
+const DEFAULT_SUMMARY_CACHE_TTL_MS = 5 * 60 * 1000;
+const SUMMARY_CACHE_MAX_ENTRIES = 24;
+const SUMMARY_CACHE_STATE_KEY = '__gsSummaryCache';
 const DEFAULT_LOG_PATH = join(homedir(), '.browserforce', 'logs', 'google-sheets-issues.jsonl');
 const SHEETS_URL_RE = /^https:\/\/docs\.google\.com\/spreadsheets\//;
 
@@ -511,6 +514,124 @@ async function inferColumnsFromHeaderRow(page, options = {}) {
   return fallback;
 }
 
+function getSummaryScanConfig(options = {}, explicitColumns = null) {
+  const startRow = Number.isInteger(options.startRow) && options.startRow > 0 ? options.startRow : 1;
+  const maxRows = Number.isInteger(options.maxRows) && options.maxRows > 0
+    ? options.maxRows
+    : DEFAULT_SCAN_MAX_ROWS;
+  const emptyStreakStop = Number.isInteger(options.emptyStreakStop) && options.emptyStreakStop > 0
+    ? options.emptyStreakStop
+    : DEFAULT_EMPTY_STREAK_STOP;
+  const trim = options.trim !== false;
+
+  if (explicitColumns) {
+    return {
+      mode: 'explicit',
+      columns: explicitColumns,
+      startRow,
+      maxRows,
+      emptyStreakStop,
+      trim,
+    };
+  }
+
+  const maxColumns = Number.isInteger(options.maxColumns) && options.maxColumns > 0
+    ? options.maxColumns
+    : 8;
+  const emptyColumnStreakStop = Number.isInteger(options.emptyColumnStreakStop) && options.emptyColumnStreakStop > 0
+    ? options.emptyColumnStreakStop
+    : 1;
+  const fallbackColumnsCount = Number.isInteger(options.fallbackColumnsCount) && options.fallbackColumnsCount > 0
+    ? options.fallbackColumnsCount
+    : 2;
+  const startColumn = normalizeColumns([options.startColumn || 'A'])[0];
+
+  return {
+    mode: 'auto',
+    startRow,
+    maxRows,
+    emptyStreakStop,
+    trim,
+    startColumn,
+    maxColumns,
+    emptyColumnStreakStop,
+    fallbackColumnsCount,
+  };
+}
+
+function buildSummaryCacheKey(sheetMeta, options = {}, explicitColumns = null) {
+  const identity = {
+    spreadsheetId: sheetMeta?.spreadsheetId || null,
+    gid: sheetMeta?.gid || null,
+  };
+  const config = getSummaryScanConfig(options, explicitColumns);
+  return JSON.stringify({ identity, config });
+}
+
+function getSummaryCacheMap(state) {
+  if (!state || typeof state !== 'object') return null;
+  if (!(state[SUMMARY_CACHE_STATE_KEY] instanceof Map)) {
+    state[SUMMARY_CACHE_STATE_KEY] = new Map();
+  }
+  return state[SUMMARY_CACHE_STATE_KEY];
+}
+
+function readSummaryCacheEntry(state, cacheKey, ttlMs) {
+  const cache = getSummaryCacheMap(state);
+  if (!cache) return null;
+
+  const entry = cache.get(cacheKey);
+  if (!entry) return null;
+
+  const ageMs = Date.now() - entry.cachedAt;
+  if (ttlMs >= 0 && ageMs > ttlMs) {
+    cache.delete(cacheKey);
+    return null;
+  }
+
+  return entry;
+}
+
+function writeSummaryCacheEntry(state, cacheKey, value) {
+  const cache = getSummaryCacheMap(state);
+  if (!cache) return;
+
+  cache.delete(cacheKey);
+  cache.set(cacheKey, { cachedAt: Date.now(), ...value });
+
+  while (cache.size > SUMMARY_CACHE_MAX_ENTRIES) {
+    const oldestKey = cache.keys().next().value;
+    cache.delete(oldestKey);
+  }
+}
+
+function clearSummaryCache(state) {
+  const cache = getSummaryCacheMap(state);
+  if (cache) cache.clear();
+}
+
+function buildSummaryResult(sheet, columns, scanResult, options = {}) {
+  const includeRows = options.includeRows === true;
+  const previewRows = Number.isInteger(options.previewRows) && options.previewRows > 0 ? options.previewRows : 8;
+  const preview = scanResult.rows.slice(0, previewRows).map((entry) => ({ row: entry.row, cells: entry.cells }));
+  const firstDataRow = scanResult.rows[0] || null;
+  const headerCandidate = scanResult.rows.find((entry) => entry.row === scanResult.config.startRow) || null;
+
+  return {
+    sheet,
+    columns,
+    scan: {
+      scannedRows: scanResult.scannedRows,
+      usedRowCount: scanResult.usedRowCount,
+      stopReason: scanResult.stopReason,
+    },
+    firstDataRow: firstDataRow ? { row: firstDataRow.row, cells: firstDataRow.cells } : null,
+    headerCandidate: headerCandidate ? { row: headerCandidate.row, cells: headerCandidate.cells } : null,
+    preview,
+    ...(includeRows ? { rows: scanResult.rows } : {}),
+  };
+}
+
 export default {
   name: 'google-sheets',
   description: 'Google Sheets helpers for reliable row scanning, cell reads, and issue logging',
@@ -543,30 +664,30 @@ export default {
     gsSummarizeSheet: async (page, ctx, state, options = {}) => {
       assertGoogleSheet(page, 'gsSummarizeSheet');
       const title = await page.title();
-      const sheet = { ...parseSheetMeta(page.url()), title };
-      const includeRows = options.includeRows === true;
-      const previewRows = Number.isInteger(options.previewRows) && options.previewRows > 0 ? options.previewRows : 8;
-      const columns = options.columns
-        ? normalizeColumns(options.columns)
-        : await inferColumnsFromHeaderRow(page, options);
+      const sheetMeta = parseSheetMeta(page.url());
+      const sheet = { ...sheetMeta, title };
+      const explicitColumns = options.columns ? normalizeColumns(options.columns) : null;
+      const forceRefresh = options.forceRefresh === true;
+      const useCache = options.useCache !== false;
+      const cacheTtlMs = Number.isInteger(options.cacheTtlMs) && options.cacheTtlMs >= 0
+        ? options.cacheTtlMs
+        : DEFAULT_SUMMARY_CACHE_TTL_MS;
+      const cacheKey = buildSummaryCacheKey(sheetMeta, options, explicitColumns);
+
+      if (useCache && !forceRefresh) {
+        const cached = readSummaryCacheEntry(state, cacheKey, cacheTtlMs);
+        if (cached) {
+          return buildSummaryResult(sheet, cached.columns, cached.scanResult, options);
+        }
+      }
+
+      const columns = explicitColumns || await inferColumnsFromHeaderRow(page, options);
       const scanResult = await scanContiguousRows(page, { ...options, columns });
-      const preview = scanResult.rows.slice(0, previewRows).map((entry) => ({ row: entry.row, cells: entry.cells }));
-      const firstDataRow = scanResult.rows[0] || null;
-      const headerCandidate = scanResult.rows.find((entry) => entry.row === scanResult.config.startRow) || null;
+      if (useCache) {
+        writeSummaryCacheEntry(state, cacheKey, { columns, scanResult });
+      }
 
-      return {
-        sheet,
-        columns,
-        scan: {
-          scannedRows: scanResult.scannedRows,
-          usedRowCount: scanResult.usedRowCount,
-          stopReason: scanResult.stopReason,
-        },
-        firstDataRow: firstDataRow ? { row: firstDataRow.row, cells: firstDataRow.cells } : null,
-        headerCandidate: headerCandidate ? { row: headerCandidate.row, cells: headerCandidate.cells } : null,
-        preview,
-        ...(includeRows ? { rows: scanResult.rows } : {}),
-      };
+      return buildSummaryResult(sheet, columns, scanResult, options);
     },
 
     gsLogIssue: async (page, ctx, state, summary, details = {}, options = {}) => {
@@ -654,13 +775,20 @@ export default {
         }
       }
 
+      const changedCount = results.filter((r) => r.changed).length;
+      const unchangedCount = results.filter((r) => r.status === 'unchanged').length;
+      const okCount = results.filter((r) => r.status === 'ok' || r.status === 'dry_run').length;
+      const failedCount = results.filter((r) => r.status === 'error' || r.status === 'verify_failed').length;
+
+      if (!dryRun && changedCount > 0) clearSummaryCache(state);
+
       return {
         rangeRef: String(rangeRef),
         total: results.length,
-        changed: results.filter((r) => r.changed).length,
-        unchanged: results.filter((r) => r.status === 'unchanged').length,
-        ok: results.filter((r) => r.status === 'ok' || r.status === 'dry_run').length,
-        failed: results.filter((r) => r.status === 'error' || r.status === 'verify_failed').length,
+        changed: changedCount,
+        unchanged: unchangedCount,
+        ok: okCount,
+        failed: failedCount,
         results,
       };
     },
@@ -743,13 +871,20 @@ export default {
         }
       }
 
+      const changedCount = results.filter((r) => r.changed).length;
+      const unchangedCount = results.filter((r) => r.status === 'unchanged').length;
+      const okCount = results.filter((r) => r.status === 'ok' || r.status === 'dry_run').length;
+      const failedCount = results.filter((r) => r.status === 'error' || r.status === 'verify_failed').length;
+
+      if (!dryRun && changedCount > 0) clearSummaryCache(state);
+
       return {
         rangeRef: String(rangeRef),
         total: results.length,
-        changed: results.filter((r) => r.changed).length,
-        unchanged: results.filter((r) => r.status === 'unchanged').length,
-        ok: results.filter((r) => r.status === 'ok' || r.status === 'dry_run').length,
-        failed: results.filter((r) => r.status === 'error' || r.status === 'verify_failed').length,
+        changed: changedCount,
+        unchanged: unchangedCount,
+        ok: okCount,
+        failed: failedCount,
         results,
       };
     },
@@ -843,13 +978,20 @@ export default {
         }
       }
 
+      const changedCount = results.filter((r) => r.changed).length;
+      const unchangedCount = results.filter((r) => r.status === 'unchanged').length;
+      const okCount = results.filter((r) => r.status === 'ok' || r.status === 'dry_run').length;
+      const failedCount = results.filter((r) => r.status === 'error' || r.status === 'verify_failed').length;
+
+      if (!dryRun && changedCount > 0) clearSummaryCache(state);
+
       return {
         rangeRef: String(rangeRef),
         total: results.length,
-        changed: results.filter((r) => r.changed).length,
-        unchanged: results.filter((r) => r.status === 'unchanged').length,
-        ok: results.filter((r) => r.status === 'ok' || r.status === 'dry_run').length,
-        failed: results.filter((r) => r.status === 'error' || r.status === 'verify_failed').length,
+        changed: changedCount,
+        unchanged: unchangedCount,
+        ok: okCount,
+        failed: failedCount,
         results,
       };
     },
diff --git a/plugins/official/highlight/SKILL.md b/plugins/official/highlight/SKILL.md
index 6578142..4ae7633 100644
--- a/plugins/official/highlight/SKILL.md
+++ b/plugins/official/highlight/SKILL.md
@@ -1,3 +1,11 @@
+---
+name: highlight
+description: Visual outlining helpers that highlight matching elements and clear applied outlines.
+when_to_use: ["Visually identifying matched elements before interaction", "Debugging selectors on complex pages", "Clearing temporary visual outlines after inspection"]
+helpers: ["highlight", "clearHighlights"]
+tools: []
+---
+
 ## highlight(selector, color?)
 Visually highlight matching elements with a colored outline. Default color: red.
 Returns the number of elements highlighted.
diff --git a/plugins/official/openclaw/SKILL.md b/plugins/official/openclaw/SKILL.md
index 4069ff9..57855cd 100644
--- a/plugins/official/openclaw/SKILL.md
+++ b/plugins/official/openclaw/SKILL.md
@@ -1,3 +1,11 @@
+---
+name: openclaw
+description: BrowserForce tab-attachment policy notes for OpenClaw operating constraints.
+when_to_use: ["Applying OpenClaw tab policy in BrowserForce sessions", "Deciding whether to auto-create a dedicated tab", "Reporting actionable attach/share blockers to the user"]
+helpers: []
+tools: []
+---
+
 ## BrowserForce tab policy (OpenClaw)
 
 - Do not ask the user to click Attach/Share by default.
diff --git a/relay/src/index.js b/relay/src/index.js
index 1335f1f..2545d22 100644
--- a/relay/src/index.js
+++ b/relay/src/index.js
@@ -409,7 +409,8 @@ class RelayServer {
 
     // ─── Plugin Routes ───────────────────────────────────────────────────────
 
-    if (url.pathname === '/plugins' && req.method === 'GET') {
+    const isPluginsListPath = url.pathname === '/plugins' || url.pathname === '/v1/plugins';
+    if (isPluginsListPath && req.method === 'GET') {
       try {
         const entries = fs.existsSync(this.pluginsDir)
           ? fs.readdirSync(this.pluginsDir, { withFileTypes: true })
@@ -424,7 +425,8 @@ class RelayServer {
       return;
     }
 
-    if (url.pathname === '/plugins/install' && req.method === 'POST') {
+    const isPluginsInstallPath = url.pathname === '/plugins/install' || url.pathname === '/v1/plugins/install';
+    if (isPluginsInstallPath && req.method === 'POST') {
       if (!this._requireAuth(req, res)) return;
       let body = '';
       req.on('data', chunk => { body += chunk; });
@@ -447,7 +449,7 @@ class RelayServer {
       return;
     }
 
-    const deleteMatch = url.pathname.match(/^\/plugins\/([a-z0-9_-]+)$/);
+    const deleteMatch = url.pathname.match(/^\/(?:v1\/)?plugins\/([a-z0-9_-]+)$/);
     if (deleteMatch && req.method === 'DELETE') {
       if (!this._requireAuth(req, res)) return;
       const name = deleteMatch[1];

From 81d51ba4ba15249ada75520ec29aa99728b55e2a Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 23:33:37 +0530
Subject: [PATCH 174/192] agent-panel: start fresh session when opened from
 popup

- persist a one-shot open-agent request in popup before opening side panel

- consume/watch that request in agent panel to create a new conversation on open

- keep existing behavior for normal tab switching while panel remains open

- add contract tests for popup signal and panel fresh-session handling
---
 extension/agent-panel.js                     | 75 +++++++++++++++++++-
 extension/popup.js                           |  9 +++
 test/agent/agent-panel-send-contract.test.js | 16 +++++
 test/agent/popup-contract.test.js            |  4 ++
 4 files changed, 103 insertions(+), 1 deletion(-)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 209f761..66b59d1 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -16,6 +16,8 @@ const REASONING_PRESETS = [
   { value: 'high', label: 'High' },
   { value: 'xhigh', label: 'Extra High' },
 ];
+const BROWSERFORCE_AGENT_OPEN_REQUEST_KEY = 'browserforceAgentOpenRequest';
+const BROWSERFORCE_AGENT_OPEN_REQUEST_MAX_AGE_MS = 60_000;
 
 const state = {
   value: initialState,
@@ -27,6 +29,9 @@ const state = {
   latestReasoningTitleByRun: {},
   transcriptHandlersBound: false,
   tabAttachWatchersBound: false,
+  agentOpenRequestWatcherBound: false,
+  lastHandledAgentOpenRequestId: null,
+  pendingAgentOpenRequest: null,
   initialTabAttachInFlight: false,
   initialTabAttachStarted: false,
   editingSessionId: null,
@@ -885,6 +890,65 @@ function escapeHtml(value) {
   return div.innerHTML;
 }
 
+function normalizeAgentOpenRequest(raw) {
+  if (!raw || typeof raw !== 'object') return null;
+  const requestId = String(raw.requestId || '').trim();
+  const requestedAt = Number(raw.requestedAt);
+  if (!requestId || !Number.isFinite(requestedAt)) return null;
+  if ((Date.now() - requestedAt) > BROWSERFORCE_AGENT_OPEN_REQUEST_MAX_AGE_MS) return null;
+  return {
+    requestId,
+    requestedAt,
+    source: String(raw.source || '').trim() || null,
+  };
+}
+
+async function consumePendingAgentOpenRequest() {
+  if (!chrome?.storage?.local?.get || !chrome?.storage?.local?.remove) return null;
+  try {
+    const stored = await chrome.storage.local.get([BROWSERFORCE_AGENT_OPEN_REQUEST_KEY]);
+    const request = normalizeAgentOpenRequest(stored?.[BROWSERFORCE_AGENT_OPEN_REQUEST_KEY]);
+    if (!request) return null;
+    await chrome.storage.local.remove(BROWSERFORCE_AGENT_OPEN_REQUEST_KEY);
+    state.lastHandledAgentOpenRequestId = request.requestId;
+    return request;
+  } catch {
+    return null;
+  }
+}
+
+async function startFreshSessionFromOpenRequest(rawRequest) {
+  const request = normalizeAgentOpenRequest(rawRequest);
+  if (!request) return;
+  if (state.lastHandledAgentOpenRequestId === request.requestId) return;
+  state.lastHandledAgentOpenRequestId = request.requestId;
+  if (!state.auth) {
+    state.pendingAgentOpenRequest = request;
+    return;
+  }
+  try {
+    await chrome.storage.local.remove(BROWSERFORCE_AGENT_OPEN_REQUEST_KEY);
+  } catch {
+    // best-effort cleanup
+  }
+  state.pendingAgentOpenRequest = null;
+  await createSession();
+}
+
+function bindAgentOpenRequestWatcher() {
+  if (state.agentOpenRequestWatcherBound) return;
+  if (!chrome?.storage?.onChanged?.addListener) return;
+  state.agentOpenRequestWatcherBound = true;
+  chrome.storage.onChanged.addListener((changes, areaName) => {
+    if (areaName !== 'local') return;
+    const change = changes?.[BROWSERFORCE_AGENT_OPEN_REQUEST_KEY];
+    if (!change?.newValue) return;
+    startFreshSessionFromOpenRequest(change.newValue).catch((error) => {
+      setStatus('error', error?.message || 'Unable to start a new conversation');
+    });
+  });
+}
+
 function sleep(ms) {
   return new Promise((resolve) => setTimeout(resolve, ms));
 }
@@ -1429,6 +1493,15 @@ async function initializePanel() {
   setComposerEnabled(false);
   setStatus('info', 'Connecting...');
   render();
+  bindAgentOpenRequestWatcher();
+  const openRequest = await consumePendingAgentOpenRequest();
+  let shouldStartFreshSession = !!openRequest;
+  if (shouldStartFreshSession) {
+    state.pendingAgentOpenRequest = null;
+  } else if (state.pendingAgentOpenRequest) {
+    shouldStartFreshSession = true;
+    state.pendingAgentOpenRequest = null;
+  }
   startInitialTabAttach();
   await loadAuth();
   bindTabAttachWatchers();
@@ -1439,7 +1512,7 @@ async function initializePanel() {
     state.defaultReasoningEffort = 'medium';
   }
   await loadSessions();
-  if (!state.value.activeSessionId) {
+  if (shouldStartFreshSession || !state.value.activeSessionId) {
     await createSession();
   } else {
     await selectSession(state.value.activeSessionId);
diff --git a/extension/popup.js b/extension/popup.js
index a0d4bee..70ba4ed 100644
--- a/extension/popup.js
+++ b/extension/popup.js
@@ -1,6 +1,7 @@
 // BrowserForce — Popup UI
 
 const RELAY_URL_DEFAULT = 'ws://127.0.0.1:19222/extension';
+const BROWSERFORCE_AGENT_OPEN_REQUEST_KEY = 'browserforceAgentOpenRequest';
 
 // Auto-generated instruction lines per restriction
 const RESTRICTION_LINES = {
@@ -199,6 +200,14 @@ attachBtn.addEventListener('click', () => {
 openAgentBtn.addEventListener('click', async () => {
   try {
     const [tab] = await chrome.tabs.query({ active: true, currentWindow: true });
+    await chrome.storage.local.set({
+      [BROWSERFORCE_AGENT_OPEN_REQUEST_KEY]: {
+        requestId: (globalThis.crypto?.randomUUID?.() || `bf-open-${Date.now()}`),
+        requestedAt: Date.now(),
+        source: 'popup-open-agent',
+        tabId: Number.isFinite(tab?.id) ? Number(tab.id) : null,
+      },
+    });
     await chrome.sidePanel.open({ windowId: tab?.windowId });
     window.close();
   } catch {
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index a79b92c..8fdec08 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -88,6 +88,22 @@ test('init opens smoothly by starting tab attach asynchronously', () => {
   assert.doesNotMatch(initBlock, /await ensureCurrentTabAttached\(\);/);
 });
 
+test('popup open-agent request can force a fresh session on panel init', () => {
+  assert.match(js, /BROWSERFORCE_AGENT_OPEN_REQUEST_KEY/);
+  assert.match(js, /function normalizeAgentOpenRequest\(/);
+  assert.match(js, /async function consumePendingAgentOpenRequest\(/);
+  assert.match(js, /async function initializePanel\(\)[\s\S]*consumePendingAgentOpenRequest\(\)/);
+  assert.match(js, /if \(shouldStartFreshSession \|\| !state\.value\.activeSessionId\)\s*\{\s*await createSession\(\);/);
+});
+
+test('panel watches open-agent request changes and starts a fresh session when already open', () => {
+  assert.match(js, /function bindAgentOpenRequestWatcher\(/);
+  assert.match(js, /chrome\.storage\.onChanged\.addListener/);
+  assert.match(js, /changes\?\.\[BROWSERFORCE_AGENT_OPEN_REQUEST_KEY\]/);
+  assert.match(js, /startFreshSessionFromOpenRequest\(change\.newValue\)/);
+  assert.match(js, /if \(!state\.auth\)\s*\{[\s\S]*state\.pendingAgentOpenRequest = request;/);
+});
+
 test('tab-attach banner shows progress during initial auto-attach and suppresses not-connected state', () => {
   assert.match(js, /function getTabAttachInProgressState\(\)/);
   assert.match(js, /text:\s*'Currently attaching active tab\.\.\.'/);
diff --git a/test/agent/popup-contract.test.js b/test/agent/popup-contract.test.js
index 72cd6f5..f83a5e1 100644
--- a/test/agent/popup-contract.test.js
+++ b/test/agent/popup-contract.test.js
@@ -17,6 +17,10 @@ test('logs viewer requests include extension identity header', () => {
 });
 
 test('open agent action opens side panel and closes popup', () => {
+  assert.match(popupJs, /BROWSERFORCE_AGENT_OPEN_REQUEST_KEY/);
+  assert.match(popupJs, /chrome\.storage\.local\.set\(/);
+  assert.match(popupJs, /source:\s*'popup-open-agent'/);
+  assert.match(popupJs, /await chrome\.storage\.local\.set\([\s\S]*await chrome\.sidePanel\.open\(/);
   assert.match(popupJs, /chrome\.sidePanel\.open\(/);
   assert.match(popupJs, /window\.close\(\)/);
 });

From 3f0529e14ff4278a83f363377b63f8f4f99bf5ef Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 23:51:34 +0530
Subject: [PATCH 175/192] agent-panel: render full markdown in assistant chat

- add block-level markdown renderer for assistant messages (headings, lists, task lists, blockquotes, code fences, tables, hr)

- keep inline markdown rendering for timeline labels/details with safe links/images and emphasis

- switch assistant bubble message rendering to markdown block output

- add markdown styling for md-content blocks and expand runtime/contract coverage
---
 extension/agent-panel-runtime.js             | 359 ++++++++++++++++++-
 extension/agent-panel.css                    | 167 +++++++++
 extension/agent-panel.js                     |   7 +-
 test/agent/agent-panel-runtime.test.js       |  68 ++++
 test/agent/agent-panel-send-contract.test.js |   6 +
 5 files changed, 601 insertions(+), 6 deletions(-)

diff --git a/extension/agent-panel-runtime.js b/extension/agent-panel-runtime.js
index 0cb7eef..8faa89a 100644
--- a/extension/agent-panel-runtime.js
+++ b/extension/agent-panel-runtime.js
@@ -36,10 +36,363 @@ function escapeHtml(value) {
     .replace(/'/g, '&#39;');
 }
 
+function escapeRegex(value) {
+  return String(value || '').replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
+}
+
+function matchFencedBlockStart(line) {
+  const match = String(line || '').match(/^\s*(`{3,}|~{3,})\s*([\w+-]+)?\s*$/);
+  if (!match) return null;
+  return {
+    fence: match[1],
+    language: String(match[2] || '').trim().toLowerCase(),
+  };
+}
+
+function isFencedBlockClose(line, fence) {
+  if (!fence) return false;
+  const fenceChar = fence[0];
+  const fenceLength = fence.length;
+  const matcher = new RegExp(`^\\s*${escapeRegex(fenceChar)}{${fenceLength},}\\s*$`);
+  return matcher.test(String(line || ''));
+}
+
+function isMarkdownHeading(line) {
+  return /^\s{0,3}#{1,6}\s+\S/.test(String(line || ''));
+}
+
+function isMarkdownHorizontalRule(line) {
+  return /^\s{0,3}(?:\*{3,}|-{3,}|_{3,})\s*$/.test(String(line || ''));
+}
+
+function isMarkdownBlockquote(line) {
+  return /^\s{0,3}>\s?/.test(String(line || ''));
+}
+
+function matchMarkdownListItem(line) {
+  const match = String(line || '').match(/^(\s*)([-+*]|\d+\.)\s+(.+)$/);
+  if (!match) return null;
+  const marker = match[2];
+  return {
+    indent: match[1].length,
+    ordered: /\d+\./.test(marker),
+    content: String(match[3] || '').trim(),
+  };
+}
+
+function isMarkdownTableSeparator(line) {
+  const normalized = String(line || '').trim();
+  if (!normalized.includes('|')) return false;
+  return /^\|?\s*:?-{3,}:?\s*(\|\s*:?-{3,}:?\s*)+\|?$/.test(normalized);
+}
+
+function splitMarkdownTableRow(line) {
+  const raw = String(line || '').trim();
+  if (!raw) return [];
+  const startTrimmed = raw.startsWith('|') ? raw.slice(1) : raw;
+  const value = startTrimmed.endsWith('|') ? startTrimmed.slice(0, -1) : startTrimmed;
+  return value.split('|').map((cell) => cell.trim());
+}
+
+function parseMarkdownTableAlignments(separatorLine) {
+  const cells = splitMarkdownTableRow(separatorLine);
+  return cells.map((cell) => {
+    const value = String(cell || '').trim();
+    const left = value.startsWith(':');
+    const right = value.endsWith(':');
+    if (left && right) return 'center';
+    if (right) return 'right';
+    if (left) return 'left';
+    return '';
+  });
+}
+
+function looksLikeImageUrl(url) {
+  const value = String(url || '').trim();
+  if (!value) return false;
+  if (/^data:image\//i.test(value)) return true;
+  const cleaned = value.split('#')[0].split('?')[0];
+  return /\.(png|jpe?g|gif|webp|bmp|svg|avif)$/i.test(cleaned);
+}
+
+function normalizeRenderableUrl(url) {
+  const value = String(url || '').trim();
+  if (!value) return '';
+
+  if (
+    /^\/(?:tmp|private|var|Users|home|Volumes)\//.test(value)
+    || /^\/var\/folders\//.test(value)
+  ) {
+    return `file://${value}`;
+  }
+
+  return value;
+}
+
+function isSafeRenderableUrl(url) {
+  const value = String(url || '').trim();
+  if (!value) return false;
+  return (
+    /^https?:\/\//i.test(value)
+    || /^file:\/\//i.test(value)
+    || /^blob:/i.test(value)
+    || /^data:image\//i.test(value)
+    || /^(?:\.{1,2}\/)/.test(value)
+    || /^\/(?!\/)/.test(value)
+  );
+}
+
+function createMarkdownTokenStore() {
+  const tokens = [];
+  return {
+    put(html) {
+      const key = `__BF_INLINE_TOKEN_${tokens.length}__`;
+      tokens.push({ key, html });
+      return key;
+    },
+    apply(text) {
+      let output = text;
+      for (const token of tokens) {
+        output = output.replaceAll(token.key, token.html);
+      }
+      return output;
+    },
+  };
+}
+
 export function renderInlineContent(value) {
-  return escapeHtml(value)
-    .replace(/`([^`]+)`/g, '<code>$1</code>')
-    .replace(/\*\*([^*]+)\*\*/g, '<strong>$1</strong>');
+  const store = createMarkdownTokenStore();
+  const source = String(value ?? '');
+
+  const withCodeTokens = source.replace(/`([^`\n]+)`/g, (_match, codeRaw) => (
+    store.put(`<code>${escapeHtml(codeRaw)}</code>`)
+  ));
+
+  const withImageAndLinks = withCodeTokens.replace(/(!)?\[([^\]]*)\]\(([^)]+)\)/g, (match, imageMark, labelRaw, urlRaw) => {
+    const normalizedUrl = normalizeRenderableUrl(urlRaw);
+    if (!isSafeRenderableUrl(normalizedUrl)) return match;
+
+    const href = escapeHtml(normalizedUrl);
+    if (imageMark || looksLikeImageUrl(urlRaw)) {
+      const altText = String(labelRaw || '').trim() || 'Screenshot';
+      const alt = escapeHtml(altText);
+      return store.put(
+        `<a class="inline-image-link" href="${href}" target="_blank" rel="noopener noreferrer"><img class="inline-image" src="${href}" alt="${alt}" loading="lazy"></a>`,
+      );
+    }
+
+    const label = escapeHtml(String(labelRaw || '').trim() || normalizedUrl);
+    return store.put(`<a class="inline-link" href="${href}" target="_blank" rel="noopener noreferrer">${label}</a>`);
+  });
+
+  const withAutolinks = withImageAndLinks.replace(/(^|[\s(>])((https?:\/\/[^\s<]+))/g, (match, prefix, urlRaw) => {
+    const normalizedUrl = normalizeRenderableUrl(urlRaw);
+    if (!isSafeRenderableUrl(normalizedUrl)) return match;
+    const href = escapeHtml(normalizedUrl);
+    const label = escapeHtml(urlRaw);
+    return `${prefix}${store.put(`<a class="inline-link" href="${href}" target="_blank" rel="noopener noreferrer">${label}</a>`)}`;
+  });
+
+  const escaped = escapeHtml(withAutolinks);
+  const withEmphasis = escaped
+    .replace(/\*\*([^*]+)\*\*/g, '<strong>$1</strong>')
+    .replace(/~~([^~]+)~~/g, '<del>$1</del>')
+    .replace(/(^|[^\*])\*([^*\n]+)\*(?!\*)/g, '$1<em>$2</em>');
+
+  return store.apply(withEmphasis);
+}
+
+function isMarkdownBlockStarter(line, nextLine = '') {
+  if (!String(line || '').trim()) return true;
+  if (matchFencedBlockStart(line)) return true;
+  if (isMarkdownHeading(line)) return true;
+  if (isMarkdownHorizontalRule(line)) return true;
+  if (isMarkdownBlockquote(line)) return true;
+  if (matchMarkdownListItem(line)) return true;
+  if (String(line || '').includes('|') && isMarkdownTableSeparator(nextLine)) return true;
+  return false;
+}
+
+function renderMarkdownListBlock(lines, startIndex) {
+  let index = startIndex;
+  let html = '';
+  let currentType = null;
+  let items = [];
+
+  const flush = () => {
+    if (!items.length || !currentType) return;
+    const tag = currentType === 'ol' ? 'ol' : 'ul';
+    const itemsHtml = items.map((item) => {
+      const task = String(item || '').match(/^\[( |x|X)\]\s+([\s\S]+)$/);
+      if (!task) {
+        return `<li>${renderInlineContent(item).replace(/\n/g, '<br>')}</li>`;
+      }
+      const checked = String(task[1]).toLowerCase() === 'x';
+      return `
+        <li class="md-task-item">
+          <span class="md-task-box${checked ? ' checked' : ''}" aria-hidden="true"></span>
+          <span>${renderInlineContent(task[2]).replace(/\n/g, '<br>')}</span>
+        </li>
+      `;
+    }).join('');
+    html += `<${tag} class="md-list">${itemsHtml}</${tag}>`;
+    items = [];
+  };
+
+  while (index < lines.length) {
+    const line = lines[index];
+    if (!String(line || '').trim()) break;
+    const item = matchMarkdownListItem(line);
+    if (item) {
+      const nextType = item.ordered ? 'ol' : 'ul';
+      if (currentType && currentType !== nextType) {
+        flush();
+      }
+      currentType = nextType;
+      let itemText = item.content;
+      index += 1;
+      while (index < lines.length) {
+        const continuation = lines[index];
+        if (!String(continuation || '').trim()) break;
+        if (matchMarkdownListItem(continuation)) break;
+        if (isMarkdownBlockStarter(continuation, lines[index + 1])) break;
+        if (/^\s{2,}\S/.test(continuation)) {
+          itemText += `\n${continuation.trim()}`;
+          index += 1;
+          continue;
+        }
+        break;
+      }
+      items.push(itemText);
+      continue;
+    }
+    break;
+  }
+
+  flush();
+  return { html, nextIndex: index };
+}
+
+function renderMarkdownTableBlock(lines, startIndex) {
+  const headerCells = splitMarkdownTableRow(lines[startIndex]);
+  const alignments = parseMarkdownTableAlignments(lines[startIndex + 1]);
+  let index = startIndex + 2;
+  const bodyRows = [];
+
+  while (index < lines.length) {
+    const line = String(lines[index] || '');
+    if (!line.trim() || !line.includes('|')) break;
+    bodyRows.push(splitMarkdownTableRow(line));
+    index += 1;
+  }
+
+  const headHtml = `<tr>${headerCells.map((cell, cellIndex) => {
+    const align = alignments[cellIndex] ? ` style="text-align:${alignments[cellIndex]};"` : '';
+    return `<th${align}>${renderInlineContent(cell)}</th>`;
+  }).join('')}</tr>`;
+
+  const bodyHtml = bodyRows.map((row) => (
+    `<tr>${row.map((cell, cellIndex) => {
+      const align = alignments[cellIndex] ? ` style="text-align:${alignments[cellIndex]};"` : '';
+      return `<td${align}>${renderInlineContent(cell)}</td>`;
+    }).join('')}</tr>`
+  )).join('');
+
+  return {
+    html: `<table class="md-table"><thead>${headHtml}</thead>${bodyRows.length ? `<tbody>${bodyHtml}</tbody>` : ''}</table>`,
+    nextIndex: index,
+  };
+}
+
+function renderMarkdownBlocks(source) {
+  const normalized = String(source ?? '').replace(/\r\n?/g, '\n');
+  const lines = normalized.split('\n');
+  const chunks = [];
+  let index = 0;
+
+  while (index < lines.length) {
+    const line = String(lines[index] || '');
+    const nextLine = String(lines[index + 1] || '');
+    if (!line.trim()) {
+      index += 1;
+      continue;
+    }
+
+    const fenced = matchFencedBlockStart(line);
+    if (fenced) {
+      const codeLines = [];
+      index += 1;
+      while (index < lines.length && !isFencedBlockClose(lines[index], fenced.fence)) {
+        codeLines.push(lines[index]);
+        index += 1;
+      }
+      if (index < lines.length) index += 1;
+      const language = /^[a-z0-9_-]{1,32}$/i.test(fenced.language) ? fenced.language : '';
+      const className = language ? ` class="language-${escapeHtml(language)}"` : '';
+      chunks.push(`<pre class="md-pre"><code${className}>${escapeHtml(codeLines.join('\n'))}</code></pre>`);
+      continue;
+    }
+
+    const heading = line.match(/^\s{0,3}(#{1,6})\s+(.+?)\s*#*\s*$/);
+    if (heading) {
+      const level = Math.min(6, heading[1].length);
+      chunks.push(`<h${level} class="md-h${level}">${renderInlineContent(heading[2])}</h${level}>`);
+      index += 1;
+      continue;
+    }
+
+    if (isMarkdownHorizontalRule(line)) {
+      chunks.push('<hr class="md-hr">');
+      index += 1;
+      continue;
+    }
+
+    if (isMarkdownBlockquote(line)) {
+      const quoteLines = [];
+      while (index < lines.length && isMarkdownBlockquote(lines[index])) {
+        quoteLines.push(String(lines[index] || '').replace(/^\s{0,3}>\s?/, ''));
+        index += 1;
+      }
+      const quoteInner = renderMarkdownBlocks(quoteLines.join('\n'));
+      chunks.push(`<blockquote class="md-blockquote">${quoteInner || '<p></p>'}</blockquote>`);
+      continue;
+    }
+
+    if (matchMarkdownListItem(line)) {
+      const list = renderMarkdownListBlock(lines, index);
+      if (list.html) chunks.push(list.html);
+      index = list.nextIndex;
+      continue;
+    }
+
+    if (line.includes('|') && isMarkdownTableSeparator(nextLine)) {
+      const table = renderMarkdownTableBlock(lines, index);
+      chunks.push(table.html);
+      index = table.nextIndex;
+      continue;
+    }
+
+    const paragraphLines = [line];
+    index += 1;
+    while (index < lines.length) {
+      const candidate = String(lines[index] || '');
+      const candidateNext = String(lines[index + 1] || '');
+      if (!candidate.trim()) break;
+      if (isMarkdownBlockStarter(candidate, candidateNext)) break;
+      paragraphLines.push(candidate);
+      index += 1;
+    }
+    const paragraphText = paragraphLines.join('\n');
+    chunks.push(`<p>${renderInlineContent(paragraphText).replace(/\n/g, '<br>')}</p>`);
+  }
+
+  return chunks.join('');
+}
+
+export function renderMarkdownContent(value) {
+  const html = renderMarkdownBlocks(value);
+  if (!html) return '';
+  return `<div class="md-content">${html}</div>`;
 }
 
 export function getLatestInFlightStepIndex(run = {}) {
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 0ab0a28..54b7a11 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -370,6 +370,144 @@ body {
   word-break: break-word;
 }
 
+.bubble-assistant .md-content {
+  display: flex;
+  flex-direction: column;
+  gap: 8px;
+}
+
+.bubble-assistant .md-content p {
+  margin: 0;
+}
+
+.bubble-assistant .md-content h1,
+.bubble-assistant .md-content h2,
+.bubble-assistant .md-content h3,
+.bubble-assistant .md-content h4,
+.bubble-assistant .md-content h5,
+.bubble-assistant .md-content h6 {
+  color: var(--text);
+  line-height: 1.28;
+  margin: 0;
+}
+
+.bubble-assistant .md-content .md-h1 { font-size: 18px; font-weight: 700; }
+.bubble-assistant .md-content .md-h2 { font-size: 16px; font-weight: 700; }
+.bubble-assistant .md-content .md-h3 { font-size: 14px; font-weight: 650; }
+.bubble-assistant .md-content .md-h4,
+.bubble-assistant .md-content .md-h5,
+.bubble-assistant .md-content .md-h6 { font-size: 13px; font-weight: 650; }
+
+.bubble-assistant .md-content .md-list {
+  margin: 0;
+  padding-left: 18px;
+  display: flex;
+  flex-direction: column;
+  gap: 4px;
+}
+
+.bubble-assistant .md-content .md-list li {
+  font-size: 13.5px;
+  line-height: 1.55;
+}
+
+.bubble-assistant .md-content .md-task-item {
+  list-style: none;
+  margin-left: -18px;
+  display: flex;
+  align-items: flex-start;
+  gap: 8px;
+}
+
+.bubble-assistant .md-content .md-task-box {
+  width: 13px;
+  height: 13px;
+  margin-top: 3px;
+  border-radius: 4px;
+  border: 1px solid var(--line);
+  background: #fff;
+  flex-shrink: 0;
+}
+
+.bubble-assistant .md-content .md-task-box.checked {
+  border-color: var(--ok);
+  background: var(--ok);
+  position: relative;
+}
+
+.bubble-assistant .md-content .md-task-box.checked::after {
+  content: '';
+  position: absolute;
+  left: 3px;
+  top: 1px;
+  width: 4px;
+  height: 7px;
+  border-right: 2px solid #fff;
+  border-bottom: 2px solid #fff;
+  transform: rotate(40deg);
+}
+
+.bubble-assistant .md-content .md-blockquote {
+  margin: 0;
+  padding: 6px 10px;
+  border-left: 3px solid var(--line);
+  background: var(--linen);
+  border-radius: 0 8px 8px 0;
+  display: flex;
+  flex-direction: column;
+  gap: 6px;
+}
+
+.bubble-assistant .md-content .md-pre {
+  margin: 0;
+  padding: 10px 12px;
+  border-radius: 10px;
+  border: 1px solid var(--line);
+  background: #f6f4ef;
+  overflow-x: auto;
+}
+
+.bubble-assistant .md-content .md-pre code {
+  background: transparent;
+  border: 0;
+  border-radius: 0;
+  padding: 0;
+  font-size: 11.5px;
+  line-height: 1.55;
+  color: var(--text);
+  white-space: pre;
+}
+
+.bubble-assistant .md-content .md-table {
+  width: 100%;
+  border-collapse: collapse;
+  table-layout: fixed;
+  border: 1px solid var(--line);
+  border-radius: 8px;
+  overflow: hidden;
+  font-size: 12px;
+}
+
+.bubble-assistant .md-content .md-table th,
+.bubble-assistant .md-content .md-table td {
+  border: 1px solid var(--line);
+  padding: 6px 8px;
+  vertical-align: top;
+  overflow-wrap: anywhere;
+  word-break: break-word;
+}
+
+.bubble-assistant .md-content .md-table th {
+  background: var(--linen);
+  font-weight: 600;
+}
+
+.bubble-assistant .md-content .md-hr {
+  border: 0;
+  border-top: 1px solid var(--line);
+  margin: 2px 0;
+}
+
 .bubble-assistant code {
   font-family: 'SF Mono', 'Fira Code', 'Cascadia Code', monospace;
   font-size: 11.5px;
@@ -383,6 +521,35 @@ body {
   word-break: break-word;
 }
 
+.inline-link {
+  color: var(--crail-dark);
+  text-decoration: underline;
+  text-underline-offset: 2px;
+}
+
+.inline-link:hover {
+  color: var(--crail-press);
+}
+
+.inline-image-link {
+  display: block;
+  width: 100%;
+  margin-top: 6px;
+  border-radius: 10px;
+  overflow: hidden;
+  border: 1px solid var(--line);
+  background: #fff;
+}
+
+.inline-image {
+  display: block;
+  width: 100%;
+  height: auto;
+  max-height: 280px;
+  object-fit: contain;
+  background: var(--linen);
+}
+
 .run-timeline {
   display: flex;
   flex-direction: column;
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 66b59d1..1cb0f72 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -5,6 +5,7 @@ import {
   clearSessionRunId,
   formatContextUsage,
   getSessionRunId,
+  renderMarkdownContent,
   renderInlineContent,
   shouldApplySessionSelection,
 } from './agent-panel-runtime.js';
@@ -662,7 +663,7 @@ function renderRunTimeline(run, fallbackText = '') {
     <div class="run-timeline">
       ${timeline.map((entry, index) => {
     if (entry.type === 'text') {
-      return `<div class="bubble-assistant"><p>${renderContent(entry.text || '')}</p></div>`;
+      return `<div class="bubble-assistant">${renderContent(entry.text || '')}</div>`;
     }
     const status = entry?.status || 'running';
     const normalizedStatus = String(status || '').toLowerCase();
@@ -716,7 +717,7 @@ function renderRunTimeline(run, fallbackText = '') {
 }
 
 function renderContent(value) {
-  return renderInlineContent(value);
+  return renderMarkdownContent(value);
 }
 
 function bindTranscriptHandlers() {
@@ -769,7 +770,7 @@ function renderTranscript({ preserveScrollTop = null } = {}) {
 
     const messageRun = msg.runId ? state.value.runs[msg.runId] : null;
     const timelineHtml = renderRunTimeline(messageRun, msg.text || '');
-    const fallbackHtml = `<div class="bubble-assistant"><p>${renderContent(msg.text || '')}</p></div>`;
+    const fallbackHtml = `<div class="bubble-assistant">${renderContent(msg.text || '')}</div>`;
     return `
       <article class="message assistant">
         <div class="msg-meta"><span class="msg-author">BrowserForce</span></div>
diff --git a/test/agent/agent-panel-runtime.test.js b/test/agent/agent-panel-runtime.test.js
index 7a613cf..346b789 100644
--- a/test/agent/agent-panel-runtime.test.js
+++ b/test/agent/agent-panel-runtime.test.js
@@ -7,6 +7,7 @@ import {
   formatContextUsage,
   getLatestInFlightStepIndex,
   getSessionRunId,
+  renderMarkdownContent,
   renderInlineContent,
   shouldApplySessionSelection,
 } from '../../extension/agent-panel-runtime.js';
@@ -61,6 +62,73 @@ test('renders safe inline markdown for bold and code spans', () => {
   );
 });
 
+test('renders screenshot markdown links as image previews', () => {
+  const rendered = renderInlineContent('- Screenshot saved: [shopify-direct-1772647808095.png](/tmp/shopify-direct-1772647808095.png)');
+  assert.match(rendered, /Screenshot saved:/);
+  assert.match(rendered, /class="inline-image-link"/);
+  assert.match(rendered, /class="inline-image"/);
+  assert.match(rendered, /src="file:\/\/\/tmp\/shopify-direct-1772647808095\.png"/);
+});
+
+test('renders non-image markdown links as clickable anchors', () => {
+  const rendered = renderInlineContent('Open [BrowserForce](https://github.com/ivalsaraj/browserforce)');
+  assert.match(rendered, /class="inline-link"/);
+  assert.match(rendered, /href="https:\/\/github\.com\/ivalsaraj\/browserforce"/);
+});
+
+test('does not render unsafe markdown link protocols as HTML anchors', () => {
+  const rendered = renderInlineContent('[bad](javascript:alert(1))');
+  assert.equal(rendered, '[bad](javascript:alert(1))');
+});
+
+test('renders markdown blocks for headings, emphasis, list, quote, and hr', () => {
+  const rendered = renderMarkdownContent([
+    '# Heading',
+    '',
+    'Paragraph with *italic*, **bold**, and ~~strike~~.',
+    '',
+    '- Item one',
+    '- [x] Done task',
+    '',
+    '> quoted line',
+    '',
+    '---',
+  ].join('\n'));
+  assert.match(rendered, /class="md-content"/);
+  assert.match(rendered, /class="md-h1"/);
+  assert.match(rendered, /<em>italic<\/em>/);
+  assert.match(rendered, /<strong>bold<\/strong>/);
+  assert.match(rendered, /<del>strike<\/del>/);
+  assert.match(rendered, /class="md-list"/);
+  assert.match(rendered, /class="md-task-item"/);
+  assert.match(rendered, /class="md-blockquote"/);
+  assert.match(rendered, /class="md-hr"/);
+});
+
+test('renders fenced code blocks and table markdown', () => {
+  const rendered = renderMarkdownContent([
+    '```js',
+    'const ok = true;',
+    '```',
+    '',
+    '| Name | Value |',
+    '| :--- | ---: |',
+    '| foo | 42 |',
+  ].join('\n'));
+  assert.match(rendered, /class="md-pre"/);
+  assert.match(rendered, /language-js/);
+  assert.match(rendered, /const ok = true;/);
+  assert.match(rendered, /class="md-table"/);
+  assert.match(rendered, /<th style="text-align:left;">Name<\/th>/);
+  assert.match(rendered, /<th style="text-align:right;">Value<\/th>/);
+});
+
+test('escapes raw html inside markdown blocks', () => {
+  const rendered = renderMarkdownContent('Text <script>alert(1)</script>');
+  assert.match(rendered, /&lt;script&gt;alert\(1\)&lt;\/script&gt;/);
+  assert.doesNotMatch(rendered, /<script>/);
+});
+
 test('tracks latest step index for active runs only', () => {
   assert.equal(getLatestInFlightStepIndex({ done: false, steps: [{}, {}, {}] }), 2);
   assert.equal(getLatestInFlightStepIndex({ done: true, steps: [{}, {}] }), -1);
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 8fdec08..51b011f 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -70,6 +70,12 @@ test('assistant transcript prefers ordered run timeline over grouped run steps',
   assert.match(js, /const timelineHtml = renderRunTimeline\(messageRun, msg\.text \|\| ''\)/);
 });
 
+test('assistant transcript renders message bodies with markdown block renderer', () => {
+  assert.match(js, /renderMarkdownContent/);
+  assert.match(js, /function renderContent\(value\)\s*\{\s*return renderMarkdownContent\(value\);\s*\}/);
+  assert.match(js, /<div class="bubble-assistant">\$\{renderContent\(entry\.text \|\| ''\)\}<\/div>/);
+});
+
 test('context usage renderer hides element when unavailable and only shows formatted values', () => {
   assert.match(js, /function renderContextUsageChip\(\)/);
   assert.match(js, /latestUsageBySession/);

From c845eb13ac116e635424713e75b2473e2cb6c2aa Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Wed, 4 Mar 2026 23:56:06 +0530
Subject: [PATCH 176/192] agent-panel: render local screenshot images via
 authenticated chatd endpoint

- add /v1/local-file route in chatd to stream local image files (auth required, image types only, size capped)

- render local markdown image paths as local-image placeholders instead of file:// URLs

- hydrate placeholders in side panel by fetching blobs with bearer auth and assigning object URLs

- add loading shimmer style and tests for endpoint + runtime + panel integration
---
 agent/src/chatd.js                           | 61 ++++++++++++++++++-
 extension/agent-panel-runtime.js             | 27 +++++----
 extension/agent-panel.css                    | 25 +++++++-
 extension/agent-panel.js                     | 63 ++++++++++++++++++++
 test/agent/agent-panel-runtime.test.js       |  7 ++-
 test/agent/agent-panel-send-contract.test.js | 10 ++++
 test/agent/chatd-api.test.js                 | 39 +++++++++++-
 7 files changed, 216 insertions(+), 16 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index a67b965..4c28472 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -3,7 +3,7 @@ import { spawn } from 'node:child_process';
 import { randomBytes } from 'node:crypto';
 import { promises as fs } from 'node:fs';
 import { homedir, tmpdir } from 'node:os';
-import { dirname, join } from 'node:path';
+import { dirname, extname, join } from 'node:path';
 import { fileURLToPath } from 'node:url';
 
 import { pickChatdPort } from './port-resolver.js';
@@ -26,6 +26,17 @@ const CHATD_URL_PATH = join(BF_DIR, 'chatd-url.json');
 const CODEX_CONFIG_PATH = join(homedir(), '.codex', 'config.toml');
 const MODEL_LIST_TIMEOUT_MS = 5000;
 const DEFAULT_REASONING_EFFORT = 'medium';
+const LOCAL_FILE_MAX_BYTES = 15 * 1024 * 1024;
+const LOCAL_IMAGE_CONTENT_TYPES = {
+  '.png': 'image/png',
+  '.jpg': 'image/jpeg',
+  '.jpeg': 'image/jpeg',
+  '.gif': 'image/gif',
+  '.webp': 'image/webp',
+  '.bmp': 'image/bmp',
+  '.svg': 'image/svg+xml',
+  '.avif': 'image/avif',
+};
 
 function parseTopLevelTomlString(raw, key) {
   const lines = String(raw || '').split(/\r?\n/);
@@ -258,6 +269,17 @@ function safeDecodeComponent(value) {
   }
 }
 
+function normalizeLocalFilePath(value) {
+  const path = String(value || '').trim();
+  if (!path || !path.startsWith('/') || path.startsWith('//') || path.includes('\0')) return null;
+  return path;
+}
+
+function localImageContentTypeForPath(path) {
+  const extension = extname(String(path || '')).toLowerCase();
+  return LOCAL_IMAGE_CONTENT_TYPES[extension] || null;
+}
+
 function sanitizeContextText(value, maxLen = 320) {
   if (value == null) return '';
   const normalized = String(value).replace(/\s+/g, ' ').trim();
@@ -1086,6 +1108,43 @@ export async function startChatd(opts = {}) {
         }
       }
 
+      if (url.pathname === '/v1/local-file' && req.method === 'GET') {
+        const localPath = normalizeLocalFilePath(url.searchParams.get('path'));
+        if (!localPath) {
+          json(res, 400, { error: 'path is required' });
+          return;
+        }
+
+        const contentType = localImageContentTypeForPath(localPath);
+        if (!contentType) {
+          json(res, 415, { error: 'Unsupported file type' });
+          return;
+        }
+
+        let fileStat;
+        try {
+          fileStat = await fs.stat(localPath);
+        } catch {
+          json(res, 404, { error: 'File not found' });
+          return;
+        }
+        if (!fileStat?.isFile?.()) {
+          json(res, 404, { error: 'File not found' });
+          return;
+        }
+        if (fileStat.size > LOCAL_FILE_MAX_BYTES) {
+          json(res, 413, { error: 'File too large' });
+          return;
+        }
+
+        const data = await fs.readFile(localPath);
+        res.statusCode = 200;
+        res.setHeader('content-type', contentType);
+        res.setHeader('cache-control', 'no-store');
+        res.end(data);
+        return;
+      }
+
       if (url.pathname === '/v1/sessions' && req.method === 'GET') {
         const sessions = await listSessions({ storageRoot });
         json(res, 200, { sessions });
diff --git a/extension/agent-panel-runtime.js b/extension/agent-panel-runtime.js
index 8faa89a..a52546b 100644
--- a/extension/agent-panel-runtime.js
+++ b/extension/agent-panel-runtime.js
@@ -118,17 +118,13 @@ function looksLikeImageUrl(url) {
 function normalizeRenderableUrl(url) {
   const value = String(url || '').trim();
   if (!value) return '';
-
-  if (
-    /^\/(?:tmp|private|var|Users|home|Volumes)\//.test(value)
-    || /^\/var\/folders\//.test(value)
-  ) {
-    return `file://${value}`;
-  }
-
   return value;
 }
 
+function isLocalAbsolutePath(value) {
+  return /^\/(?!\/)/.test(String(value || '').trim());
+}
+
 function isSafeRenderableUrl(url) {
   const value = String(url || '').trim();
   if (!value) return false;
@@ -170,17 +166,28 @@ export function renderInlineContent(value) {
 
   const withImageAndLinks = withCodeTokens.replace(/(!)?\[([^\]]*)\]\(([^)]+)\)/g, (match, imageMark, labelRaw, urlRaw) => {
     const normalizedUrl = normalizeRenderableUrl(urlRaw);
-    if (!isSafeRenderableUrl(normalizedUrl)) return match;
+    const localAbsolutePath = isLocalAbsolutePath(normalizedUrl);
+    if (!localAbsolutePath && !isSafeRenderableUrl(normalizedUrl)) return match;
 
-    const href = escapeHtml(normalizedUrl);
     if (imageMark || looksLikeImageUrl(urlRaw)) {
       const altText = String(labelRaw || '').trim() || 'Screenshot';
       const alt = escapeHtml(altText);
+      if (localAbsolutePath) {
+        const localPath = escapeHtml(normalizedUrl);
+        return store.put(
+          `<span class="inline-image-link local-image" data-local-path="${localPath}"><img class="inline-image inline-local-image" data-local-path="${localPath}" alt="${alt}" loading="lazy"></span>`,
+        );
+      }
+
+      const href = escapeHtml(normalizedUrl);
       return store.put(
         `<a class="inline-image-link" href="${href}" target="_blank" rel="noopener noreferrer"><img class="inline-image" src="${href}" alt="${alt}" loading="lazy"></a>`,
       );
     }
 
+    if (localAbsolutePath) return match;
+
+    const href = escapeHtml(normalizedUrl);
     const label = escapeHtml(String(labelRaw || '').trim() || normalizedUrl);
     return store.put(`<a class="inline-link" href="${href}" target="_blank" rel="noopener noreferrer">${label}</a>`);
   });
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 54b7a11..43245fd 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -541,6 +541,10 @@ body {
   background: #fff;
 }
 
+.inline-image-link.local-image {
+  cursor: default;
+}
+
 .inline-image {
   display: block;
   width: 100%;
@@ -550,6 +554,15 @@ body {
   background: var(--linen);
 }
 
+.inline-local-image:not([src]) {
+  min-height: 120px;
+  background:
+    linear-gradient(110deg, rgba(0, 0, 0, 0.03) 8%, rgba(0, 0, 0, 0.08) 18%, rgba(0, 0, 0, 0.03) 33%),
+    var(--linen);
+  background-size: 200% 100%, auto;
+  animation: local-image-loading 1.2s linear infinite;
+}
+
 .run-timeline {
   display: flex;
   flex-direction: column;
@@ -1326,11 +1339,21 @@ body {
   }
 }
 
+@keyframes local-image-loading {
+  0% {
+    background-position: 100% 0, 0 0;
+  }
+  100% {
+    background-position: -100% 0, 0 0;
+  }
+}
+
 @media (prefers-reduced-motion: reduce) {
   .step-item.pulse .run-step-icon,
   .step-label.title-label.shimmer-text,
   .step-label.title-label.title-transition-in,
-  .step-label.title-label.shimmer-text.title-transition-in {
+  .step-label.title-label.shimmer-text.title-transition-in,
+  .inline-local-image:not([src]) {
     animation: none;
   }
 }
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 1cb0f72..f0f1d63 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -33,6 +33,8 @@ const state = {
   agentOpenRequestWatcherBound: false,
   lastHandledAgentOpenRequestId: null,
   pendingAgentOpenRequest: null,
+  localImageBlobUrlByPath: {},
+  localImageLoadsByPath: {},
   initialTabAttachInFlight: false,
   initialTabAttachStarted: false,
   editingSessionId: null,
@@ -720,6 +722,66 @@ function renderContent(value) {
   return renderMarkdownContent(value);
 }
 
+async function loadLocalImageBlobUrl(localPath) {
+  const path = String(localPath || '').trim();
+  if (!path || !state.auth?.baseUrl || !state.auth?.token) return null;
+  if (state.localImageBlobUrlByPath[path]) return state.localImageBlobUrlByPath[path];
+  if (state.localImageLoadsByPath[path]) return state.localImageLoadsByPath[path];
+
+  const loadPromise = (async () => {
+    const url = `${state.auth.baseUrl}/v1/local-file?path=${encodeURIComponent(path)}`;
+    const response = await fetch(url, {
+      method: 'GET',
+      headers: {
+        authorization: `Bearer ${state.auth.token}`,
+      },
+    });
+    if (!response.ok) return null;
+    const blob = await response.blob();
+    const blobUrl = URL.createObjectURL(blob);
+    state.localImageBlobUrlByPath = {
+      ...(state.localImageBlobUrlByPath || {}),
+      [path]: blobUrl,
+    };
+    return blobUrl;
+  })().finally(() => {
+    const nextLoads = { ...(state.localImageLoadsByPath || {}) };
+    delete nextLoads[path];
+    state.localImageLoadsByPath = nextLoads;
+  });
+
+  state.localImageLoadsByPath = {
+    ...(state.localImageLoadsByPath || {}),
+    [path]: loadPromise,
+  };
+  return loadPromise;
+}
+
+function hydrateLocalImagePreviews() {
+  if (!transcriptEl) return;
+  const imageNodes = transcriptEl.querySelectorAll('img.inline-local-image[data-local-path]');
+  for (const node of imageNodes) {
+    const localPath = String(node.getAttribute('data-local-path') || '').trim();
+    if (!localPath) continue;
+    const cached = state.localImageBlobUrlByPath?.[localPath];
+    if (cached) {
+      if (node.getAttribute('src') !== cached) node.setAttribute('src', cached);
+      continue;
+    }
+    loadLocalImageBlobUrl(localPath)
+      .then((blobUrl) => {
+        if (!blobUrl) return;
+        transcriptEl.querySelectorAll('img.inline-local-image[data-local-path]').forEach((img) => {
+          if (String(img.getAttribute('data-local-path') || '').trim() !== localPath) return;
+          img.setAttribute('src', blobUrl);
+        });
+      })
+      .catch(() => {
+        // best-effort preview only
+      });
+  }
+}
+
 function bindTranscriptHandlers() {
   if (state.transcriptHandlersBound) return;
   transcriptEl.addEventListener('click', async (event) => {
@@ -850,6 +912,7 @@ function renderTranscript({ preserveScrollTop = null } = {}) {
   }
 
   bindTranscriptHandlers();
+  hydrateLocalImagePreviews();
   if (Number.isFinite(preserveScrollTop)) {
     transcriptEl.scrollTop = preserveScrollTop;
   } else {
diff --git a/test/agent/agent-panel-runtime.test.js b/test/agent/agent-panel-runtime.test.js
index 346b789..7091e1b 100644
--- a/test/agent/agent-panel-runtime.test.js
+++ b/test/agent/agent-panel-runtime.test.js
@@ -65,9 +65,10 @@ test('renders safe inline markdown for bold and code spans', () => {
 test('renders screenshot markdown links as image previews', () => {
   const rendered = renderInlineContent('- Screenshot saved: [shopify-direct-1772647808095.png](/tmp/shopify-direct-1772647808095.png)');
   assert.match(rendered, /Screenshot saved:/);
-  assert.match(rendered, /class="inline-image-link"/);
-  assert.match(rendered, /class="inline-image"/);
-  assert.match(rendered, /src="file:\/\/\/tmp\/shopify-direct-1772647808095\.png"/);
+  assert.match(rendered, /class="inline-image-link local-image"/);
+  assert.match(rendered, /inline-local-image/);
+  assert.match(rendered, /data-local-path="\/tmp\/shopify-direct-1772647808095\.png"/);
+  assert.doesNotMatch(rendered, /file:\/\/\/tmp\//);
 });
 
 test('renders non-image markdown links as clickable anchors', () => {
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 51b011f..52937ff 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -76,6 +76,16 @@ test('assistant transcript renders message bodies with markdown block renderer',
   assert.match(js, /<div class="bubble-assistant">\$\{renderContent\(entry\.text \|\| ''\)\}<\/div>/);
 });
 
+test('local screenshot markdown images hydrate through authenticated chatd fetch', () => {
+  assert.match(js, /function loadLocalImageBlobUrl\(localPath\)/);
+  assert.match(js, /\/v1\/local-file\?path=/);
+  assert.match(js, /authorization:\s*`Bearer \$\{state\.auth\.token\}`/);
+  assert.match(js, /URL\.createObjectURL\(blob\)/);
+  assert.match(js, /function hydrateLocalImagePreviews\(\)/);
+  assert.match(js, /img\.inline-local-image\[data-local-path\]/);
+  assert.match(js, /hydrateLocalImagePreviews\(\);/);
+});
+
 test('context usage renderer hides element when unavailable and only shows formatted values', () => {
   assert.match(js, /function renderContextUsageChip\(\)/);
   assert.match(js, /latestUsageBySession/);
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index eff31e1..5dd2ad7 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -1,6 +1,6 @@
 import test from 'node:test';
 import assert from 'node:assert/strict';
-import { existsSync, mkdtempSync, rmSync } from 'node:fs';
+import { existsSync, mkdtempSync, rmSync, writeFileSync } from 'node:fs';
 import { tmpdir } from 'node:os';
 import { join } from 'node:path';
 import { startChatd } from '../../agent/src/chatd.js';
@@ -106,6 +106,43 @@ test('GET /v1/sessions/:id/messages rejects malformed encoded id', async () => {
   }
 });
 
+test('GET /v1/local-file serves local image bytes for authenticated requests', async () => {
+  const tempDir = mkdtempSync(join(tmpdir(), 'bf-chatd-local-image-'));
+  const imagePath = join(tempDir, 'preview.png');
+  writeFileSync(imagePath, Buffer.from('89504e470d0a1a0a0000000d49484452', 'hex'));
+
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const response = await fetch(`${daemon.baseUrl}/v1/local-file?path=${encodeURIComponent(imagePath)}`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(response.status, 200);
+    assert.equal(response.headers.get('content-type'), 'image/png');
+    const body = Buffer.from(await response.arrayBuffer());
+    assert.equal(body.length > 0, true);
+  } finally {
+    await daemon.stop();
+    rmSync(tempDir, { recursive: true, force: true });
+  }
+});
+
+test('GET /v1/local-file rejects unsupported extensions', async () => {
+  const tempDir = mkdtempSync(join(tmpdir(), 'bf-chatd-local-text-'));
+  const textPath = join(tempDir, 'note.txt');
+  writeFileSync(textPath, 'not-an-image', 'utf8');
+
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const response = await fetch(`${daemon.baseUrl}/v1/local-file?path=${encodeURIComponent(textPath)}`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(response.status, 415);
+  } finally {
+    await daemon.stop();
+    rmSync(tempDir, { recursive: true, force: true });
+  }
+});
+
 test('POST /v1/runs rejects unsafe sessionId', async () => {
   const daemon = await startChatd({ port: 0, writeChatdUrl: false });
   try {

From 6c7aa525f25a7107b35b9aa5d0fa2e26544ccbd9 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 00:07:47 +0530
Subject: [PATCH 177/192] fix(agent): treat zero context window as unavailable

- Normalize model context window to positive-only values in run usage parsing

- Ignore and drop model_context_window=0 when persisting session usage metadata

- Add regression coverage for token_count payloads that report zero context window
---
 agent/src/chatd.js              |  8 +++++++-
 agent/src/codex-runner.js       |  8 +++++++-
 test/agent/codex-runner.test.js | 15 +++++++++++++++
 3 files changed, 29 insertions(+), 2 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 4c28472..dcc784e 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -302,9 +302,15 @@ function normalizeUsageNumber(value) {
   return Math.round(parsed);
 }
 
+function normalizePositiveUsageNumber(value) {
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed <= 0) return null;
+  return Math.round(parsed);
+}
+
 function normalizeUsagePayload(payload) {
   if (!payload || typeof payload !== 'object') return null;
-  const modelContextWindow = normalizeUsageNumber(payload.modelContextWindow);
+  const modelContextWindow = normalizePositiveUsageNumber(payload.modelContextWindow);
   const totalTokens = normalizeUsageNumber(payload.totalTokens);
   const inputTokens = normalizeUsageNumber(payload.inputTokens);
   const cachedInputTokens = normalizeUsageNumber(payload.cachedInputTokens);
diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
index 3b57efe..2cbf9cb 100644
--- a/agent/src/codex-runner.js
+++ b/agent/src/codex-runner.js
@@ -86,13 +86,19 @@ function toCount(value) {
   return Math.round(parsed);
 }
 
+function toPositiveCount(value) {
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed <= 0) return null;
+  return Math.round(parsed);
+}
+
 function toUsagePayload(source = {}) {
   const inputTokens = toCount(source.input_tokens ?? source.inputTokens);
   const cachedInputTokens = toCount(source.cached_input_tokens ?? source.cachedInputTokens);
   const outputTokens = toCount(source.output_tokens ?? source.outputTokens);
   const reasoningOutputTokens = toCount(source.reasoning_output_tokens ?? source.reasoningOutputTokens);
   const explicitTotalTokens = toCount(source.total_tokens ?? source.totalTokens);
-  const modelContextWindow = toCount(source.model_context_window ?? source.modelContextWindow);
+  const modelContextWindow = toPositiveCount(source.model_context_window ?? source.modelContextWindow);
 
   const totalTokens = explicitTotalTokens != null
     ? explicitTotalTokens
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
index 15280a4..a78e4ef 100644
--- a/test/agent/codex-runner.test.js
+++ b/test/agent/codex-runner.test.js
@@ -174,6 +174,21 @@ test('maps codex token_count into run.usage event', () => {
   assert.equal(evt.payload.reasoningOutputTokens, 14);
 });
 
+test('treats zero model_context_window in token_count as missing context window', () => {
+  const line = JSON.stringify({
+    type: 'token_count',
+    info: {
+      total_token_usage: { input_tokens: 1000, cached_input_tokens: 700, output_tokens: 120, total_tokens: 1120 },
+      model_context_window: 0,
+      reasoning_output_tokens: 14,
+    },
+  });
+  const evt = normalizeCodexLine({ runId: 'r1', sessionId: 's1', line });
+  assert.equal(evt.event, 'run.usage');
+  assert.equal(evt.payload.modelContextWindow, null);
+  assert.equal(evt.payload.totalTokens, 1120);
+});
+
 test('maps codex thread.started provider session id event to run.provider_session', () => {
   const line = JSON.stringify({
     type: 'thread.started',

From aea3e9482555b4714eb8a2ef1acbe9e7e2cd117a Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 00:11:51 +0530
Subject: [PATCH 178/192] Use branded animated SVG for completed tool-call
 icons

- Replace static pseudo-element done icon with an inline SVG stroke-draw animation for tool-call timeline rows\n- Route timeline icon rendering through renderRunStepIcon() so done states can render custom SVG markup\n- Apply BrowserForce brand tokens to completed icon states and collapsed execute-branch done text\n- Add/extend panel contract tests to lock in SVG hooks, keyframes, and branded done-state styling
---
 extension/agent-panel.css                    | 85 +++++++++++++++-----
 extension/agent-panel.js                     | 19 ++++-
 test/agent/agent-panel-contract.test.js      | 10 +++
 test/agent/agent-panel-send-contract.test.js |  7 ++
 4 files changed, 98 insertions(+), 23 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 43245fd..9b3c8f3 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -645,7 +645,7 @@ body {
 }
 
 .step-branch-preview.done .step-branch-call {
-  color: var(--ok);
+  color: var(--crail-dark);
 }
 
 .step-item.latest .step-branch-preview.running .step-branch-call {
@@ -755,6 +755,39 @@ body {
   position: relative;
 }
 
+.run-step-icon.icon-done {
+  width: 16px;
+  height: 16px;
+  margin-top: 0;
+  color: var(--crail);
+}
+
+.run-step-icon-done-svg {
+  width: 100%;
+  height: 100%;
+  display: block;
+}
+
+.run-step-icon-done-ring {
+  stroke: var(--crail);
+  stroke-width: 3;
+  stroke-linecap: round;
+  stroke-linejoin: round;
+  stroke-dasharray: 151;
+  stroke-dashoffset: 151;
+  animation: run-step-done-ring-draw 380ms ease-out forwards;
+}
+
+.run-step-icon-done-check {
+  stroke: var(--crail-dark);
+  stroke-width: 4;
+  stroke-linecap: round;
+  stroke-linejoin: round;
+  stroke-dasharray: 36;
+  stroke-dashoffset: 36;
+  animation: run-step-done-check-draw 260ms ease-out 220ms forwards;
+}
+
 .step-item.latest .step-label {
   color: var(--text);
 }
@@ -849,25 +882,6 @@ body {
   box-shadow: 0 4px 0 currentColor, 0 8px 0 currentColor;
 }
 
-.run-step-icon.icon-done::before {
-  top: 1px;
-  left: 1px;
-  width: 12px;
-  height: 12px;
-  border: 1.5px solid currentColor;
-  border-radius: 999px;
-}
-
-.run-step-icon.icon-done::after {
-  top: 6px;
-  left: 4px;
-  width: 5px;
-  height: 3px;
-  border-left: 1.5px solid currentColor;
-  border-bottom: 1.5px solid currentColor;
-  transform: rotate(-45deg);
-}
-
 .run-step-icon.icon-failed::before {
   top: 1px;
   left: 1px;
@@ -886,7 +900,7 @@ body {
 }
 
 .step-item.done .run-step-icon {
-  color: var(--ok);
+  color: var(--crail);
 }
 
 .step-item.failed .run-step-icon {
@@ -1348,6 +1362,28 @@ body {
   }
 }
 
+@keyframes run-step-done-ring-draw {
+  from {
+    stroke-dashoffset: 151;
+    opacity: 0.35;
+  }
+  to {
+    stroke-dashoffset: 0;
+    opacity: 1;
+  }
+}
+
+@keyframes run-step-done-check-draw {
+  from {
+    stroke-dashoffset: 36;
+    opacity: 0;
+  }
+  to {
+    stroke-dashoffset: 0;
+    opacity: 1;
+  }
+}
+
 @media (prefers-reduced-motion: reduce) {
   .step-item.pulse .run-step-icon,
   .step-label.title-label.shimmer-text,
@@ -1356,4 +1392,11 @@ body {
   .inline-local-image:not([src]) {
     animation: none;
   }
+
+  .run-step-icon-done-ring,
+  .run-step-icon-done-check {
+    animation: none;
+    stroke-dashoffset: 0;
+    opacity: 1;
+  }
 }
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index f0f1d63..c2b0665 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -688,7 +688,7 @@ function renderRunTimeline(run, fallbackText = '') {
     if (isLatest) classes.push('latest');
     if (shouldPulse) classes.push('pulse');
     if (!isCollapsible) {
-      return `<div class="${classes.join(' ')}"><span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span><span class="${labelClasses.join(' ')}">${renderInlineContent(entry.label || 'Step')}</span></div>`;
+      return `<div class="${classes.join(' ')}">${renderRunStepIcon(icon)}<span class="${labelClasses.join(' ')}">${renderInlineContent(entry.label || 'Step')}</span></div>`;
     }
     classes.push('collapsible');
     const key = getTimelineEntryKey(entry, index);
@@ -700,7 +700,7 @@ function renderRunTimeline(run, fallbackText = '') {
       .join('');
     return `
       <div class="${classes.join(' ')}">
-        <span class="run-step-icon icon-${escapeHtml(icon)}" aria-hidden="true"></span>
+        ${renderRunStepIcon(icon)}
         <div class="step-body">
           <button type="button" class="step-toggle" data-step-key="${escapeHtml(key)}" aria-expanded="${expanded ? 'true' : 'false'}">
             <span class="step-toggle-main">
@@ -718,6 +718,21 @@ function renderRunTimeline(run, fallbackText = '') {
   `;
 }
 
+function renderRunStepIcon(icon) {
+  const iconName = String(icon || '').trim().toLowerCase();
+  if (iconName === 'done') {
+    return `
+      <span class="run-step-icon icon-done" aria-hidden="true">
+        <svg class="run-step-icon-done-svg" viewBox="0 0 52 52" aria-hidden="true" focusable="false">
+          <circle class="run-step-icon-done-ring" cx="26" cy="26" r="24" fill="none"></circle>
+          <path class="run-step-icon-done-check" fill="none" d="M14 27.5l8.5 8.5L38.5 19"></path>
+        </svg>
+      </span>
+    `;
+  }
+  return `<span class="run-step-icon icon-${escapeHtml(iconName)}" aria-hidden="true"></span>`;
+}
+
 function renderContent(value) {
   return renderMarkdownContent(value);
 }
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index e923306..a81e1be 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -77,6 +77,15 @@ test('reasoning title rows use shimmer and enter transition treatment', () => {
   assert.match(css, /@media\s*\(prefers-reduced-motion:\s*reduce\)/);
 });
 
+test('done step icon uses branded animated svg check treatment', () => {
+  assert.match(css, /\.run-step-icon\.icon-done/);
+  assert.match(css, /\.run-step-icon-done-svg/);
+  assert.match(css, /\.run-step-icon-done-ring/);
+  assert.match(css, /\.run-step-icon-done-check/);
+  assert.match(css, /@keyframes run-step-done-ring-draw/);
+  assert.match(css, /@keyframes run-step-done-check-draw/);
+});
+
 test('agent panel includes visible startup error empty-state treatment', () => {
   assert.match(panelJs, /state\.startupIssue = null/);
   assert.match(panelJs, /class="empty-state error-state"/);
@@ -91,6 +100,7 @@ test('collapsed execute helper preview has tree-like branch styling', () => {
   assert.match(css, /\.step-branch-node/);
   assert.match(css, /\.step-branch-node::before/);
   assert.match(css, /\.step-branch-call/);
+  assert.match(css, /\.step-branch-preview\.done \.step-branch-call[\s\S]*var\(--crail-dark\)/);
 });
 
 test('startup error card action buttons have dedicated styling hooks', () => {
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 52937ff..08e8813 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -150,6 +150,13 @@ test('tool-call timeline entries render collapsed toggle rows with click-to-expa
   assert.match(js, /closest\('button\[data-step-key\]'\)/);
 });
 
+test('done tool-call icon renders animated svg check markup', () => {
+  assert.match(js, /function renderRunStepIcon\(icon\)/);
+  assert.match(js, /run-step-icon-done-svg/);
+  assert.match(js, /run-step-icon-done-ring/);
+  assert.match(js, /run-step-icon-done-check/);
+});
+
 test('composer toggles single-line and multiline visual state from textarea height', () => {
   assert.match(js, /const composerBoxEl = chatFormEl\.querySelector\('\.composer-box'\)/);
   assert.match(js, /function syncComposerLayoutState\(\)/);

From 9fd2d5cfff263084c994c12d2221e2f8b44f1ee6 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 00:14:54 +0530
Subject: [PATCH 179/192] docs(agent): add dedicated BrowserForce Agent
 documentation

- add new docs/BROWSERFORCE_AGENT.md with setup, API, session model, config files, env vars, and troubleshooting

- add README side-panel section link to dedicated agent guide for discoverability
---
 README.md                  |   2 +
 docs/BROWSERFORCE_AGENT.md | 149 +++++++++++++++++++++++++++++++++++++
 2 files changed, 151 insertions(+)
 create mode 100644 docs/BROWSERFORCE_AGENT.md

diff --git a/README.md b/README.md
index de5b348..7245812 100644
--- a/README.md
+++ b/README.md
@@ -392,6 +392,8 @@ Each `-e` command is one-shot — state does not persist between calls. For pers
 
 BrowserForce now includes a side-panel chat UI in the Chrome extension for resumable local sessions.
 
+Dedicated guide: [`docs/BROWSERFORCE_AGENT.md`](docs/BROWSERFORCE_AGENT.md) (setup, API, config files/env vars, and troubleshooting).
+
 - Open popup -> `Open BrowserForce Agent` to open the side panel.
 - Use the session list to switch between chats; transcripts hydrate per selected `sessionId`.
 - Session identity is explicit and persisted; there is no fixed/hardcoded chat session ID.
diff --git a/docs/BROWSERFORCE_AGENT.md b/docs/BROWSERFORCE_AGENT.md
new file mode 100644
index 0000000..b4bb030
--- /dev/null
+++ b/docs/BROWSERFORCE_AGENT.md
@@ -0,0 +1,149 @@
+# BrowserForce Agent
+
+BrowserForce Agent is the local chat daemon (`chatd`) plus the Chrome extension side-panel UI.
+It gives you resumable, multi-session chat backed by Codex, while keeping data local on loopback.
+
+## What This Covers
+
+- Side-panel chat flow and session model
+- Daemon lifecycle commands
+- Agent HTTP API (`/v1/*`)
+- Config files and environment variables
+- Security boundaries and common troubleshooting
+
+## Quick Start
+
+1. Start relay:
+
+```bash
+browserforce serve
+```
+
+2. Start agent daemon:
+
+```bash
+browserforce agent start
+browserforce agent status
+```
+
+3. In the extension popup, click `Open BrowserForce Agent`.
+
+4. Send a message in the side panel.
+
+Stop daemon when needed:
+
+```bash
+browserforce agent stop
+```
+
+## Runtime Flow
+
+1. Side panel asks relay for `GET /chatd-url`.
+2. Relay validates extension origin/ID and returns `{ port, token }` from `~/.browserforce/chatd-url.json`.
+3. Side panel calls chatd directly on `127.0.0.1:<port>` with `Authorization: Bearer <token>`.
+4. Chat events stream over SSE from `/v1/events`.
+
+## Session Model
+
+- Session IDs are explicit and user-selectable. There is no fixed/hardcoded chat session.
+- Sessions persist under `~/.browserforce/agent/sessions/`.
+- BrowserForce stores Codex continuity under `providerState.codex.sessionId`.
+- New runs attempt `codex exec resume <sessionId> --json` when mapping exists.
+- If resume fails with an invalid-session signature, chatd retries once with a fresh run.
+- Usage telemetry from `run.usage` is persisted at `providerState.codex.latestUsage` and used to hydrate the context usage chip.
+
+## API Surface
+
+All `/v1/*` endpoints require `Authorization: Bearer <token>`.
+
+- `GET /health`
+  - No bearer required.
+  - Returns daemon status (`ok`, `pid`, `port`, `uptimeMs`).
+- `GET /v1/sessions`
+  - List sessions.
+- `POST /v1/sessions`
+  - Create session (`title`, optional `model`, optional `reasoningEffort`).
+- `GET /v1/sessions/:sessionId`
+  - Fetch session metadata (includes `providerState` when present).
+- `PATCH /v1/sessions/:sessionId`
+  - Update session `title`, `model`, or `reasoningEffort`.
+- `GET /v1/sessions/:sessionId/messages?limit=200`
+  - Read transcript messages.
+- `GET /v1/models`
+  - Returns available model presets and default reasoning effort.
+- `GET /v1/events?sessionId=<id>`
+  - SSE stream (`chat.delta`, `chat.final`, `run.provider_session`, `run.usage`, etc.).
+- `POST /v1/runs`
+  - Start run for `{ sessionId, message, browserContext? }`.
+- `POST /v1/runs/:runId/abort` or `DELETE /v1/runs/:runId/abort`
+  - Abort active run.
+
+## Config Files and Storage
+
+Generated and runtime files:
+
+- `~/.browserforce/chatd-url.json`
+  - Shape: `{ "port": <number>, "token": "<bearer>" }`
+  - Written with mode `0600`.
+  - Used by relay `/chatd-url` bootstrap.
+- `~/.browserforce/chatd-lock.json`
+  - Daemon lock/state (`pid`, `port`, `token`), mode `0600`.
+- `~/.browserforce/agent/sessions/index.json`
+  - Session index metadata.
+- `~/.browserforce/agent/sessions/<sessionId>.jsonl`
+  - Message/event history per session.
+
+Optional external config:
+
+- `~/.codex/config.toml`
+  - If present, chatd reads top-level:
+    - `model`
+    - `model_reasoning_effort`
+
+## Environment Variables
+
+- `BF_CHATD_PORT`
+  - Preferred daemon port. If unavailable or unset, fallback scans `19280-19320`.
+- `BF_CHATD_TOKEN`
+  - Forces bearer token instead of generated random token.
+- `BF_CHATD_URL_PATH`
+  - Overrides `chatd-url.json` path.
+- `BF_CHATD_LOCK_PATH`
+  - Overrides lock file path used by `browserforce agent start|status|stop`.
+- `BF_CHATD_CODEX_COMMAND`
+  - Codex binary/command used by chatd (default `codex`).
+- `BF_CHATD_MODEL_LIST_TIMEOUT_MS`
+  - Timeout when querying model catalog from Codex app-server.
+- `BF_CHATD_DEFAULT_MODEL`
+  - Default model override if valid.
+- `BF_CHATD_DEFAULT_REASONING_EFFORT`
+  - Default reasoning effort override (`low|medium|high|xhigh`).
+
+## Security Model
+
+- chatd binds to `127.0.0.1` only.
+- `/v1/*` requires bearer auth.
+- Origin checks:
+  - `chrome-extension://*` is allowed.
+  - localhost origins are allowed for local tooling.
+- Relay `GET /chatd-url` is extension-gated (trusted extension origin/ID must match connected extension).
+
+## Troubleshooting
+
+- `agent_not_running` in side panel:
+  - Run `browserforce agent start`.
+- `extension_not_connected` from `/chatd-url`:
+  - Ensure extension is connected to relay (`browserforce status`).
+- `Unauthorized` from `/v1/*`:
+  - Token mismatch/stale bootstrap. Restart daemon and reopen side panel.
+- `Context: unavailable` chip:
+  - No `run.usage` emitted yet for that session. Send a run and re-open session metadata.
+
+## Screenshots (Add Later)
+
+Placeholders for future docs updates:
+
+- Side-panel open state
+- Session switcher
+- Context usage chip
+- Typical error states (`agent_not_running`, `extension_not_connected`)

From 801969705f8cb795e828c844806b8694bb6df692 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 00:16:15 +0530
Subject: [PATCH 180/192] Fix composer textarea bottom clipping

- Increase composer textarea line box + vertical padding so descenders are not cut off\n- Raise textarea max-height to keep multiline drafts readable before internal scrolling\n- Make autosize respect computed CSS max-height and add a 1px safety buffer to avoid partial-line clipping
---
 extension/agent-panel.css | 8 ++++----
 extension/agent-panel.js  | 4 +++-
 2 files changed, 7 insertions(+), 5 deletions(-)

diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index 9b3c8f3..af054b0 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -992,11 +992,11 @@ body {
   font-size: 12px;
   font-family: inherit;
   color: var(--text);
-  line-height: 1.35;
-  min-height: 18px;
-  max-height: 56px;
+  line-height: 1.4;
+  min-height: 22px;
+  max-height: 96px;
   overflow-y: auto;
-  padding: 0;
+  padding: 2px 0 3px;
 }
 
 .composer-textarea::placeholder {
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index c2b0665..9799e1a 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -111,8 +111,10 @@ function reconcileSessionRunState(sessionId) {
 }
 
 function autoResizeInput() {
+  const styles = window.getComputedStyle(chatInputEl);
+  const maxHeight = Number.parseFloat(styles.maxHeight) || 160;
   chatInputEl.style.height = 'auto';
-  chatInputEl.style.height = `${Math.min(chatInputEl.scrollHeight, 160)}px`;
+  chatInputEl.style.height = `${Math.min(chatInputEl.scrollHeight + 1, maxHeight)}px`;
 }
 
 function syncComposerLayoutState() {

From a84ebe3e4b4effe065a44c4f70105e53fae823aa Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 00:21:32 +0530
Subject: [PATCH 181/192] fix(mcp): stabilize labeled screenshots via
 sequential snapshot+screenshot flow

- split labeled capture into deterministic sequential steps: build refs/snapshot, render overlays, then take screenshot

- preserve visible in-image labels while reducing overload risk with capped refs and bounded concurrency

- respect interactiveOnly/refAll behavior in snapshot refs and clear stale ref maps when AX tree is unavailable

- add focused tests for label box filtering/capping and end-to-end labeled screenshot helper behavior
---
 mcp/src/a11y-labels.js               |  7 ++-
 mcp/src/exec-engine.js               | 94 ++++++++++++++++++++++++++--
 mcp/src/index.js                     |  6 +-
 mcp/src/snapshot.js                  |  4 +-
 mcp/test/a11y-labels.test.js         | 44 ++++++++++++-
 mcp/test/exec-engine-plugins.test.js | 84 +++++++++++++++++++++++++
 6 files changed, 226 insertions(+), 13 deletions(-)

diff --git a/mcp/src/a11y-labels.js b/mcp/src/a11y-labels.js
index 6ec29aa..942f076 100644
--- a/mcp/src/a11y-labels.js
+++ b/mcp/src/a11y-labels.js
@@ -356,6 +356,7 @@ export const A11Y_CLIENT_CODE = `
 const MAX_CONCURRENCY = 24;
 const BOX_MODEL_TIMEOUT_MS = 5000;
 const MAX_SCREENSHOT_DIMENSION = 1568;
+const MAX_LABEL_BOXES = 400;
 
 export async function resolveScopeBackendNodeId(cdp, selector) {
   if (!selector) return null;
@@ -371,10 +372,12 @@ export async function resolveScopeBackendNodeId(cdp, selector) {
 }
 
 export async function getLabelBoxes(cdp, refs) {
+  const labelRefs = refs
+    .filter(ref => ref.backendNodeId && INTERACTIVE_ROLES.has(ref.role))
+    .slice(0, MAX_LABEL_BOXES);
   const sema = new Semaphore(MAX_CONCURRENCY);
   const results = await Promise.all(
-    refs.map(async (ref) => {
-      if (!ref.backendNodeId) return null;
+    labelRefs.map(async (ref) => {
       await sema.acquire();
       try {
         const response = await Promise.race([
diff --git a/mcp/src/exec-engine.js b/mcp/src/exec-engine.js
index e538aca..60a5331 100644
--- a/mcp/src/exec-engine.js
+++ b/mcp/src/exec-engine.js
@@ -10,13 +10,16 @@ import {
   TEST_ID_ATTRS, createSmartDiff,
   buildSnapshotText, parseSearchPattern, annotateStableAttrs,
 } from './snapshot.js';
-import { screenshotWithLabels } from './a11y-labels.js';
+import { Semaphore, injectA11yClient, showLabels, hideLabels } from './a11y-labels.js';
 import { getCleanHTML } from './clean-html.js';
 import { getPageMarkdown } from './page-markdown.js';
 
 // ─── Configuration ───────────────────────────────────────────────────────────
 
 const DEFAULT_PORT = 19222;
+const LABEL_SCREENSHOT_MAX_DIMENSION = 1568;
+const LABEL_BOX_CONCURRENCY = 16;
+const MAX_LABEL_OVERLAY_REFS = 300;
 export const BF_DIR = join(homedir(), '.browserforce');
 export const CDP_URL_FILE = join(BF_DIR, 'cdp-url');
 const RELAY_SCRIPT = fileURLToPath(new URL('../../relay/src/index.js', import.meta.url));
@@ -569,7 +572,10 @@ export function buildExecContext(
   const snapshot = async ({ selector, search, showDiffSinceLastCall = true } = {}) => {
     const page = activePage();
     const axRoot = await getAccessibilityTree(page, selector);
-    if (!axRoot) return 'No accessibility tree available for this page.';
+    if (!axRoot) {
+      lastRefToLocator.set(page, new Map());
+      return 'No accessibility tree available for this page.';
+    }
     const stableIds = await getStableIds(page, selector);
     annotateStableAttrs(axRoot, stableIds);
     const searchPattern = parseSearchPattern(search);
@@ -615,6 +621,36 @@ export function buildExecContext(
     return fullSnapshot;
   };
 
+  const buildSnapshotData = async ({ selector, search, refAll = false } = {}) => {
+    const page = activePage();
+    const axRoot = await getAccessibilityTree(page, selector);
+    if (!axRoot) {
+      lastRefToLocator.set(page, new Map());
+      return {
+        text: 'No accessibility tree available for this page.',
+        refs: [],
+        page,
+      };
+    }
+
+    const stableIds = await getStableIds(page, selector);
+    annotateStableAttrs(axRoot, stableIds);
+    const searchPattern = parseSearchPattern(search);
+    const { text: snapshotText, refs } = buildSnapshotText(axRoot, null, searchPattern, { refAll });
+    const refMap = new Map(refs.map(({ ref, locator }) => [ref, locator]));
+    lastRefToLocator.set(page, refMap);
+    const title = await page.title().catch(() => '');
+    const pageUrl = page.url();
+    const refTable = refs.length > 0
+      ? '\n\n--- Ref → Locator ---\n' + refs.map(r => `${r.ref}: ${r.locator}`).join('\n')
+      : '';
+    return {
+      text: `Page: ${title} (${pageUrl})\nRefs: ${refs.length} labeled elements\n\n${snapshotText}${refTable}`,
+      refs,
+      page,
+    };
+  };
+
   const refToLocator = ({ ref, page: targetPage } = {}) => {
     const p = targetPage || activePage();
     const map = lastRefToLocator.get(p);
@@ -646,12 +682,58 @@ export function buildExecContext(
   };
 
   const screenshotWithAccessibilityLabels = async ({ selector, interactiveOnly = true } = {}) => {
-    const page = activePage();
-    const { screenshot, snapshot: snapText, labelCount } = await screenshotWithLabels(page, {
+    const { text: snapText, refs, page } = await buildSnapshotData({
       selector,
-      interactiveOnly,
+      search: null,
+      refAll: !interactiveOnly,
     });
-    return { _bf_type: 'labeled_screenshot', screenshot, snapshot: snapText, labelCount };
+
+    const sema = new Semaphore(LABEL_BOX_CONCURRENCY);
+    const labelCandidates = refs
+      .map(ref => ({ ref: ref.ref, role: ref.role, locator: ref.locator }))
+      .slice(0, MAX_LABEL_OVERLAY_REFS);
+    const labels = (await Promise.all(labelCandidates.map(async (candidate) => {
+      await sema.acquire();
+      try {
+        const box = await page.locator(candidate.locator).first().boundingBox();
+        if (!box || box.width <= 0 || box.height <= 0) return null;
+        return {
+          ref: candidate.ref,
+          role: candidate.role,
+          box: { x: box.x, y: box.y, width: box.width, height: box.height },
+        };
+      } catch {
+        return null;
+      } finally {
+        sema.release();
+      }
+    }))).filter(Boolean);
+
+    let labelsInjected = false;
+    let labelCount = 0;
+    if (labels.length > 0) {
+      await injectA11yClient(page);
+      labelCount = await showLabels(page, labels);
+      labelsInjected = true;
+    }
+
+    const viewport = await page.evaluate((maxDim) => ({
+      width: Math.min(window.innerWidth, maxDim),
+      height: Math.min(window.innerHeight, maxDim),
+    }), LABEL_SCREENSHOT_MAX_DIMENSION);
+    try {
+      const screenshot = await page.screenshot({
+        type: 'jpeg',
+        quality: 80,
+        scale: 'css',
+        clip: { x: 0, y: 0, ...viewport },
+      });
+      return { _bf_type: 'labeled_screenshot', screenshot, snapshot: snapText, labelCount };
+    } finally {
+      if (labelsInjected) {
+        try { await hideLabels(page); } catch { /* page may have navigated */ }
+      }
+    }
   };
 
   const cleanHTML = (selector, opts) => getCleanHTML(activePage(), selector, opts);
diff --git a/mcp/src/index.js b/mcp/src/index.js
index 802dd77..fdcb312 100644
--- a/mcp/src/index.js
+++ b/mcp/src/index.js
@@ -306,7 +306,8 @@ Helpers:
   screenshotWithAccessibilityLabels({ selector?, interactiveOnly? })
                                      Vimium-style labeled screenshot + accessibility snapshot.
                                      Returns image with color-coded element labels (e1, e2...) and
-                                     matching text snapshot. Use when visual layout matters.
+                                     matching text snapshot. Use for explicitly annotated/labeled captures.
+                                     For plain screenshots, prefer state.page.screenshot() and snapshot() as separate calls.
   cleanHTML(selector?, opts?)        Cleaned HTML — strips scripts, styles, decorative elements.
                                      Keeps semantic attrs: href, src, role, aria-*, data-testid.
                                      opts: { maxAttrLen?, maxContentLen? }
@@ -412,7 +413,8 @@ Use snapshot({ showDiffSinceLastCall: false }) when you need full output.
 ═══ SNAPSHOT VS SCREENSHOT ═══
 
 Prefer snapshot() for text/content/verification.
-Use screenshotWithAccessibilityLabels() only when visual layout or spatial relationships matter.
+For plain screenshot requests, use state.page.screenshot() first, then snapshot() if textual verification is needed.
+Use screenshotWithAccessibilityLabels() only when labels/refs on the image are explicitly needed.
 
 snapshot vs cleanHTML vs pageMarkdown:
   - snapshot(): interactive structure, refs, quick verification
diff --git a/mcp/src/snapshot.js b/mcp/src/snapshot.js
index 6913f5a..8db53b1 100644
--- a/mcp/src/snapshot.js
+++ b/mcp/src/snapshot.js
@@ -86,7 +86,7 @@ export function hasMatchingDescendant(node, pattern) {
  *
  * When multiple nodes resolve to the same ref, a -2, -3 suffix is appended.
  */
-export function buildSnapshotText(axTree, stableIdMap, searchPattern) {
+export function buildSnapshotText(axTree, stableIdMap, searchPattern, { refAll = false } = {}) {
   const lines = [];
   const refs = [];
   let refCounter = 0;
@@ -119,7 +119,7 @@ export function buildSnapshotText(axTree, stableIdMap, searchPattern) {
       lineText += ` "${escapeLocatorName(name)}"`;
     }
 
-    if (isInteractive) {
+    if (isInteractive || (refAll && isContext)) {
       refCounter++;
       const stableEntry = node._stableAttr || null;
       let baseRef = stableEntry ? stableEntry.value : `e${refCounter}`;
diff --git a/mcp/test/a11y-labels.test.js b/mcp/test/a11y-labels.test.js
index 4fdae2f..bc7227e 100644
--- a/mcp/test/a11y-labels.test.js
+++ b/mcp/test/a11y-labels.test.js
@@ -1,6 +1,6 @@
 import { describe, it } from 'node:test';
 import assert from 'node:assert/strict';
-import { Semaphore, buildBoxFromQuad, buildSnapshotFromCdpNodes } from '../src/a11y-labels.js';
+import { Semaphore, buildBoxFromQuad, buildSnapshotFromCdpNodes, getLabelBoxes } from '../src/a11y-labels.js';
 
 describe('Semaphore', () => {
   it('allows up to max concurrent acquisitions', async () => {
@@ -177,3 +177,45 @@ describe('buildSnapshotFromCdpNodes', () => {
     assert.equal(refs[0].role, 'button');
   });
 });
+
+describe('getLabelBoxes', () => {
+  it('resolves box models only for interactive refs', async () => {
+    const calledBackendNodeIds = [];
+    const cdp = {
+      send: async (_method, { backendNodeId }) => {
+        calledBackendNodeIds.push(backendNodeId);
+        return { model: { border: [0, 0, 10, 0, 10, 10, 0, 10] } };
+      },
+    };
+    const refs = [
+      { ref: 'e1', role: 'button', backendNodeId: 11 },
+      { ref: 'e2', role: 'heading', backendNodeId: 12 },
+      { ref: 'e3', role: 'link', backendNodeId: 13 },
+      { ref: 'e4', role: 'navigation', backendNodeId: 14 },
+      { ref: 'e5', role: 'textbox' },
+    ];
+
+    const labels = await getLabelBoxes(cdp, refs);
+    assert.deepEqual(calledBackendNodeIds.sort((a, b) => a - b), [11, 13]);
+    assert.deepEqual(labels.map(label => label.ref), ['e1', 'e3']);
+  });
+
+  it('caps box-model lookups to avoid oversized CDP bursts', async () => {
+    let callCount = 0;
+    const cdp = {
+      send: async () => {
+        callCount++;
+        return { model: { border: [0, 0, 10, 0, 10, 10, 0, 10] } };
+      },
+    };
+    const refs = Array.from({ length: 450 }, (_v, i) => ({
+      ref: `e${i + 1}`,
+      role: 'button',
+      backendNodeId: i + 1,
+    }));
+
+    const labels = await getLabelBoxes(cdp, refs);
+    assert.equal(callCount, 400);
+    assert.equal(labels.length, 400);
+  });
+});
diff --git a/mcp/test/exec-engine-plugins.test.js b/mcp/test/exec-engine-plugins.test.js
index 50fe705..e664bf4 100644
--- a/mcp/test/exec-engine-plugins.test.js
+++ b/mcp/test/exec-engine-plugins.test.js
@@ -36,6 +36,66 @@ function createSnapshotPage() {
   };
 }
 
+function createLabeledScreenshotPage() {
+  const screenshotCalls = [];
+  const screenshotBuffer = Buffer.from('jpeg-image-data');
+  const locatorCalls = [];
+  let a11yInjected = false;
+  return {
+    isClosed: () => false,
+    url: () => 'https://example.test',
+    title: async () => 'Snapshot Test',
+    screenshot: async (opts) => {
+      screenshotCalls.push(opts);
+      return screenshotBuffer;
+    },
+    locator: (selector) => {
+      locatorCalls.push(selector);
+      return {
+        first: () => ({
+          boundingBox: async () => ({ x: 20, y: 30, width: 160, height: 40 }),
+        }),
+      };
+    },
+    evaluate: async (_fn, arg) => {
+      if (typeof _fn === 'string') {
+        a11yInjected = true;
+        return undefined;
+      }
+      const source = String(_fn);
+      if (source.includes('typeof globalThis.__bf_a11y')) {
+        return a11yInjected;
+      }
+      if (source.includes('renderA11yLabels(entries)')) {
+        return Array.isArray(arg) ? arg.length : 0;
+      }
+      if (source.includes('__bf_labels__')) {
+        return undefined;
+      }
+      if (arg && typeof arg === 'object' && Array.isArray(arg.testIdAttrs)) {
+        return {};
+      }
+      if (typeof arg === 'number') {
+        return { width: 1200, height: 700 };
+      }
+      return {
+        role: 'WebArea',
+        name: '',
+        children: [
+          {
+            role: 'main',
+            name: '',
+            children: [{ role: 'button', name: 'Submit', children: [] }],
+          },
+        ],
+      };
+    },
+    getScreenshotCalls: () => screenshotCalls,
+    getScreenshotBuffer: () => screenshotBuffer,
+    getLocatorCalls: () => locatorCalls,
+  };
+}
+
 function createCleanHtmlPage() {
   return {
     isClosed: () => false,
@@ -436,6 +496,30 @@ test('buildExecContext exposes screenshot and content helpers in execute scope',
   assert.equal(typeof ctx.pageMarkdown, 'function');
 });
 
+test('screenshotWithAccessibilityLabels runs snapshot and direct screenshot sequentially', async () => {
+  const page = createLabeledScreenshotPage();
+  const ctx = buildExecContext(page, { pages: () => [page] }, {}, {}, {});
+
+  const result = await ctx.screenshotWithAccessibilityLabels({ interactiveOnly: false });
+  const calls = page.getScreenshotCalls();
+  const locatorCalls = page.getLocatorCalls();
+
+  assert.equal(calls.length, 1);
+  assert.equal(locatorCalls.length, 2);
+  assert.deepEqual(calls[0], {
+    type: 'jpeg',
+    quality: 80,
+    scale: 'css',
+    clip: { x: 0, y: 0, width: 1200, height: 700 },
+  });
+  assert.equal(result._bf_type, 'labeled_screenshot');
+  assert.equal(result.screenshot.toString('base64'), page.getScreenshotBuffer().toString('base64'));
+  assert.ok(result.snapshot.includes('Page: Snapshot Test (https://example.test)'));
+  assert.ok(result.snapshot.includes('- button "Submit" [ref=e2]'));
+  assert.equal(result.labelCount, 2);
+  assert.ok(result.snapshot.includes('- main [ref=e1]'));
+});
+
 test('buildExecContext exposes callable ref and CDP helpers', async () => {
   const fakeSession = { send: async () => ({}) };
   const page = {

From 913fdc7593a9687c0d578c18f33648edb00f69ba Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 00:35:08 +0530
Subject: [PATCH 182/192] Restore run activity headings and improve thinking
 affordances

- Derive compact reasoning heading rows from chat.commentary updates while preserving inline commentary text bubbles\n- Keep latest active BrowserForce:execute rows in title treatment so shimmer/transition remains visible during execute-heavy runs\n- Add left-side composer thinking spinner tied to active run state while retaining stop button on the right\n- Increase shimmer contrast for better visibility in the side panel theme\n- Extend reducer and panel contract tests to lock heading, shimmer, and composer thinking behavior
---
 extension/agent-panel-state.js               | 50 +++++++++++++++++++-
 extension/agent-panel.css                    | 42 ++++++++++++----
 extension/agent-panel.js                     | 11 +++--
 test/agent/agent-panel-contract.test.js      |  2 +
 test/agent/agent-panel-send-contract.test.js |  7 +++
 test/agent/sse-events.test.js                |  3 ++
 6 files changed, 102 insertions(+), 13 deletions(-)

diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 50d376e..4a48b2d 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -178,6 +178,40 @@ function normalizeStepDetails(details, label = '') {
   return lines;
 }
 
+function stripInlineMarkdown(text) {
+  return String(text || '')
+    .replace(/`([^`]+)`/g, '$1')
+    .replace(/\*\*([^*]+)\*\*/g, '$1')
+    .replace(/\*([^*\n]+)\*/g, '$1')
+    .replace(/~~([^~]+)~~/g, '$1')
+    .replace(/\[([^\]]+)\]\(([^)]+)\)/g, '$1')
+    .replace(/^>\s*/gm, '')
+    .trim();
+}
+
+function commentaryHeadingFromDelta(delta) {
+  const source = String(delta || '').trim();
+  if (!source) return '';
+  const firstLine = source
+    .split('\n')
+    .map((line) => line.trim())
+    .find(Boolean) || '';
+  if (!firstLine) return '';
+
+  let heading = stripInlineMarkdown(firstLine)
+    .replace(/^[\-*•\d.)\s]+/, '')
+    .replace(/^\s*(?:i['’]?m|i am|i['’]?ll|i will)\s+/i, '')
+    .replace(/^(?:next|now)\s*,?\s+/i, '')
+    .replace(/[.?!:;,\s]+$/, '')
+    .replace(/\s+/g, ' ')
+    .trim();
+
+  if (!heading) return '';
+  if (/^(browserforce|recovery action|error[:\s])/i.test(heading)) return '';
+  if (heading.length > 96) heading = `${heading.slice(0, 93).trimEnd()}...`;
+  return heading.charAt(0).toUpperCase() + heading.slice(1);
+}
+
 function normalizeStep(step) {
   if (!step || typeof step !== 'object') return null;
   const label = trimStepLabel(step.label);
@@ -713,11 +747,25 @@ export function applyEvent(state = initialState, evt = {}) {
   if (evt.event === 'chat.commentary') {
     const run = state.runs[evt.runId] || { text: '', done: false, steps: [], timeline: [] };
     const delta = evt.payload?.delta || '';
+    const heading = commentaryHeadingFromDelta(delta);
+    const commentaryStep = heading
+      ? {
+        kind: 'reasoning',
+        status: 'running',
+        label: heading,
+      }
+      : null;
+    const timelineWithHeading = commentaryStep
+      ? pushTimelineEntry(run, { type: 'step', ...commentaryStep })
+      : (Array.isArray(run.timeline) ? run.timeline : []);
+    const nextSteps = commentaryStep ? pushStep(run, commentaryStep) : (Array.isArray(run.steps) ? run.steps : []);
+    const timeline = pushTimelineEntry({ timeline: timelineWithHeading }, { type: 'text', text: delta });
     return {
       ...state,
       runs: upsertRun(state, evt.runId, {
         sessionId: evt.sessionId,
-        timeline: pushTimelineEntry(run, { type: 'text', text: delta }),
+        steps: nextSteps,
+        timeline,
       }),
     };
   }
diff --git a/extension/agent-panel.css b/extension/agent-panel.css
index af054b0..ed86b01 100644
--- a/extension/agent-panel.css
+++ b/extension/agent-panel.css
@@ -703,20 +703,20 @@ body {
 }
 
 .step-label.title-label.shimmer-text {
-  color: transparent;
-  -webkit-text-fill-color: transparent;
+  color: var(--text);
   background-image: linear-gradient(
-    95deg,
-    rgba(61, 48, 40, 0.45) 0%,
-    rgba(61, 48, 40, 0.45) 35%,
-    rgba(193, 95, 60, 0.96) 50%,
-    rgba(61, 48, 40, 0.45) 65%,
-    rgba(61, 48, 40, 0.45) 100%
+    96deg,
+    rgba(61, 48, 40, 0.62) 0%,
+    rgba(61, 48, 40, 0.62) 34%,
+    rgba(193, 95, 60, 1) 50%,
+    rgba(61, 48, 40, 0.62) 66%,
+    rgba(61, 48, 40, 0.62) 100%
   );
   background-size: 220% 100%;
   background-position: 110% 0;
   -webkit-background-clip: text;
   background-clip: text;
+  -webkit-text-fill-color: transparent;
   animation: reasoning-shimmer 2.3s ease-in-out infinite;
 }
 
@@ -956,6 +956,7 @@ body {
   min-height: 38px;
   box-shadow: none;
   transition: border-color 0.16s, box-shadow 0.16s, min-height 0.16s, padding 0.16s;
+  position: relative;
 }
 
 .composer-box:focus-within {
@@ -983,6 +984,26 @@ body {
   align-self: end;
 }
 
+.composer-box.is-thinking::before {
+  content: '';
+  position: absolute;
+  left: 10px;
+  top: 50%;
+  width: 11px;
+  height: 11px;
+  border-radius: 999px;
+  border: 1.8px solid var(--crail-soft);
+  border-top-color: var(--crail);
+  transform: translateY(-50%);
+  animation: spin 0.8s linear infinite;
+  pointer-events: none;
+}
+
+.composer-box.is-multiline.is-thinking::before {
+  top: 12px;
+  transform: none;
+}
+
 .composer-textarea {
   flex: 1;
   resize: none;
@@ -999,6 +1020,10 @@ body {
   padding: 2px 0 3px;
 }
 
+.composer-box.is-thinking .composer-textarea {
+  padding-left: 18px;
+}
+
 .composer-textarea::placeholder {
   color: var(--text-subtle);
 }
@@ -1385,6 +1410,7 @@ body {
 }
 
 @media (prefers-reduced-motion: reduce) {
+  .composer-box.is-thinking::before,
   .step-item.pulse .run-step-icon,
   .step-label.title-label.shimmer-text,
   .step-label.title-label.title-transition-in,
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 9799e1a..e4d1129 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -133,6 +133,7 @@ function syncComposerState() {
   const hasText = chatInputEl.value.trim().length > 0;
   const runInProgress = isActiveRunInProgress();
 
+  composerBoxEl.classList.toggle('is-thinking', enabled && runInProgress);
   stopRunBtn.disabled = !enabled || !runInProgress;
   stopRunBtn.classList.toggle('active', enabled && runInProgress);
   stopRunBtn.hidden = !runInProgress;
@@ -675,12 +676,14 @@ function renderRunTimeline(run, fallbackText = '') {
     const isLatest = index === latestStepIndex;
     const shouldPulse = isLatest && status === 'running';
     const isReasoningTitle = String(entry?.kind || '').toLowerCase() === 'reasoning';
-    const isRunningReasoning = isReasoningTitle && normalizedStatus === 'running';
+    const isExecuteTitle = isBrowserForceExecuteStep(entry);
+    const isTitleRow = isReasoningTitle || isExecuteTitle;
+    const isRunningTitle = isTitleRow && normalizedStatus === 'running';
     const labelClasses = ['step-label'];
-    if (isReasoningTitle) labelClasses.push('title-label');
-    if (isRunningReasoning && isLatest) {
+    if (isTitleRow) labelClasses.push('title-label');
+    if (isRunningTitle && isLatest) {
       labelClasses.push('shimmer-text');
-      if (shouldAnimateLatestReasoningTitle({ run, entry, isLatest, isRunningReasoning })) {
+      if (shouldAnimateLatestReasoningTitle({ run, entry, isLatest, isRunningReasoning: isRunningTitle })) {
         labelClasses.push('title-transition-in');
       }
     }
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index a81e1be..62362b4 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -57,6 +57,8 @@ test('agent panel composer matches compact/expanded shell structure', () => {
   assert.match(html, /id="bf-stop-run"[\s\S]*icon-stop/);
   assert.match(html, /id="bf-send-btn"/);
   assert.match(css, /\.composer-box\.is-multiline/);
+  assert.match(css, /\.composer-box\.is-thinking::before/);
+  assert.match(css, /\.composer-box\.is-thinking \.composer-textarea/);
   assert.match(css, /\.btn-send[\s\S]*border-radius:\s*999px/);
 });
 
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index 08e8813..f9a9274 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -150,6 +150,12 @@ test('tool-call timeline entries render collapsed toggle rows with click-to-expa
   assert.match(js, /closest\('button\[data-step-key\]'\)/);
 });
 
+test('latest running BrowserForce execute rows are treated as title rows with shimmer', () => {
+  assert.match(js, /const isExecuteTitle = isBrowserForceExecuteStep\(entry\)/);
+  assert.match(js, /const isTitleRow = isReasoningTitle \|\| isExecuteTitle/);
+  assert.match(js, /if \(isRunningTitle && isLatest\) \{/);
+});
+
 test('done tool-call icon renders animated svg check markup', () => {
   assert.match(js, /function renderRunStepIcon\(icon\)/);
   assert.match(js, /run-step-icon-done-svg/);
@@ -165,6 +171,7 @@ test('composer toggles single-line and multiline visual state from textarea heig
 });
 
 test('send and stop buttons are mutually exclusive based on run state', () => {
+  assert.match(js, /composerBoxEl\.classList\.toggle\('is-thinking', enabled && runInProgress\)/);
   assert.match(js, /stopRunBtn\.hidden\s*=\s*!runInProgress/);
   assert.match(js, /sendBtn\.hidden\s*=\s*runInProgress/);
 });
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index 1e52e2b..f0e116e 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -201,7 +201,10 @@ test('chat.commentary text stays inline but does not pollute final assistant mes
   const s3 = applyEvent(s2, { event: 'chat.final', runId: 'r1', sessionId: 's1', payload: { text: 'Final answer.' } });
 
   const timeline = s3.runs.r1.timeline || [];
+  const commentaryStep = (s3.runs.r1.steps || []).find((item) => item?.kind === 'reasoning' && /Inspecting files/.test(item?.label || ''));
   assert.equal(timeline.some((item) => item?.type === 'text' && /Inspecting files/.test(item?.text || '')), true);
+  assert.equal(timeline.some((item) => item?.type === 'step' && item?.kind === 'reasoning' && /Inspecting files/.test(item?.label || '')), true);
+  assert.equal(Boolean(commentaryStep), true);
   assert.equal(timeline.some((item) => item?.type === 'text' && /Final answer/.test(item?.text || '')), true);
   assert.equal(s3.runs.r1.text, 'Final answer.');
   assert.equal(s3.messagesBySession.s1.at(-1)?.text, 'Final answer.');

From 3e6d2e30fe139c1711e874497c2675258a0fcc08 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 09:10:09 +0530
Subject: [PATCH 183/192] Render commentary headings from historical text
 timeline entries

- Add render-layer fallback in normalizeRunTimeline to derive reasoning heading rows from interim text entries when step data is missing\n- Treat non-final text chunks (with later tool steps) as commentary headings so older sessions show meaningful progress titles\n- Preserve final text bubble rendering by leaving terminal text entries untouched\n- Avoid backend migration by deriving headings at display time for existing session logs
---
 extension/agent-panel.js | 70 ++++++++++++++++++++++++++++++++++++----
 1 file changed, 64 insertions(+), 6 deletions(-)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index e4d1129..f5a9a15 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -512,12 +512,70 @@ function renderSessions() {
 function normalizeRunTimeline(run, fallbackText = '') {
   if (!run) return [];
   if (Array.isArray(run.timeline) && run.timeline.length > 0) {
-    return run.timeline.filter((entry) => {
-      if (!entry || typeof entry !== 'object') return false;
-      if (entry.type === 'text') return typeof entry.text === 'string' && entry.text.length > 0;
-      if (entry.type === 'step') return typeof entry.label === 'string' && entry.label.trim().length > 0;
-      return false;
-    });
+    const isRenderableStep = (entry) => (
+      !!entry
+      && typeof entry === 'object'
+      && entry.type === 'step'
+      && typeof entry.label === 'string'
+      && entry.label.trim().length > 0
+    );
+    const isRenderableText = (entry) => (
+      !!entry
+      && typeof entry === 'object'
+      && entry.type === 'text'
+      && typeof entry.text === 'string'
+      && entry.text.trim().length > 0
+    );
+    const stripInlineMarkdown = (text) => String(text || '')
+      .replace(/`([^`]+)`/g, '$1')
+      .replace(/\*\*([^*]+)\*\*/g, '$1')
+      .replace(/\*([^*\n]+)\*/g, '$1')
+      .replace(/~~([^~]+)~~/g, '$1')
+      .replace(/\[([^\]]+)\]\(([^)]+)\)/g, '$1')
+      .replace(/^>\s*/gm, '')
+      .trim();
+    const headingFromText = (text) => {
+      const firstLine = String(text || '')
+        .split('\n')
+        .map((line) => line.trim())
+        .find(Boolean) || '';
+      if (!firstLine) return '';
+      let heading = stripInlineMarkdown(firstLine)
+        .replace(/^[\-*\d.)\s]+/, '')
+        .replace(/^\s*(?:i'?m|i am|i'?ll|i will)\s+/i, '')
+        .replace(/^(?:next|now)\s*,?\s+/i, '')
+        .replace(/[.?!:;,\s]+$/, '')
+        .replace(/\s+/g, ' ')
+        .trim();
+      if (!heading) return '';
+      if (heading.length > 96) heading = `${heading.slice(0, 93).trimEnd()}...`;
+      return heading.charAt(0).toUpperCase() + heading.slice(1);
+    };
+
+    const timeline = [];
+    const source = run.timeline.filter((entry) => isRenderableStep(entry) || isRenderableText(entry));
+    for (let index = 0; index < source.length; index += 1) {
+      const entry = source[index];
+      if (entry.type === 'step') {
+        timeline.push(entry);
+        continue;
+      }
+      const hasStepAfter = source.slice(index + 1).some((item) => item.type === 'step');
+      if (!hasStepAfter) {
+        timeline.push(entry);
+        continue;
+      }
+      const heading = headingFromText(entry.text || '');
+      if (!heading) continue;
+      timeline.push({
+        type: 'step',
+        kind: 'reasoning',
+        status: run.done ? 'done' : 'running',
+        key: `derived:commentary:${index}`,
+        label: heading,
+      });
+    }
+    return timeline;
   }
 
   const steps = Array.isArray(run.steps) ? run.steps : [];

From 751d9d38f52f2d0b8066a91da8cfbfce688136de Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 09:13:28 +0530
Subject: [PATCH 184/192] Add paced incremental rendering for streamed chat
 deltas

- Split large chat.delta/chat.commentary payloads into smaller chunks for visible incremental streaming in side panel\n- Pump buffered chunks on a short timer so updates are perceptible instead of arriving as one large block\n- Flush pending chunks before non-text events for the same run to preserve event ordering around tool/final states\n- Reset stream chunk queue on session/event-loop switches to prevent stale carry-over between sessions\n- Add panel contract coverage for chunked stream update pipeline
---
 extension/agent-panel.js                     | 115 ++++++++++++++++++-
 test/agent/agent-panel-send-contract.test.js |   9 ++
 2 files changed, 123 insertions(+), 1 deletion(-)

diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index f5a9a15..3464ba2 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -19,6 +19,9 @@ const REASONING_PRESETS = [
 ];
 const BROWSERFORCE_AGENT_OPEN_REQUEST_KEY = 'browserforceAgentOpenRequest';
 const BROWSERFORCE_AGENT_OPEN_REQUEST_MAX_AGE_MS = 60_000;
+const STREAM_CHUNK_TARGET_CHARS = 24;
+const STREAM_CHUNK_LOOKAHEAD_CHARS = 14;
+const STREAM_CHUNK_INTERVAL_MS = 26;
 
 const state = {
   value: initialState,
@@ -41,6 +44,8 @@ const state = {
   sessionTitleDrafts: {},
   eventController: null,
   eventLoopToken: 0,
+  streamEventQueue: [],
+  streamEventTimer: null,
   sessionSelectionToken: 0,
   popover: 'none',
   startupIssue: null,
@@ -261,7 +266,7 @@ function dispatch(action) {
   render();
 }
 
-function dispatchEvent(evt) {
+function applyIncomingEvent(evt) {
   state.value = applyEvent(state.value, evt);
   if (evt?.event === 'run.started' && evt.sessionId && evt.runId) {
     state.currentRunBySession = assignSessionRunId(state.currentRunBySession, evt.sessionId, evt.runId);
@@ -272,6 +277,112 @@ function dispatchEvent(evt) {
   render();
 }
 
+function splitDeltaForDisplayStreaming(delta) {
+  const text = String(delta || '');
+  if (!text) return [];
+  if (text.length <= STREAM_CHUNK_TARGET_CHARS) return [text];
+  const chunks = [];
+  let cursor = 0;
+  while (cursor < text.length) {
+    let end = Math.min(cursor + STREAM_CHUNK_TARGET_CHARS, text.length);
+    if (end < text.length) {
+      const lookahead = text.slice(end, Math.min(end + STREAM_CHUNK_LOOKAHEAD_CHARS, text.length));
+      const wsIndex = lookahead.search(/\s/);
+      if (wsIndex >= 0) {
+        end += wsIndex + 1;
+      }
+    }
+    if (end <= cursor) end = Math.min(cursor + STREAM_CHUNK_TARGET_CHARS, text.length);
+    chunks.push(text.slice(cursor, end));
+    cursor = end;
+  }
+  return chunks;
+}
+
+function resetStreamEventQueue() {
+  if (state.streamEventTimer) {
+    window.clearTimeout(state.streamEventTimer);
+    state.streamEventTimer = null;
+  }
+  state.streamEventQueue = [];
+}
+
+function scheduleStreamEventPump() {
+  if (state.streamEventTimer || state.streamEventQueue.length === 0) return;
+  state.streamEventTimer = window.setTimeout(() => {
+    state.streamEventTimer = null;
+    const next = state.streamEventQueue.shift();
+    if (next) {
+      applyIncomingEvent(next);
+    }
+    if (state.streamEventQueue.length > 0) {
+      scheduleStreamEventPump();
+    }
+  }, STREAM_CHUNK_INTERVAL_MS);
+}
+
+function flushStreamEventsForRun(sessionId, runId) {
+  if (!sessionId || !runId || state.streamEventQueue.length === 0) return;
+  const keep = [];
+  const flush = [];
+  for (const queued of state.streamEventQueue) {
+    if (queued?.sessionId === sessionId && queued?.runId === runId) {
+      flush.push(queued);
+    } else {
+      keep.push(queued);
+    }
+  }
+  state.streamEventQueue = keep;
+  if (flush.length > 0) {
+    for (const queued of flush) {
+      applyIncomingEvent(queued);
+    }
+  }
+  if (state.streamEventTimer) {
+    window.clearTimeout(state.streamEventTimer);
+    state.streamEventTimer = null;
+  }
+  if (state.streamEventQueue.length > 0) {
+    scheduleStreamEventPump();
+  }
+}
+
+function dispatchEvent(evt) {
+  if (!evt || typeof evt !== 'object') return;
+  const eventType = String(evt.event || '');
+  const isTextDeltaEvent = (
+    (eventType === 'chat.delta' || eventType === 'chat.commentary')
+    && typeof evt.payload?.delta === 'string'
+  );
+
+  if (!isTextDeltaEvent) {
+    flushStreamEventsForRun(evt.sessionId, evt.runId);
+    applyIncomingEvent(evt);
+    return;
+  }
+
+  const chunks = splitDeltaForDisplayStreaming(evt.payload.delta);
+  if (chunks.length <= 1) {
+    applyIncomingEvent(evt);
+    return;
+  }
+
+  const firstPayload = { ...(evt.payload || {}), delta: chunks[0] };
+  applyIncomingEvent({ ...evt, payload: firstPayload });
+
+  const bufferedPayload = { ...(evt.payload || {}) };
+  for (let index = 1; index < chunks.length; index += 1) {
+    state.streamEventQueue.push({
+      ...evt,
+      payload: {
+        ...bufferedPayload,
+        delta: chunks[index],
+      },
+    });
+  }
+  scheduleStreamEventPump();
+}
+
 function formatModelLabel(model) {
   return model && String(model).trim() ? model : 'Default';
 }
@@ -1408,6 +1519,7 @@ async function loadSessionMetadata(sessionId) {
 }
 
 async function selectSession(sessionId) {
+  resetStreamEventQueue();
   state.sessionSelectionToken += 1;
   const selectionToken = state.sessionSelectionToken;
   dispatch({ type: 'session.selected', sessionId });
@@ -1555,6 +1667,7 @@ async function consumeEventStream(body, loopToken) {
 }
 
 function connectEvents(sessionId) {
+  resetStreamEventQueue();
   state.eventLoopToken += 1;
   const loopToken = state.eventLoopToken;
   if (state.eventController) state.eventController.abort();
diff --git a/test/agent/agent-panel-send-contract.test.js b/test/agent/agent-panel-send-contract.test.js
index f9a9274..4d262b3 100644
--- a/test/agent/agent-panel-send-contract.test.js
+++ b/test/agent/agent-panel-send-contract.test.js
@@ -150,6 +150,15 @@ test('tool-call timeline entries render collapsed toggle rows with click-to-expa
   assert.match(js, /closest\('button\[data-step-key\]'\)/);
 });
 
+test('text deltas are chunked into paced stream updates for visible incremental rendering', () => {
+  assert.match(js, /STREAM_CHUNK_TARGET_CHARS/);
+  assert.match(js, /STREAM_CHUNK_INTERVAL_MS/);
+  assert.match(js, /function splitDeltaForDisplayStreaming\(delta\)/);
+  assert.match(js, /state\.streamEventQueue/);
+  assert.match(js, /flushStreamEventsForRun\(evt\.sessionId, evt\.runId\)/);
+  assert.match(js, /scheduleStreamEventPump\(\)/);
+});
+
 test('latest running BrowserForce execute rows are treated as title rows with shimmer', () => {
   assert.match(js, /const isExecuteTitle = isBrowserForceExecuteStep\(entry\)/);
   assert.match(js, /const isTitleRow = isReasoningTitle \|\| isExecuteTitle/);

From c89b9d7227ceedf09f689668ba6ae4257030a941 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 09:41:57 +0530
Subject: [PATCH 185/192] agent: isolate codex runs and manage BrowserForce
 AGENTS prompt

- default agent Codex working directory to ~/.browserforce/agent-cwd via BF_CHATD_CODEX_CWD
- sync a managed AGENTS.md template into the agent cwd on start
- preserve user overrides by updating only managed files with BrowserForce header marker
- remove inlined static system prompt from chatd runtime prompt assembly
- add CLI agent test coverage for AGENTS.md sync and content
---
 agent/instructions/AGENTS.md | 25 +++++++++++++++++++++++++
 agent/src/chatd.js           |  5 ++++-
 bin.js                       | 35 ++++++++++++++++++++++++++++++++++-
 test/agent/cli-agent.test.js |  9 ++++++++-
 4 files changed, 71 insertions(+), 3 deletions(-)
 create mode 100644 agent/instructions/AGENTS.md

diff --git a/agent/instructions/AGENTS.md b/agent/instructions/AGENTS.md
new file mode 100644
index 0000000..2e05979
--- /dev/null
+++ b/agent/instructions/AGENTS.md
@@ -0,0 +1,25 @@
+# BrowserForce Agent Instructions
+
+## Role
+
+You are BrowserForce Agent, a warm, practical, action-first browser assistant.
+Your default mode is helpful execution, not long theory.
+
+## Response Style
+
+- Be friendly and clear without fluff.
+- Lead with the direct answer.
+- Prefer short, actionable steps users can do immediately.
+- When useful, end with a concrete next action.
+
+## Browser-First Behavior
+
+- If a request depends on page contents, inspect with BrowserForce tools before answering.
+- Never pretend to have seen page details without a successful tool result in the current run.
+- If a tool call fails, quote the exact error and give one focused recovery action.
+
+## Scope Discipline
+
+- This side-panel assistant is user-help focused first.
+- Do coding/development workflows only when the user explicitly asks for code or repo changes.
+- Avoid heavyweight developer process instructions unless they are directly relevant to the user request.
diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index dcc784e..4b6091d 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -999,7 +999,10 @@ export async function startChatd(opts = {}) {
   const storageRoot = opts.storageRoot || ephemeralStorageRoot;
   const token = opts.token || process.env.BF_CHATD_TOKEN || randomBytes(32).toString('base64url');
   const chatdUrlPath = opts.chatdUrlPath || process.env.BF_CHATD_URL_PATH || CHATD_URL_PATH;
-  const runExecutor = opts.runExecutor || createDefaultRunExecutor({ codexCwd: opts.codexCwd || process.cwd() });
+  const envCodexCwd = String(process.env.BF_CHATD_CODEX_CWD || '').trim();
+  const runExecutor = opts.runExecutor || createDefaultRunExecutor({
+    codexCwd: opts.codexCwd || envCodexCwd || process.cwd(),
+  });
   const modelFetcher = opts.modelFetcher || (() => fetchCodexModelCatalog({
     command: opts.codexCommand || process.env.BF_CHATD_CODEX_COMMAND || 'codex',
     timeoutMs: Number(process.env.BF_CHATD_MODEL_LIST_TIMEOUT_MS || MODEL_LIST_TIMEOUT_MS),
diff --git a/bin.js b/bin.js
index 4f99e26..f35d206 100644
--- a/bin.js
+++ b/bin.js
@@ -620,6 +620,30 @@ async function cmdAgent() {
 
   const lockPath = process.env.BF_CHATD_LOCK_PATH || join(homedir(), '.browserforce', 'chatd-lock.json');
   const chatdUrlPath = process.env.BF_CHATD_URL_PATH || join(homedir(), '.browserforce', 'chatd-url.json');
+  const AGENT_INSTRUCTIONS_TEMPLATE_PATH = fileURLToPath(new URL('./agent/instructions/AGENTS.md', import.meta.url));
+  const MANAGED_AGENTS_HEADER = '<!-- BrowserForce managed AGENTS.md (remove this header to opt out of auto-sync) -->';
+
+  const syncManagedAgentInstructions = async (codexCwd) => {
+    const targetPath = join(codexCwd, 'AGENTS.md');
+    const template = await fsp.readFile(AGENT_INSTRUCTIONS_TEMPLATE_PATH, 'utf8');
+    const nextBody = `${MANAGED_AGENTS_HEADER}\n\n${String(template || '').trimEnd()}\n`;
+
+    let currentBody = null;
+    try {
+      currentBody = await fsp.readFile(targetPath, 'utf8');
+    } catch (error) {
+      if (error?.code !== 'ENOENT') throw error;
+    }
+
+    if (currentBody == null) {
+      await fsp.writeFile(targetPath, nextBody, 'utf8');
+      return;
+    }
+
+    if (currentBody.startsWith(MANAGED_AGENTS_HEADER) && currentBody !== nextBody) {
+      await fsp.writeFile(targetPath, nextBody, 'utf8');
+    }
+  };
 
   if (sub === 'start') {
     const current = await readLock({ lockPath });
@@ -631,6 +655,10 @@ async function cmdAgent() {
     const envPort = Number(process.env.BF_CHATD_PORT || 0);
     const port = await pickChatdPort({ envPort });
     const token = randomBytes(32).toString('base64url');
+    const codexCwd = String(process.env.BF_CHATD_CODEX_CWD || '').trim()
+      || join(homedir(), '.browserforce', 'agent-cwd');
+    await fsp.mkdir(codexCwd, { recursive: true });
+    await syncManagedAgentInstructions(codexCwd);
 
     const child = spawn(
       process.execPath,
@@ -638,7 +666,12 @@ async function cmdAgent() {
       {
         detached: true,
         stdio: 'ignore',
-        env: { ...process.env, BF_CHATD_PORT: String(port), BF_CHATD_TOKEN: token },
+        env: {
+          ...process.env,
+          BF_CHATD_PORT: String(port),
+          BF_CHATD_TOKEN: token,
+          BF_CHATD_CODEX_CWD: codexCwd,
+        },
       },
     );
     child.unref();
diff --git a/test/agent/cli-agent.test.js b/test/agent/cli-agent.test.js
index d686545..f69d697 100644
--- a/test/agent/cli-agent.test.js
+++ b/test/agent/cli-agent.test.js
@@ -2,7 +2,7 @@ import test from 'node:test';
 import assert from 'node:assert/strict';
 import { execFile } from 'node:child_process';
 import { promisify } from 'node:util';
-import { mkdtempSync, rmSync } from 'node:fs';
+import { existsSync, mkdtempSync, readFileSync, rmSync } from 'node:fs';
 import { join } from 'node:path';
 import { tmpdir } from 'node:os';
 
@@ -23,6 +23,13 @@ test('browserforce agent start allocates a non-conflicting port', async () => {
     assert.equal(body.started, true);
     assert.ok(Number.isInteger(body.port));
     assert.ok(Number.isInteger(body.pid));
+    const codexCwd = join(home, '.browserforce', 'agent-cwd');
+    const agentsPath = join(codexCwd, 'AGENTS.md');
+    assert.equal(existsSync(codexCwd), true);
+    assert.equal(existsSync(agentsPath), true);
+    const agentsBody = readFileSync(agentsPath, 'utf8');
+    assert.match(agentsBody, /BrowserForce managed AGENTS\.md/);
+    assert.match(agentsBody, /warm, practical, action-first browser assistant/i);
 
     await cli(['agent', 'stop', '--json'], { HOME: home });
   } finally {

From b675348f64133bd90cc9ab66bb3f6f0f02db3eac Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 09:42:08 +0530
Subject: [PATCH 186/192] docs(agent): document managed AGENTS sync for
 BF_CHATD_CODEX_CWD

- document default codex cwd isolation at ~/.browserforce/agent-cwd
- explain managed AGENTS.md sync behavior on agent start
- describe opt-out path by removing the managed header from AGENTS.md
---
 README.md                  | 2 ++
 docs/BROWSERFORCE_AGENT.md | 3 +++
 2 files changed, 5 insertions(+)

diff --git a/README.md b/README.md
index 7245812..7fa0e60 100644
--- a/README.md
+++ b/README.md
@@ -418,6 +418,8 @@ Port/auth bootstrap:
 
 - `agent start` picks a loopback port. If `BF_CHATD_PORT` is set and free, it is used.
 - If that port is unavailable, BrowserForce falls back to the first free port in `19280-19320`.
+- By default, `agent start` runs Codex in `~/.browserforce/agent-cwd` (override with `BF_CHATD_CODEX_CWD`) so project-specific `AGENTS.md` does not leak into side-panel runs.
+- `agent start` syncs a managed BrowserForce `AGENTS.md` into that cwd. If you want full manual control, replace that file and remove the managed header line.
 - The daemon writes `~/.browserforce/chatd-url.json` (`{ port, token }`, mode `0600`).
 - Side-panel JS reads relay URL from extension storage, calls relay `GET /chatd-url` (extension-origin gated), then connects directly to chatd with Bearer auth.
 
diff --git a/docs/BROWSERFORCE_AGENT.md b/docs/BROWSERFORCE_AGENT.md
index b4bb030..54b0862 100644
--- a/docs/BROWSERFORCE_AGENT.md
+++ b/docs/BROWSERFORCE_AGENT.md
@@ -110,6 +110,9 @@ Optional external config:
   - Overrides `chatd-url.json` path.
 - `BF_CHATD_LOCK_PATH`
   - Overrides lock file path used by `browserforce agent start|status|stop`.
+- `BF_CHATD_CODEX_CWD`
+  - Working directory for `codex exec --json` runs. Defaults to `~/.browserforce/agent-cwd` when started via `browserforce agent start`.
+  - `agent start` syncs a managed BrowserForce `AGENTS.md` into this directory (unless a custom unmanaged `AGENTS.md` is already present).
 - `BF_CHATD_CODEX_COMMAND`
   - Codex binary/command used by chatd (default `codex`).
 - `BF_CHATD_MODEL_LIST_TIMEOUT_MS`

From c256ee7ecb1836a5a3f2cbbb33619d74cf49d5a2 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 12:16:58 +0530
Subject: [PATCH 187/192] agent: minimize resume prompt overhead for AGENTS
 instructions

- keep full AGENTS.md injection for fresh runs only
- switch resume turns to a one-line generic system-instruction reminder
- avoid AGENTS.md disk reads on resume path by loading only for non-resume runs
- update chatd API tests to validate new resume reminder text and behavior
---
 agent/src/chatd.js           |  55 ++++++++++++++--
 test/agent/chatd-api.test.js | 123 +++++++++++++++++++++++++++++++++++
 2 files changed, 173 insertions(+), 5 deletions(-)

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 4b6091d..771ba6a 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -942,6 +942,43 @@ function buildRunPrompt({ message, browserContext }) {
   return lines.join('\n');
 }
 
+async function loadAgentsInstructions(codexCwd) {
+  const base = String(codexCwd || '').trim();
+  if (!base) return '';
+  const path = join(base, 'AGENTS.md');
+  try {
+    const raw = await fs.readFile(path, 'utf8');
+    return String(raw || '').trim();
+  } catch (error) {
+    if (error && error.code === 'ENOENT') return '';
+    throw error;
+  }
+}
+
+function buildPromptWithAgents({ message, browserContext, agentsInstructions }) {
+  const prompt = buildRunPrompt({ message, browserContext });
+  const agents = String(agentsInstructions || '').trim();
+  if (!agents) return prompt;
+  return [
+    'System instructions from AGENTS.md (highest priority):',
+    '',
+    agents,
+    '',
+    '---',
+    '',
+    prompt,
+  ].join('\n');
+}
+
+function buildPromptWithAgentsReminder({ message, browserContext }) {
+  const prompt = buildRunPrompt({ message, browserContext });
+  return [
+    'System reminder: follow the previously established system instructions for this thread.',
+    '',
+    prompt,
+  ].join('\n');
+}
+
 async function readJsonBody(req) {
   const chunks = [];
   for await (const chunk of req) chunks.push(chunk);
@@ -1000,8 +1037,9 @@ export async function startChatd(opts = {}) {
   const token = opts.token || process.env.BF_CHATD_TOKEN || randomBytes(32).toString('base64url');
   const chatdUrlPath = opts.chatdUrlPath || process.env.BF_CHATD_URL_PATH || CHATD_URL_PATH;
   const envCodexCwd = String(process.env.BF_CHATD_CODEX_CWD || '').trim();
+  const codexCwd = opts.codexCwd || envCodexCwd || process.cwd();
   const runExecutor = opts.runExecutor || createDefaultRunExecutor({
-    codexCwd: opts.codexCwd || envCodexCwd || process.cwd(),
+    codexCwd,
   });
   const modelFetcher = opts.modelFetcher || (() => fetchCodexModelCatalog({
     command: opts.codexCommand || process.env.BF_CHATD_CODEX_COMMAND || 'codex',
@@ -1315,7 +1353,16 @@ export async function startChatd(opts = {}) {
           return;
         }
         const browserContext = normalizeBrowserContext(body?.browserContext);
-        const promptMessage = buildRunPrompt({ message, browserContext });
+        const resumeSessionId = isValidSessionId(session?.providerState?.codex?.sessionId || '')
+          ? session.providerState.codex.sessionId
+          : null;
+        const promptMessage = resumeSessionId
+          ? buildPromptWithAgentsReminder({ message, browserContext })
+          : buildPromptWithAgents({
+            message,
+            browserContext,
+            agentsInstructions: await loadAgentsInstructions(codexCwd),
+          });
         const runReasoningEffort = resolveEffectiveReasoningEffort(
           session.reasoningEffort,
           configuredReasoningEffort,
@@ -1334,9 +1381,7 @@ export async function startChatd(opts = {}) {
           queue: Promise.resolve(),
           lastError: null,
           resumeRetryAttempted: false,
-          resumeSessionId: isValidSessionId(session?.providerState?.codex?.sessionId || '')
-            ? session.providerState.codex.sessionId
-            : null,
+          resumeSessionId,
           reasoningEffort: runReasoningEffort,
         };
 
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index 5dd2ad7..cecfc8c 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -618,6 +618,129 @@ test('POST /v1/runs includes active tab context in runExecutor prompt', async ()
   }
 });
 
+test('POST /v1/runs injects AGENTS.md content as system instructions', async () => {
+  const codexCwd = mkdtempSync(join(tmpdir(), 'bf-chatd-codex-cwd-'));
+  writeFileSync(join(codexCwd, 'AGENTS.md'), '# Agent Rules\nAlways be explicit.', 'utf8');
+
+  const seenRuns = [];
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    codexCwd,
+    runExecutor: ({ runId, sessionId, message, onExit }) => {
+      seenRuns.push({ runId, sessionId, message });
+      setTimeout(() => onExit({ code: 0 }), 5);
+      return { abort() {} };
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'agents-instructions' }),
+    }).then((res) => res.json());
+
+    const runRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({
+        sessionId: created.sessionId,
+        message: 'what should we do next?',
+      }),
+    });
+    assert.equal(runRes.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 20));
+
+    const prompt = seenRuns.at(-1)?.message || '';
+    assert.match(prompt, /System instructions from AGENTS\.md/);
+    assert.match(prompt, /Always be explicit\./);
+    assert.match(prompt, /what should we do next\?/i);
+  } finally {
+    await daemon.stop();
+    rmSync(codexCwd, { recursive: true, force: true });
+  }
+});
+
+test('POST /v1/runs uses one-line AGENTS reminder on resume runs', async () => {
+  const codexCwd = mkdtempSync(join(tmpdir(), 'bf-chatd-codex-cwd-'));
+  writeFileSync(join(codexCwd, 'AGENTS.md'), '# Agent Rules\nAlways be explicit.', 'utf8');
+  const providerSessionId = '019caa6f-8c63-7c81-a542-3dbcf922d065';
+
+  const seenRuns = [];
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    codexCwd,
+    runExecutor: ({ runId, sessionId, message, resumeSessionId, onEvent, onExit }) => {
+      seenRuns.push({ runId, sessionId, message, resumeSessionId: resumeSessionId || null });
+      setTimeout(() => {
+        onEvent({
+          event: 'run.provider_session',
+          runId,
+          sessionId,
+          payload: { provider: 'codex', sessionId: providerSessionId },
+        });
+      }, 5);
+      setTimeout(() => {
+        onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'ok' } });
+      }, 10);
+      setTimeout(() => onExit({ code: 0 }), 15);
+      return { abort() {} };
+    },
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'agents-reminder' }),
+    }).then((res) => res.json());
+
+    const runOneRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'first' }),
+    });
+    assert.equal(runOneRes.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 60));
+
+    const runTwoRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'second' }),
+    });
+    assert.equal(runTwoRes.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 60));
+
+    assert.equal(seenRuns.length >= 2, true);
+    assert.equal(seenRuns[0].resumeSessionId, null);
+    assert.match(seenRuns[0].message || '', /System instructions from AGENTS\.md/);
+    assert.match(seenRuns[0].message || '', /Always be explicit\./);
+    assert.equal(seenRuns[1].resumeSessionId, providerSessionId);
+    assert.match(seenRuns[1].message || '', /System reminder: follow the previously established system instructions for this thread\./);
+    assert.doesNotMatch(seenRuns[1].message || '', /Always be explicit\./);
+  } finally {
+    await daemon.stop();
+    rmSync(codexCwd, { recursive: true, force: true });
+  }
+});
+
 test('POST /v1/runs reuses codex provider session id on second turn', async () => {
   const observed = [];
   const providerSessionId = '019caa6f-8c63-7c81-a542-3dbcf922d065';

From 214a0a16182c7ed2bdfd9716f9bab5f3d3bb5305 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 12:46:48 +0530
Subject: [PATCH 188/192] agent: allow codex runs outside git worktrees

- add --skip-git-repo-check to generated codex exec/resume args
- fix agent-cwd launches that fail with trusted-directory git checks
- update codex runner argument tests for model/reasoning/resume/no-model paths
---
 agent/src/codex-runner.js       | 1 +
 test/agent/codex-runner.test.js | 7 ++++---
 2 files changed, 5 insertions(+), 3 deletions(-)

diff --git a/agent/src/codex-runner.js b/agent/src/codex-runner.js
index 2cbf9cb..685bce6 100644
--- a/agent/src/codex-runner.js
+++ b/agent/src/codex-runner.js
@@ -486,6 +486,7 @@ export function buildCodexExecArgs({ prompt, model, reasoningEffort, args, resum
   const resolved = resumeId
     ? ['exec', 'resume', resumeId, '--json']
     : ['exec', '--json'];
+  resolved.push('--skip-git-repo-check');
   if (typeof model === 'string' && model.trim()) {
     resolved.push('--model', model.trim());
   }
diff --git a/test/agent/codex-runner.test.js b/test/agent/codex-runner.test.js
index a78e4ef..0c1483d 100644
--- a/test/agent/codex-runner.test.js
+++ b/test/agent/codex-runner.test.js
@@ -41,12 +41,12 @@ test('maps codex item.completed agent_message to chat.delta (not premature final
 
 test('buildCodexExecArgs includes --model when session model is set', () => {
   const args = buildCodexExecArgs({ prompt: 'hi', model: 'gpt-5' });
-  assert.deepEqual(args, ['exec', '--json', '--model', 'gpt-5', 'hi']);
+  assert.deepEqual(args, ['exec', '--json', '--skip-git-repo-check', '--model', 'gpt-5', 'hi']);
 });
 
 test('buildCodexExecArgs includes reasoning effort override when set', () => {
   const args = buildCodexExecArgs({ prompt: 'hi', reasoningEffort: 'medium' });
-  assert.deepEqual(args, ['exec', '--json', '-c', 'model_reasoning_effort="medium"', 'hi']);
+  assert.deepEqual(args, ['exec', '--json', '--skip-git-repo-check', '-c', 'model_reasoning_effort="medium"', 'hi']);
 });
 
 test('buildCodexExecArgs emits resume invocation when codex session id is provided', () => {
@@ -60,6 +60,7 @@ test('buildCodexExecArgs emits resume invocation when codex session id is provid
     'resume',
     '019caa6f-8c63-7c81-a542-3dbcf922d065',
     '--json',
+    '--skip-git-repo-check',
     '--model',
     'gpt-5',
     'hi',
@@ -68,7 +69,7 @@ test('buildCodexExecArgs emits resume invocation when codex session id is provid
 
 test('buildCodexExecArgs omits --model when model is empty', () => {
   const args = buildCodexExecArgs({ prompt: 'hi', model: '' });
-  assert.deepEqual(args, ['exec', '--json', 'hi']);
+  assert.deepEqual(args, ['exec', '--json', '--skip-git-repo-check', 'hi']);
 });
 
 test('maps transient codex error line to non-fatal tool event', () => {

From 0506239603baa8187a4a6f79a58b839acaba38c2 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 12:46:55 +0530
Subject: [PATCH 189/192] agent-panel: surface run failures in transcript and
 status

- persist run.error as assistant transcript timeline so failures remain visible after terminal runs
- update panel status from SSE lifecycle events (run.started/run.error/chat.final/run.aborted)
- add state test coverage to assert failed runs create visible transcript output
---
 extension/agent-panel-state.js | 19 ++++++++++++++++++-
 extension/agent-panel.js       | 11 +++++++++++
 test/agent/sse-events.test.js  |  5 +++++
 3 files changed, 34 insertions(+), 1 deletion(-)

diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index 4a48b2d..ba86b25 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -810,14 +810,31 @@ export function applyEvent(state = initialState, evt = {}) {
       status: 'failed',
       label: `Failed: ${error}`,
     };
+    const timeline = pushTimelineEntry(run, { type: 'step', ...step });
+    const currentMessages = state.messagesBySession[evt.sessionId] || [];
+    const hasStoredFinal = currentMessages.some(
+      (message) => message.runId === evt.runId && message.role === 'assistant',
+    );
+    const nextMessages = (!hasStoredFinal && (timeline.length > 0 || error))
+      ? [...currentMessages, {
+        role: 'assistant',
+        text: '',
+        runId: evt.runId,
+        timeline,
+      }]
+      : currentMessages;
     return {
       ...state,
+      messagesBySession: {
+        ...state.messagesBySession,
+        [evt.sessionId]: nextMessages,
+      },
       runs: upsertRun(state, evt.runId, {
         sessionId: evt.sessionId,
         done: true,
         error,
         steps: pushStep(run, step),
-        timeline: pushTimelineEntry(run, { type: 'step', ...step }),
+        timeline,
       }),
     };
   }
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 3464ba2..62325fd 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -268,6 +268,17 @@ function dispatch(action) {
 
 function applyIncomingEvent(evt) {
   state.value = applyEvent(state.value, evt);
+  const isActiveSessionEvent = evt?.sessionId && evt.sessionId === state.value.activeSessionId;
+  if (isActiveSessionEvent && evt?.event === 'run.started') {
+    setStatus('ready', 'Ready');
+  }
+  if (isActiveSessionEvent && evt?.event === 'run.error') {
+    const errorText = evt?.payload?.error || 'Run failed';
+    setStatus('error', `Run failed: ${errorText}`);
+  }
+  if (isActiveSessionEvent && (evt?.event === 'chat.final' || evt?.event === 'run.aborted')) {
+    setStatus('ready', 'Ready');
+  }
   if (evt?.event === 'run.started' && evt.sessionId && evt.runId) {
     state.currentRunBySession = assignSessionRunId(state.currentRunBySession, evt.sessionId, evt.runId);
   }
diff --git a/test/agent/sse-events.test.js b/test/agent/sse-events.test.js
index f0e116e..580033f 100644
--- a/test/agent/sse-events.test.js
+++ b/test/agent/sse-events.test.js
@@ -230,6 +230,11 @@ test('run.error appends a final failed step', () => {
   const last = s2.runs.r1.steps.at(-1);
   assert.equal(last.status, 'failed');
   assert.match(last.label, /boom/);
+  const message = s2.messagesBySession.s1?.at(-1);
+  assert.equal(message?.role, 'assistant');
+  assert.equal(message?.runId, 'r1');
+  assert.equal(Array.isArray(message?.timeline), true);
+  assert.equal(message.timeline.some((item) => item.type === 'step' && item.status === 'failed'), true);
 });
 
 test('run.event is converted into a visible in-flight step', () => {

From 709bc35b92624a48c347aede30d0b393d565b6fb Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 15:10:32 +0530
Subject: [PATCH 190/192] feat(agent): add provider runtime with codex+claude
 adapters

---
 agent/src/chatd.js                            | 258 +++++--------
 agent/src/provider-constants.js               |   2 +
 agent/src/providers/claude-provider.js        | 345 ++++++++++++++++++
 agent/src/providers/codex-provider.js         | 176 +++++++++
 agent/src/providers/index.js                  |  77 ++++
 agent/src/session-store.js                    |  99 +++--
 package.json                                  |   2 +-
 test/agent/chatd-api.test.js                  | 246 ++++++++++++-
 test/agent/claude-protocol-contract.test.js   |  50 +++
 test/agent/claude-provider.test.js            |  70 ++++
 .../fixtures/claude-jsonl-resume-run.sample   |   3 +
 .../fixtures/claude-jsonl-start-run.sample    |   4 +
 test/agent/session-store.test.js              |  45 +++
 13 files changed, 1185 insertions(+), 192 deletions(-)
 create mode 100644 agent/src/provider-constants.js
 create mode 100644 agent/src/providers/claude-provider.js
 create mode 100644 agent/src/providers/codex-provider.js
 create mode 100644 agent/src/providers/index.js
 create mode 100644 test/agent/claude-protocol-contract.test.js
 create mode 100644 test/agent/claude-provider.test.js
 create mode 100644 test/agent/fixtures/claude-jsonl-resume-run.sample
 create mode 100644 test/agent/fixtures/claude-jsonl-start-run.sample

diff --git a/agent/src/chatd.js b/agent/src/chatd.js
index 771ba6a..4b3c3a9 100644
--- a/agent/src/chatd.js
+++ b/agent/src/chatd.js
@@ -1,5 +1,4 @@
 import http from 'node:http';
-import { spawn } from 'node:child_process';
 import { randomBytes } from 'node:crypto';
 import { promises as fs } from 'node:fs';
 import { homedir, tmpdir } from 'node:os';
@@ -8,12 +7,19 @@ import { fileURLToPath } from 'node:url';
 
 import { pickChatdPort } from './port-resolver.js';
 import { isAllowedOrigin, verifyBearer } from './auth.js';
-import { startCodexRun } from './codex-runner.js';
+import {
+  createProviderRegistry,
+  DEFAULT_PROVIDER,
+  isAllowedProvider,
+  normalizeProvider,
+  resolveSessionProvider,
+} from './providers/index.js';
 import {
   appendMessage,
   createSession,
   getSession,
   isValidModelId,
+  isValidProviderSessionId,
   isValidReasoningEffort,
   isValidSessionId,
   listSessions,
@@ -24,7 +30,6 @@ import {
 const BF_DIR = join(homedir(), '.browserforce');
 const CHATD_URL_PATH = join(BF_DIR, 'chatd-url.json');
 const CODEX_CONFIG_PATH = join(homedir(), '.codex', 'config.toml');
-const MODEL_LIST_TIMEOUT_MS = 5000;
 const DEFAULT_REASONING_EFFORT = 'medium';
 const LOCAL_FILE_MAX_BYTES = 15 * 1024 * 1024;
 const LOCAL_IMAGE_CONTENT_TYPES = {
@@ -95,141 +100,20 @@ function dedupeModelRows(rows) {
   return out;
 }
 
-function safeParseJsonLine(line) {
-  if (typeof line !== 'string') return null;
-  try {
-    return JSON.parse(line);
-  } catch {
-    return null;
-  }
-}
-
-function normalizeModelCatalogRows(models) {
-  return (Array.isArray(models) ? models : [])
-    .filter((row) => row && typeof row === 'object' && !row.hidden)
-    .map((row) => {
-      const value = String(row.model || row.id || '').trim();
-      const label = String(row.displayName || row.model || row.id || '').trim();
-      if (!value || !isValidModelId(value)) return null;
-      return { value, label: label || value };
-    })
-    .filter(Boolean);
-}
-
-async function fetchCodexModelCatalog({
-  command = process.env.BF_CHATD_CODEX_COMMAND || 'codex',
-  timeoutMs = MODEL_LIST_TIMEOUT_MS,
-} = {}) {
-  return new Promise((resolve, reject) => {
-    const child = spawn(command, ['app-server', '--listen', 'stdio://'], {
-      stdio: ['pipe', 'pipe', 'pipe'],
-      env: process.env,
-    });
-
-    let settled = false;
-    let stderrText = '';
-    let stdoutBuffer = '';
-
-    const finish = (error, models = []) => {
-      if (settled) return;
-      settled = true;
-      clearTimeout(timer);
-      try { child.kill('SIGTERM'); } catch {}
-      if (error) reject(error);
-      else resolve(models);
-    };
-
-    const timer = setTimeout(() => {
-      finish(new Error('Timed out while loading Codex models'));
-    }, timeoutMs);
-
-    child.stderr.setEncoding('utf8');
-    child.stderr.on('data', (chunk) => {
-      stderrText += String(chunk || '');
-    });
-
-    child.stdout.setEncoding('utf8');
-    child.stdout.on('data', (chunk) => {
-      stdoutBuffer += String(chunk || '');
-      let idx = stdoutBuffer.indexOf('\n');
-      while (idx !== -1) {
-        const line = stdoutBuffer.slice(0, idx).trim();
-        stdoutBuffer = stdoutBuffer.slice(idx + 1);
-        idx = stdoutBuffer.indexOf('\n');
-        if (!line) continue;
-
-        const msg = safeParseJsonLine(line);
-        if (!msg || typeof msg !== 'object') continue;
-
-        if (msg.id === 1 && msg.error) {
-          finish(new Error(msg.error?.message || 'Codex initialize failed'));
-          return;
-        }
-        if (msg.id === 1 && msg.result) {
-          try {
-            child.stdin.write(`${JSON.stringify({ jsonrpc: '2.0', method: 'initialized' })}\n`);
-            child.stdin.write(`${JSON.stringify({
-              jsonrpc: '2.0',
-              id: 2,
-              method: 'model/list',
-              params: { includeHidden: false, limit: 100 },
-            })}\n`);
-          } catch {
-            finish(new Error('Failed to request Codex model list'));
-          }
-          continue;
-        }
-
-        if (msg.id === 2 && msg.error) {
-          finish(new Error(msg.error?.message || 'Codex model/list failed'));
-          return;
-        }
-
-        if (msg.id === 2 && msg.result) {
-          finish(null, msg.result?.data || []);
-        }
-      }
-    });
-
-    child.on('error', (error) => {
-      finish(error);
-    });
-
-    child.on('exit', (code) => {
-      if (settled) return;
-      finish(new Error(`Codex app-server exited before model/list (${code ?? 'unknown'}) ${stderrText}`.trim()));
-    });
-
-    try {
-      child.stdin.write(`${JSON.stringify({
-        jsonrpc: '2.0',
-        id: 1,
-        method: 'initialize',
-        params: {
-          clientInfo: { name: 'browserforce-chatd', version: '1.0.0' },
-          capabilities: { experimentalApi: false },
-        },
-      })}\n`);
-    } catch {
-      finish(new Error('Failed to initialize Codex app-server'));
-    }
-  });
-}
-
-async function listModelPresets({ storageRoot, modelFetcher } = {}) {
+async function listModelPresets({ storageRoot, providerId, provider } = {}) {
   let liveRows = [];
-  if (typeof modelFetcher === 'function') {
+  if (provider && typeof provider.listModels === 'function') {
     try {
-      const liveModels = await modelFetcher();
-      liveRows = normalizeModelCatalogRows(liveModels);
+      liveRows = await provider.listModels();
     } catch {
       liveRows = [];
     }
   }
 
-  const configuredModel = await resolveConfiguredModel();
+  const configuredModel = providerId === 'codex' ? await resolveConfiguredModel() : null;
   const sessions = await listSessions({ limit: 200, storageRoot });
   const sessionRows = sessions
+    .filter((session) => resolveSessionProvider(session) === providerId)
     .map((session) => String(session?.model || '').trim())
     .filter(Boolean)
     .map((value) => ({ value, label: value }));
@@ -1013,21 +897,6 @@ async function clearChatdUrlFile({ writeChatdUrl = true, urlPath = CHATD_URL_PAT
   }
 }
 
-function createDefaultRunExecutor({ codexCwd } = {}) {
-  return ({ runId, sessionId, message, model, reasoningEffort, resumeSessionId, onEvent, onExit, onError }) => startCodexRun({
-    runId,
-    sessionId,
-    prompt: message,
-    model,
-    reasoningEffort,
-    resumeSessionId,
-    cwd: codexCwd,
-    onEvent,
-    onExit,
-    onError,
-  });
-}
-
 export async function startChatd(opts = {}) {
   const writeChatdUrl = opts.writeChatdUrl !== false;
   const ephemeralStorageRoot = (!opts.storageRoot && !writeChatdUrl)
@@ -1038,13 +907,18 @@ export async function startChatd(opts = {}) {
   const chatdUrlPath = opts.chatdUrlPath || process.env.BF_CHATD_URL_PATH || CHATD_URL_PATH;
   const envCodexCwd = String(process.env.BF_CHATD_CODEX_CWD || '').trim();
   const codexCwd = opts.codexCwd || envCodexCwd || process.cwd();
-  const runExecutor = opts.runExecutor || createDefaultRunExecutor({
+  const providerRegistry = opts.providerRegistry || createProviderRegistry({
     codexCwd,
+    runExecutor: opts.runExecutor,
+    modelFetcher: opts.modelFetcher,
+    codexCommand: opts.codexCommand || process.env.BF_CHATD_CODEX_COMMAND || 'codex',
+    claudeCommand: opts.claudeCommand || process.env.BF_CHATD_CLAUDE_COMMAND || 'claude',
+    codexRunExecutor: opts.codexRunExecutor,
+    claudeRunExecutor: opts.claudeRunExecutor,
+    codexModelFetcher: opts.codexModelFetcher,
+    claudeModelFetcher: opts.claudeModelFetcher,
+    providerOverrides: opts.providerOverrides,
   });
-  const modelFetcher = opts.modelFetcher || (() => fetchCodexModelCatalog({
-    command: opts.codexCommand || process.env.BF_CHATD_CODEX_COMMAND || 'codex',
-    timeoutMs: Number(process.env.BF_CHATD_MODEL_LIST_TIMEOUT_MS || MODEL_LIST_TIMEOUT_MS),
-  }));
   const configuredReasoningEffort = resolveEffectiveReasoningEffort(
     opts.defaultReasoningEffort,
     await resolveConfiguredReasoningEffort(),
@@ -1198,9 +1072,36 @@ export async function startChatd(opts = {}) {
         return;
       }
 
+      if (url.pathname === '/v1/providers' && req.method === 'GET') {
+        json(res, 200, {
+          defaultProvider: DEFAULT_PROVIDER,
+          providers: providerRegistry.listProviders(),
+        });
+        return;
+      }
+
       if (url.pathname === '/v1/models' && req.method === 'GET') {
-        const models = await listModelPresets({ storageRoot, modelFetcher });
-        json(res, 200, { models, defaultReasoningEffort: configuredReasoningEffort });
+        const providerParam = url.searchParams.get('provider');
+        const providerId = providerParam == null || providerParam === ''
+          ? DEFAULT_PROVIDER
+          : normalizeProvider(providerParam, null);
+        if (!providerId || !isAllowedProvider(providerId)) {
+          json(res, 400, { error: 'provider is invalid' });
+          return;
+        }
+        const provider = providerRegistry.getProvider(providerId);
+        if (!provider) {
+          json(res, 400, { error: `provider is unavailable: ${providerId}` });
+          return;
+        }
+        const models = await listModelPresets({ storageRoot, providerId, provider });
+        json(res, 200, {
+          provider: providerId,
+          defaultProvider: DEFAULT_PROVIDER,
+          providers: providerRegistry.listProviders(),
+          models,
+          defaultReasoningEffort: configuredReasoningEffort,
+        });
         return;
       }
 
@@ -1216,6 +1117,7 @@ export async function startChatd(opts = {}) {
           const session = await createSession({
             title: body.title || 'New chat',
             model: body.model ?? null,
+            provider: body.provider ?? null,
             reasoningEffort: body.reasoningEffort ?? null,
             storageRoot,
           });
@@ -1263,6 +1165,7 @@ export async function startChatd(opts = {}) {
             patch: {
               ...(Object.prototype.hasOwnProperty.call(body, 'title') ? { title: body.title } : {}),
               ...(Object.prototype.hasOwnProperty.call(body, 'model') ? { model: body.model } : {}),
+              ...(Object.prototype.hasOwnProperty.call(body, 'provider') ? { provider: body.provider } : {}),
               ...(Object.prototype.hasOwnProperty.call(body, 'reasoningEffort') ? { reasoningEffort: body.reasoningEffort } : {}),
             },
             storageRoot,
@@ -1352,9 +1255,20 @@ export async function startChatd(opts = {}) {
           json(res, 404, { error: 'Session not found' });
           return;
         }
+        const providerId = resolveSessionProvider(session);
+        const provider = providerRegistry.getProvider(providerId);
+        if (!provider) {
+          json(res, 400, { error: `provider is unavailable: ${providerId}` });
+          return;
+        }
         const browserContext = normalizeBrowserContext(body?.browserContext);
-        const resumeSessionId = isValidSessionId(session?.providerState?.codex?.sessionId || '')
-          ? session.providerState.codex.sessionId
+        const providerSessionId = String(session?.providerState?.[providerId]?.sessionId || '').trim();
+        const resumeSessionId = (
+          providerId === 'codex'
+            ? isValidSessionId(providerSessionId)
+            : isValidProviderSessionId(providerSessionId)
+        )
+          ? providerSessionId
           : null;
         const promptMessage = resumeSessionId
           ? buildPromptWithAgentsReminder({ message, browserContext })
@@ -1372,6 +1286,7 @@ export async function startChatd(opts = {}) {
         const run = {
           runId,
           sessionId,
+          provider: providerId,
           status: 'running',
           abort: null,
           assistantBuffer: '',
@@ -1393,7 +1308,7 @@ export async function startChatd(opts = {}) {
           await appendMessage({ sessionId, role: 'user', text: message, storageRoot });
           runs.set(runId, run);
 
-          const startAttempt = (resumeSessionId) => runExecutor({
+          const startAttempt = (resumeSessionId) => provider.startRun({
             runId,
             sessionId,
             message: promptMessage,
@@ -1431,18 +1346,33 @@ export async function startChatd(opts = {}) {
                 }
 
                 if (evt.event === 'run.provider_session') {
-                  const provider = String(evt.payload?.provider || '').trim().toLowerCase();
+                  const eventProvider = normalizeProvider(evt.payload?.provider, active.provider);
                   const providerSessionId = String(evt.payload?.sessionId || '').trim();
-                  if (provider === 'codex' && isValidSessionId(providerSessionId)) {
+                  const validProviderSessionId = eventProvider
+                    ? (
+                      eventProvider === 'codex'
+                        ? isValidSessionId(providerSessionId)
+                        : isValidProviderSessionId(providerSessionId)
+                    )
+                    : false;
+                  if (eventProvider && validProviderSessionId) {
                     await updateSession({
                       sessionId,
                       patch: {
-                        providerState: { codex: { sessionId: providerSessionId } },
+                        providerState: { [eventProvider]: { sessionId: providerSessionId } },
                       },
                       storageRoot,
                     });
                   }
-                  broadcast(buildEvent({ event: 'run.provider_session', runId, sessionId, payload: evt.payload }));
+                  broadcast(buildEvent({
+                    event: 'run.provider_session',
+                    runId,
+                    sessionId,
+                    payload: {
+                      ...(evt.payload || {}),
+                      provider: eventProvider || active.provider,
+                    },
+                  }));
                   return;
                 }
 
@@ -1452,7 +1382,7 @@ export async function startChatd(opts = {}) {
                     await updateSession({
                       sessionId,
                       patch: {
-                        providerState: { codex: { latestUsage: usage } },
+                        providerState: { [active.provider]: { latestUsage: usage } },
                       },
                       storageRoot,
                     });
@@ -1464,7 +1394,7 @@ export async function startChatd(opts = {}) {
                 if (evt.event === 'run.error') {
                   trackRunStep(active, evt);
                   active.lastError = evt.payload?.error || 'Run failed';
-                  if (!active.resumeSessionId || active.resumeRetryAttempted) {
+                  if (active.provider !== 'codex' || !active.resumeSessionId || active.resumeRetryAttempted) {
                     failRun(active, active.lastError);
                   }
                   return;
@@ -1486,7 +1416,8 @@ export async function startChatd(opts = {}) {
                 if (signal === 'SIGTERM' || active.status === 'aborted') return;
 
                 if (
-                  active.resumeSessionId
+                  active.provider === 'codex'
+                  && active.resumeSessionId
                   && !active.resumeRetryAttempted
                   && isResumeSessionInvalidFailure({ code, error: active.lastError, stderr })
                 ) {
@@ -1512,13 +1443,13 @@ export async function startChatd(opts = {}) {
                   return;
                 }
 
-                failRun(active, active.lastError || `codex exited with code ${code ?? 'unknown'}`);
+                failRun(active, active.lastError || `${active.provider} exited with code ${code ?? 'unknown'}`);
               });
             },
             onError: (error) => {
               enqueue(() => {
                 const active = runs.get(runId);
-                failRun(active, error?.message || 'Failed to start codex');
+                failRun(active, error?.message || `Failed to start ${provider.id}`);
               });
             },
           });
@@ -1532,6 +1463,7 @@ export async function startChatd(opts = {}) {
             payload: {
               message,
               model: session.model || null,
+              provider: provider.id,
               reasoningEffort: runReasoningEffort,
               browserContext,
             },
diff --git a/agent/src/provider-constants.js b/agent/src/provider-constants.js
new file mode 100644
index 0000000..d7ae4e8
--- /dev/null
+++ b/agent/src/provider-constants.js
@@ -0,0 +1,2 @@
+export const DEFAULT_PROVIDER = 'codex';
+export const PROVIDER_ALLOWLIST = Object.freeze(['codex', 'claude']);
diff --git a/agent/src/providers/claude-provider.js b/agent/src/providers/claude-provider.js
new file mode 100644
index 0000000..e7cbcff
--- /dev/null
+++ b/agent/src/providers/claude-provider.js
@@ -0,0 +1,345 @@
+import { spawn } from 'node:child_process';
+import readline from 'node:readline';
+
+import { isValidModelId } from '../session-store.js';
+
+function envelope({ event, runId, sessionId, payload }) {
+  return {
+    event,
+    runId,
+    sessionId,
+    payload: payload || {},
+    timestamp: new Date().toISOString(),
+  };
+}
+
+function safeParse(line) {
+  if (typeof line !== 'string') return null;
+  try {
+    return JSON.parse(line);
+  } catch {
+    return null;
+  }
+}
+
+function firstString(values) {
+  for (const value of values) {
+    if (typeof value === 'string' && value.trim()) return value.trim();
+  }
+  return '';
+}
+
+function toCount(value) {
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed < 0) return null;
+  return Math.round(parsed);
+}
+
+function toPositiveCount(value) {
+  const parsed = Number(value);
+  if (!Number.isFinite(parsed) || parsed <= 0) return null;
+  return Math.round(parsed);
+}
+
+function messageTextFromContent(content) {
+  if (typeof content === 'string') return content;
+  if (!Array.isArray(content)) return '';
+  const parts = [];
+  for (const item of content) {
+    if (!item || typeof item !== 'object') continue;
+    const text = firstString([item.text, item.message, item.value]);
+    if (text) parts.push(text);
+  }
+  return parts.join('');
+}
+
+function toUsagePayload(source = {}) {
+  const inputTokens = toCount(source.input_tokens ?? source.inputTokens);
+  const cachedInputTokens = toCount(
+    source.cache_read_input_tokens
+    ?? source.cached_input_tokens
+    ?? source.cachedInputTokens,
+  );
+  const outputTokens = toCount(source.output_tokens ?? source.outputTokens);
+  const reasoningOutputTokens = toCount(source.reasoning_output_tokens ?? source.reasoningOutputTokens);
+  const explicitTotalTokens = toCount(source.total_tokens ?? source.totalTokens);
+  const modelContextWindow = toPositiveCount(source.model_context_window ?? source.modelContextWindow);
+
+  const totalTokens = explicitTotalTokens != null
+    ? explicitTotalTokens
+    : ((inputTokens != null || outputTokens != null) ? (inputTokens || 0) + (outputTokens || 0) : null);
+
+  const payload = {
+    modelContextWindow,
+    totalTokens,
+    inputTokens,
+    cachedInputTokens,
+    outputTokens,
+    reasoningOutputTokens,
+  };
+  for (const [key, value] of Object.entries(payload)) {
+    if (value == null) delete payload[key];
+  }
+  return Object.keys(payload).length > 0 ? payload : null;
+}
+
+function usageFromResultPayload(payload = {}) {
+  if (payload.usage && typeof payload.usage === 'object') return toUsagePayload(payload.usage);
+  return toUsagePayload(payload);
+}
+
+function normalizeAssistantDelta(parsed = {}) {
+  const message = parsed.message && typeof parsed.message === 'object' ? parsed.message : {};
+  const text = firstString([
+    parsed.delta,
+    parsed.text,
+    message.text,
+    message.message,
+    messageTextFromContent(message.content),
+  ]);
+  return text;
+}
+
+export function normalizeClaudeLine({ runId, sessionId, line } = {}) {
+  const parsed = safeParse(line);
+  if (!parsed || typeof parsed !== 'object') {
+    const text = String(line || '').trim();
+    return text ? [envelope({ event: 'chat.delta', runId, sessionId, payload: { delta: text } })] : [];
+  }
+
+  const events = [];
+  const type = String(parsed.type || '').trim().toLowerCase();
+
+  if (type === 'system') {
+    const providerSessionId = firstString([parsed.session_id, parsed.sessionId]);
+    if (providerSessionId) {
+      events.push(envelope({
+        event: 'run.provider_session',
+        runId,
+        sessionId,
+        payload: { provider: 'claude', sessionId: providerSessionId },
+      }));
+    }
+    return events;
+  }
+
+  if (type === 'assistant' || type === 'message') {
+    const text = normalizeAssistantDelta(parsed);
+    if (text) {
+      events.push(envelope({ event: 'chat.delta', runId, sessionId, payload: { delta: text } }));
+    }
+    return events;
+  }
+
+  if (type === 'result') {
+    const providerSessionId = firstString([parsed.session_id, parsed.sessionId]);
+    if (providerSessionId) {
+      events.push(envelope({
+        event: 'run.provider_session',
+        runId,
+        sessionId,
+        payload: { provider: 'claude', sessionId: providerSessionId },
+      }));
+    }
+
+    const isError = parsed.is_error === true
+      || String(parsed.subtype || '').toLowerCase() === 'error';
+    if (isError) {
+      events.push(envelope({
+        event: 'run.error',
+        runId,
+        sessionId,
+        payload: {
+          error: firstString([parsed.error, parsed.message, parsed.result]) || 'Claude run failed',
+        },
+      }));
+      return events;
+    }
+
+    const usage = usageFromResultPayload(parsed);
+    if (usage) {
+      events.push(envelope({ event: 'run.usage', runId, sessionId, payload: usage }));
+    }
+
+    const text = firstString([
+      parsed.result,
+      parsed.text,
+      parsed.output_text,
+      messageTextFromContent(parsed.message?.content),
+    ]);
+    if (text) {
+      events.push(envelope({ event: 'chat.final', runId, sessionId, payload: { text } }));
+    }
+    return events;
+  }
+
+  if (type === 'error') {
+    events.push(envelope({
+      event: 'run.error',
+      runId,
+      sessionId,
+      payload: {
+        error: firstString([parsed.error, parsed.message]) || 'Claude run failed',
+      },
+    }));
+    return events;
+  }
+
+  events.push(envelope({ event: 'run.event', runId, sessionId, payload: parsed }));
+  return events;
+}
+
+export function buildClaudeExecArgs({ prompt, model, resumeSessionId, args } = {}) {
+  if (Array.isArray(args) && args.length > 0) return args;
+  const resolved = ['-p', '--output-format', 'stream-json'];
+  const resumeId = typeof resumeSessionId === 'string' ? resumeSessionId.trim() : '';
+  if (resumeId) {
+    resolved.push('--resume', resumeId);
+  }
+  if (typeof model === 'string' && model.trim()) {
+    resolved.push('--model', model.trim());
+  }
+  resolved.push(prompt || '');
+  return resolved;
+}
+
+function missingCommandError(command) {
+  return [
+    `Claude command not found (${command}).`,
+    'Set BF_CHATD_CLAUDE_COMMAND to the Claude CLI binary path and ensure it is installed on PATH.',
+  ].join(' ');
+}
+
+export function startClaudeRun({
+  runId,
+  sessionId,
+  prompt,
+  cwd,
+  onEvent,
+  onExit,
+  onError,
+  command,
+  args,
+  model,
+  resumeSessionId,
+  spawnImpl = spawn,
+} = {}) {
+  const cmd = command || process.env.BF_CHATD_CLAUDE_COMMAND || 'claude';
+  const argv = buildClaudeExecArgs({ prompt, model, resumeSessionId, args });
+
+  const child = spawnImpl(cmd, argv, {
+    cwd,
+    env: process.env,
+    stdio: ['ignore', 'pipe', 'pipe'],
+  });
+
+  const stderrChunks = [];
+  let closed = false;
+
+  const stdoutLines = readline.createInterface({ input: child.stdout });
+  stdoutLines.on('line', (line) => {
+    try {
+      const events = normalizeClaudeLine({ runId, sessionId, line });
+      for (const evt of events) onEvent?.(evt);
+    } catch (error) {
+      onError?.(error);
+    }
+  });
+
+  const stderrLines = readline.createInterface({ input: child.stderr });
+  stderrLines.on('line', (line) => {
+    if (!line) return;
+    stderrChunks.push(String(line));
+    if (stderrChunks.length > 200) stderrChunks.shift();
+  });
+
+  child.on('error', (error) => {
+    if (error?.code === 'ENOENT') {
+      onEvent?.(envelope({
+        event: 'run.error',
+        runId,
+        sessionId,
+        payload: { error: missingCommandError(cmd) },
+      }));
+      if (!closed) {
+        closed = true;
+        onExit?.({ code: 127, signal: null, stderr: stderrChunks.join('\n') });
+      }
+      return;
+    }
+    onError?.(error);
+  });
+
+  child.on('close', (code, signal) => {
+    if (closed) return;
+    closed = true;
+    onExit?.({ code, signal, stderr: stderrChunks.join('\n') });
+  });
+
+  return {
+    pid: child.pid,
+    abort() {
+      try {
+        child.kill('SIGTERM');
+      } catch {
+        // ignore kill races
+      }
+    },
+  };
+}
+
+function normalizeClaudeModelRows(rows) {
+  return (Array.isArray(rows) ? rows : [])
+    .filter((row) => row && typeof row === 'object' && !row.hidden)
+    .map((row) => {
+      const value = String(row.id || row.model || '').trim();
+      const label = String(row.displayName || row.name || row.id || row.model || '').trim();
+      if (!value || !isValidModelId(value)) return null;
+      return { value, label: label || value };
+    })
+    .filter(Boolean);
+}
+
+export function createClaudeProvider({
+  claudeCwd,
+  runExecutor,
+  modelFetcher,
+  claudeCommand = process.env.BF_CHATD_CLAUDE_COMMAND || 'claude',
+} = {}) {
+  return {
+    id: 'claude',
+    label: 'Claude',
+    async listModels() {
+      if (typeof modelFetcher !== 'function') return [];
+      const rows = await modelFetcher();
+      return normalizeClaudeModelRows(rows);
+    },
+    startRun({ runId, sessionId, message, model, resumeSessionId, onEvent, onExit, onError }) {
+      if (typeof runExecutor === 'function') {
+        return runExecutor({
+          provider: 'claude',
+          runId,
+          sessionId,
+          message,
+          model,
+          resumeSessionId,
+          onEvent,
+          onExit,
+          onError,
+        });
+      }
+      return startClaudeRun({
+        runId,
+        sessionId,
+        prompt: message,
+        cwd: claudeCwd,
+        model,
+        resumeSessionId,
+        command: claudeCommand,
+        onEvent,
+        onExit,
+        onError,
+      });
+    },
+  };
+}
diff --git a/agent/src/providers/codex-provider.js b/agent/src/providers/codex-provider.js
new file mode 100644
index 0000000..6da77c7
--- /dev/null
+++ b/agent/src/providers/codex-provider.js
@@ -0,0 +1,176 @@
+import { spawn } from 'node:child_process';
+
+import { startCodexRun } from '../codex-runner.js';
+import { isValidModelId } from '../session-store.js';
+
+function safeParseJsonLine(line) {
+  if (typeof line !== 'string') return null;
+  try {
+    return JSON.parse(line);
+  } catch {
+    return null;
+  }
+}
+
+export function normalizeCodexModelRows(models) {
+  return (Array.isArray(models) ? models : [])
+    .filter((row) => row && typeof row === 'object' && !row.hidden)
+    .map((row) => {
+      const value = String(row.model || row.id || '').trim();
+      const label = String(row.displayName || row.model || row.id || '').trim();
+      if (!value || !isValidModelId(value)) return null;
+      return { value, label: label || value };
+    })
+    .filter(Boolean);
+}
+
+export async function fetchCodexModelCatalog({
+  command = process.env.BF_CHATD_CODEX_COMMAND || 'codex',
+  timeoutMs = 5000,
+} = {}) {
+  return new Promise((resolve, reject) => {
+    const child = spawn(command, ['app-server', '--listen', 'stdio://'], {
+      stdio: ['pipe', 'pipe', 'pipe'],
+      env: process.env,
+    });
+
+    let settled = false;
+    let stderrText = '';
+    let stdoutBuffer = '';
+
+    const finish = (error, models = []) => {
+      if (settled) return;
+      settled = true;
+      clearTimeout(timer);
+      try { child.kill('SIGTERM'); } catch {}
+      if (error) reject(error);
+      else resolve(models);
+    };
+
+    const timer = setTimeout(() => {
+      finish(new Error('Timed out while loading Codex models'));
+    }, timeoutMs);
+
+    child.stderr.setEncoding('utf8');
+    child.stderr.on('data', (chunk) => {
+      stderrText += String(chunk || '');
+    });
+
+    child.stdout.setEncoding('utf8');
+    child.stdout.on('data', (chunk) => {
+      stdoutBuffer += String(chunk || '');
+      let idx = stdoutBuffer.indexOf('\n');
+      while (idx !== -1) {
+        const line = stdoutBuffer.slice(0, idx).trim();
+        stdoutBuffer = stdoutBuffer.slice(idx + 1);
+        idx = stdoutBuffer.indexOf('\n');
+        if (!line) continue;
+
+        const msg = safeParseJsonLine(line);
+        if (!msg || typeof msg !== 'object') continue;
+
+        if (msg.id === 1 && msg.error) {
+          finish(new Error(msg.error?.message || 'Codex initialize failed'));
+          return;
+        }
+        if (msg.id === 1 && msg.result) {
+          try {
+            child.stdin.write(`${JSON.stringify({ jsonrpc: '2.0', method: 'initialized' })}\n`);
+            child.stdin.write(`${JSON.stringify({
+              jsonrpc: '2.0',
+              id: 2,
+              method: 'model/list',
+              params: { includeHidden: false, limit: 100 },
+            })}\n`);
+          } catch {
+            finish(new Error('Failed to request Codex model list'));
+          }
+          continue;
+        }
+
+        if (msg.id === 2 && msg.error) {
+          finish(new Error(msg.error?.message || 'Codex model/list failed'));
+          return;
+        }
+
+        if (msg.id === 2 && msg.result) {
+          finish(null, msg.result?.data || []);
+        }
+      }
+    });
+
+    child.on('error', (error) => {
+      finish(error);
+    });
+
+    child.on('exit', (code) => {
+      if (settled) return;
+      finish(new Error(`Codex app-server exited before model/list (${code ?? 'unknown'}) ${stderrText}`.trim()));
+    });
+
+    try {
+      child.stdin.write(`${JSON.stringify({
+        jsonrpc: '2.0',
+        id: 1,
+        method: 'initialize',
+        params: {
+          clientInfo: { name: 'browserforce-chatd', version: '1.0.0' },
+          capabilities: { experimentalApi: false },
+        },
+      })}\n`);
+    } catch {
+      finish(new Error('Failed to initialize Codex app-server'));
+    }
+  });
+}
+
+export function createCodexProvider({
+  codexCwd,
+  runExecutor,
+  modelFetcher,
+  codexCommand = process.env.BF_CHATD_CODEX_COMMAND || 'codex',
+  modelListTimeoutMs = Number(process.env.BF_CHATD_MODEL_LIST_TIMEOUT_MS || 5000),
+} = {}) {
+  return {
+    id: 'codex',
+    label: 'Codex',
+    async listModels() {
+      if (typeof modelFetcher === 'function') {
+        const rows = await modelFetcher();
+        return normalizeCodexModelRows(rows);
+      }
+      const rows = await fetchCodexModelCatalog({ command: codexCommand, timeoutMs: modelListTimeoutMs });
+      return normalizeCodexModelRows(rows);
+    },
+    startRun({ runId, sessionId, message, model, reasoningEffort, resumeSessionId, onEvent, onExit, onError }) {
+      if (typeof runExecutor === 'function') {
+        return runExecutor({
+          provider: 'codex',
+          runId,
+          sessionId,
+          message,
+          model,
+          reasoningEffort,
+          resumeSessionId,
+          onEvent,
+          onExit,
+          onError,
+        });
+      }
+
+      return startCodexRun({
+        runId,
+        sessionId,
+        prompt: message,
+        model,
+        reasoningEffort,
+        resumeSessionId,
+        cwd: codexCwd,
+        command: codexCommand,
+        onEvent,
+        onExit,
+        onError,
+      });
+    },
+  };
+}
diff --git a/agent/src/providers/index.js b/agent/src/providers/index.js
new file mode 100644
index 0000000..0edd117
--- /dev/null
+++ b/agent/src/providers/index.js
@@ -0,0 +1,77 @@
+import { createCodexProvider } from './codex-provider.js';
+import { createClaudeProvider } from './claude-provider.js';
+import { DEFAULT_PROVIDER, PROVIDER_ALLOWLIST } from '../provider-constants.js';
+
+const PROVIDER_SET = new Set(PROVIDER_ALLOWLIST);
+
+export { DEFAULT_PROVIDER, PROVIDER_ALLOWLIST };
+
+function normalizeText(value) {
+  if (typeof value !== 'string') return '';
+  return value.trim().toLowerCase();
+}
+
+export function isAllowedProvider(provider) {
+  const normalized = normalizeText(provider);
+  return !!normalized && PROVIDER_SET.has(normalized);
+}
+
+export function normalizeProvider(provider, fallback = null) {
+  const normalized = normalizeText(provider);
+  if (!normalized) return fallback;
+  return isAllowedProvider(normalized) ? normalized : fallback;
+}
+
+export function resolveSessionProvider(session) {
+  return normalizeProvider(session?.provider, DEFAULT_PROVIDER);
+}
+
+export function createProviderRegistry(opts = {}) {
+  const providers = new Map();
+
+  const codex = createCodexProvider({
+    codexCwd: opts.codexCwd,
+    runExecutor: opts.codexRunExecutor || opts.runExecutor,
+    modelFetcher: opts.codexModelFetcher || opts.modelFetcher,
+    codexCommand: opts.codexCommand,
+    modelListTimeoutMs: opts.modelListTimeoutMs,
+  });
+  providers.set(codex.id, codex);
+
+  const claude = createClaudeProvider({
+    claudeCwd: opts.claudeCwd || opts.codexCwd,
+    runExecutor: opts.claudeRunExecutor,
+    modelFetcher: opts.claudeModelFetcher,
+    claudeCommand: opts.claudeCommand,
+  });
+  providers.set(claude.id, claude);
+
+  if (opts.providerOverrides && typeof opts.providerOverrides === 'object') {
+    for (const [id, override] of Object.entries(opts.providerOverrides)) {
+      const normalizedId = normalizeProvider(id);
+      if (!normalizedId || !override || typeof override !== 'object') continue;
+      providers.set(normalizedId, {
+        id: normalizedId,
+        label: String(override.label || normalizedId).trim() || normalizedId,
+        ...override,
+      });
+    }
+  }
+
+  return {
+    getProvider(id) {
+      const normalized = normalizeProvider(id);
+      if (!normalized) return null;
+      return providers.get(normalized) || null;
+    },
+    listProviders() {
+      return PROVIDER_ALLOWLIST
+        .map((id) => providers.get(id))
+        .filter(Boolean)
+        .map((provider) => ({
+          id: provider.id,
+          label: provider.label || provider.id,
+        }));
+    },
+  };
+}
diff --git a/agent/src/session-store.js b/agent/src/session-store.js
index 12194dd..54a5bf3 100644
--- a/agent/src/session-store.js
+++ b/agent/src/session-store.js
@@ -2,12 +2,16 @@ import { promises as fs } from 'node:fs';
 import { homedir } from 'node:os';
 import { dirname, join } from 'node:path';
 import { randomUUID } from 'node:crypto';
+import { PROVIDER_ALLOWLIST } from './provider-constants.js';
 
 const DEFAULT_STORAGE_ROOT = join(homedir(), '.browserforce', 'agent', 'sessions');
 const INDEX_FILE = 'index.json';
 const SESSION_ID_RE = /^[A-Za-z0-9_-]{1,128}$/;
 const RUN_ID_RE = /^[A-Za-z0-9_-]{1,256}$/;
 const MODEL_ID_RE = /^[A-Za-z0-9._:/-]{1,128}$/;
+const PROVIDER_ID_RE = /^[A-Za-z0-9._:/-]{1,64}$/;
+const PROVIDER_SESSION_ID_RE = /^[A-Za-z0-9._:/-]{1,256}$/;
+export const SESSION_PROVIDERS = new Set(PROVIDER_ALLOWLIST);
 const REASONING_EFFORT_VALUES = new Set(['low', 'medium', 'high', 'xhigh']);
 const indexWriteQueues = new Map();
 
@@ -35,10 +39,20 @@ export function isValidModelId(model) {
   return typeof model === 'string' && MODEL_ID_RE.test(model);
 }
 
+export function isValidProviderId(provider) {
+  if (typeof provider !== 'string') return false;
+  const normalized = provider.trim().toLowerCase();
+  return !!normalized && PROVIDER_ID_RE.test(normalized) && SESSION_PROVIDERS.has(normalized);
+}
+
 export function isValidReasoningEffort(value) {
   return typeof value === 'string' && REASONING_EFFORT_VALUES.has(value.trim().toLowerCase());
 }
 
+export function isValidProviderSessionId(sessionId) {
+  return typeof sessionId === 'string' && PROVIDER_SESSION_ID_RE.test(sessionId);
+}
+
 function assertValidSessionId(sessionId, fnName) {
   if (!isValidSessionId(sessionId)) {
     throw new Error(`${fnName} requires a safe sessionId`);
@@ -267,19 +281,34 @@ function normalizeReasoningEffort(reasoningEffort) {
   return trimmed;
 }
 
-function normalizeUsageNumber(value, fieldName) {
+function normalizeProvider(provider) {
+  if (provider == null) return null;
+  const trimmed = String(provider).trim().toLowerCase();
+  if (!trimmed) return null;
+  if (!isValidProviderId(trimmed)) {
+    throw new Error('provider must be one of: codex, claude');
+  }
+  return trimmed;
+}
+
+function providerStatePath(provider, suffix = '') {
+  const base = `providerState.${provider}`;
+  return suffix ? `${base}.${suffix}` : base;
+}
+
+function normalizeUsageNumber(value, fieldName, provider) {
   if (value == null) return null;
   const parsed = Number(value);
   if (!Number.isFinite(parsed) || parsed < 0) {
-    throw new Error(`providerState.codex.latestUsage.${fieldName} must be a non-negative number`);
+    throw new Error(`${providerStatePath(provider, `latestUsage.${fieldName}`)} must be a non-negative number`);
   }
   return Math.round(parsed);
 }
 
-function normalizeLatestUsage(latestUsage) {
+function normalizeLatestUsage(latestUsage, provider) {
   if (latestUsage == null) return null;
   if (!isObject(latestUsage)) {
-    throw new Error('providerState.codex.latestUsage must be an object');
+    throw new Error(`${providerStatePath(provider, 'latestUsage')} must be an object`);
   }
 
   const fields = [
@@ -294,34 +323,40 @@ function normalizeLatestUsage(latestUsage) {
   const normalized = {};
   for (const field of fields) {
     if (!Object.prototype.hasOwnProperty.call(latestUsage, field)) continue;
-    const value = normalizeUsageNumber(latestUsage[field], field);
+    const value = normalizeUsageNumber(latestUsage[field], field, provider);
     if (value != null) normalized[field] = value;
   }
   return Object.keys(normalized).length > 0 ? normalized : null;
 }
 
-function normalizeCodexProviderState(patchCodex, currentCodex) {
-  if (patchCodex == null) return null;
-  if (!isObject(patchCodex)) {
-    throw new Error('providerState.codex must be an object');
+function normalizeProviderStateEntry(provider, patchState, currentState) {
+  if (patchState == null) return null;
+  if (!isObject(patchState)) {
+    throw new Error(`${providerStatePath(provider)} must be an object`);
   }
 
-  const normalized = isObject(currentCodex) ? { ...currentCodex } : {};
+  const normalized = isObject(currentState) ? { ...currentState } : {};
 
-  if (Object.prototype.hasOwnProperty.call(patchCodex, 'sessionId')) {
-    if (patchCodex.sessionId == null || String(patchCodex.sessionId).trim() === '') {
+  if (Object.prototype.hasOwnProperty.call(patchState, 'sessionId')) {
+    if (patchState.sessionId == null || String(patchState.sessionId).trim() === '') {
       delete normalized.sessionId;
     } else {
-      const sessionId = String(patchCodex.sessionId).trim();
-      if (!isValidSessionId(sessionId)) {
-        throw new Error('providerState.codex.sessionId must be a safe session id');
+      const sessionId = String(patchState.sessionId).trim();
+      const validSessionId = provider === 'codex'
+        ? isValidSessionId(sessionId)
+        : isValidProviderSessionId(sessionId);
+      if (!validSessionId) {
+        const errorLabel = provider === 'codex'
+          ? 'safe session id'
+          : 'safe provider session id';
+        throw new Error(`${providerStatePath(provider, 'sessionId')} must be a ${errorLabel}`);
       }
       normalized.sessionId = sessionId;
     }
   }
 
-  if (Object.prototype.hasOwnProperty.call(patchCodex, 'latestUsage')) {
-    const latestUsage = normalizeLatestUsage(patchCodex.latestUsage);
+  if (Object.prototype.hasOwnProperty.call(patchState, 'latestUsage')) {
+    const latestUsage = normalizeLatestUsage(patchState.latestUsage, provider);
     if (latestUsage == null) delete normalized.latestUsage;
     else normalized.latestUsage = latestUsage;
   }
@@ -333,12 +368,20 @@ function normalizeProviderState(providerStatePatch, currentProviderState) {
   if (!isObject(providerStatePatch)) {
     throw new Error('providerState must be an object');
   }
-  const normalized = isObject(currentProviderState) ? { ...currentProviderState } : {};
+  const normalized = {};
+  for (const provider of SESSION_PROVIDERS) {
+    if (isObject(currentProviderState?.[provider])) {
+      normalized[provider] = { ...currentProviderState[provider] };
+    }
+  }
 
-  if (Object.prototype.hasOwnProperty.call(providerStatePatch, 'codex')) {
-    const codex = normalizeCodexProviderState(providerStatePatch.codex, normalized.codex);
-    if (codex == null) delete normalized.codex;
-    else normalized.codex = codex;
+  for (const provider of Object.keys(providerStatePatch)) {
+    if (!isValidProviderId(provider)) {
+      throw new Error(`${providerStatePath(provider)} is not supported`);
+    }
+    const nextState = normalizeProviderStateEntry(provider, providerStatePatch[provider], normalized[provider]);
+    if (nextState == null) delete normalized[provider];
+    else normalized[provider] = nextState;
   }
 
   return Object.keys(normalized).length > 0 ? normalized : null;
@@ -350,7 +393,13 @@ function sortSessionsNewestFirst(a, b) {
   return bTs - aTs;
 }
 
-export async function createSession({ title = 'New chat', model = null, reasoningEffort = null, storageRoot } = {}) {
+export async function createSession({
+  title = 'New chat',
+  model = null,
+  provider = null,
+  reasoningEffort = null,
+  storageRoot,
+} = {}) {
   const root = resolveStorageRoot(storageRoot);
   await ensureStorageRoot(root);
 
@@ -360,6 +409,7 @@ export async function createSession({ title = 'New chat', model = null, reasonin
     sessionId,
     title,
     model: normalizeModel(model),
+    provider: normalizeProvider(provider),
     reasoningEffort: normalizeReasoningEffort(reasoningEffort),
     createdAt: now,
     updatedAt: now,
@@ -415,6 +465,9 @@ export async function updateSession({ sessionId, patch = {}, storageRoot } = {})
     if (Object.prototype.hasOwnProperty.call(patch, 'model')) {
       next.model = normalizeModel(patch.model);
     }
+    if (Object.prototype.hasOwnProperty.call(patch, 'provider')) {
+      next.provider = normalizeProvider(patch.provider);
+    }
     if (Object.prototype.hasOwnProperty.call(patch, 'reasoningEffort')) {
       next.reasoningEffort = normalizeReasoningEffort(patch.reasoningEffort);
     }
diff --git a/package.json b/package.json
index cc56975..4136281 100644
--- a/package.json
+++ b/package.json
@@ -50,7 +50,7 @@
     "relay:dev": "lsof -ti tcp:19222 | xargs kill -9 2>/dev/null; sleep 0.3; node --watch relay/src/index.js",
     "mcp": "node mcp/src/index.js",
     "postinstall": "node scripts/postinstall-openclaw.mjs",
-    "test": "node --test relay/test/relay-server.test.js && node --test mcp/test/mcp-tools.test.js && node --test mcp/test/plugin-loader.test.js && node --test mcp/test/plugin-installer.test.js && node --test mcp/test/exec-engine-plugins.test.js && node --test mcp/test/mcp-plugin-integration.test.js && node --test test/agent/port-resolver.test.js && node --test test/agent/session-store.test.js && node --test test/agent/codex-runner.test.js && node --test test/agent/chatd-api.test.js && node --test test/agent/extension-manifest.test.js && node --test test/agent/popup-contract.test.js && node --test test/agent/relay-url-reconnect-contract.test.js && node --test test/agent/agent-panel-contract.test.js && node --test test/agent/agent-panel-send-contract.test.js && node --test test/agent/session-ui-state.test.js && node --test test/agent/sse-events.test.js && node --test test/agent/auth.test.js && node --test test/agent/agent-panel-runtime.test.js && node --test test/agent/cli-agent.test.js && node --test test/cli.test.js && node --test test/postinstall.test.js",
+    "test": "node --test relay/test/relay-server.test.js && node --test mcp/test/mcp-tools.test.js && node --test mcp/test/plugin-loader.test.js && node --test mcp/test/plugin-installer.test.js && node --test mcp/test/exec-engine-plugins.test.js && node --test mcp/test/mcp-plugin-integration.test.js && node --test test/agent/port-resolver.test.js && node --test test/agent/session-store.test.js && node --test test/agent/codex-runner.test.js && node --test test/agent/claude-protocol-contract.test.js && node --test test/agent/claude-provider.test.js && node --test test/agent/chatd-api.test.js && node --test test/agent/extension-manifest.test.js && node --test test/agent/popup-contract.test.js && node --test test/agent/relay-url-reconnect-contract.test.js && node --test test/agent/agent-panel-contract.test.js && node --test test/agent/agent-panel-send-contract.test.js && node --test test/agent/session-ui-state.test.js && node --test test/agent/sse-events.test.js && node --test test/agent/auth.test.js && node --test test/agent/agent-panel-runtime.test.js && node --test test/agent/cli-agent.test.js && node --test test/cli.test.js && node --test test/postinstall.test.js",
     "test:relay": "node --test relay/test/relay-server.test.js",
     "test:mcp": "node --test mcp/test/mcp-tools.test.js && node --test mcp/test/plugin-loader.test.js && node --test mcp/test/plugin-installer.test.js && node --test mcp/test/exec-engine-plugins.test.js && node --test mcp/test/mcp-plugin-integration.test.js"
   }
diff --git a/test/agent/chatd-api.test.js b/test/agent/chatd-api.test.js
index cecfc8c..dfb030f 100644
--- a/test/agent/chatd-api.test.js
+++ b/test/agent/chatd-api.test.js
@@ -18,6 +18,29 @@ async function fetchWithRetry(url, init, attempts = 3) {
   throw lastError;
 }
 
+async function waitForProviderSessionId({
+  daemon,
+  sessionId,
+  provider,
+  timeoutMs = 2000,
+}) {
+  const deadline = Date.now() + timeoutMs;
+  while (Date.now() < deadline) {
+    const res = await fetch(`${daemon.baseUrl}/v1/sessions/${encodeURIComponent(sessionId)}`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    if (res.ok) {
+      const body = await res.json();
+      const providerSessionId = body?.providerState?.[provider]?.sessionId;
+      if (typeof providerSessionId === 'string' && providerSessionId.trim()) {
+        return providerSessionId;
+      }
+    }
+    await new Promise((resolve) => setTimeout(resolve, 20));
+  }
+  return null;
+}
+
 test('GET /health returns daemon metadata', async () => {
   const daemon = await startChatd({ port: 0, writeChatdUrl: false });
   try {
@@ -77,6 +100,66 @@ test('GET /v1/models falls back to configured model when model fetcher fails', a
   }
 });
 
+test('GET /v1/providers lists codex and claude provider allowlist', async () => {
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const res = await fetch(`${daemon.baseUrl}/v1/providers`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(res.status, 200);
+    const body = await res.json();
+    assert.equal(body.defaultProvider, 'codex');
+    assert.deepEqual(
+      (body.providers || []).map((row) => row.id),
+      ['codex', 'claude'],
+    );
+  } finally {
+    await daemon.stop();
+  }
+});
+
+test('GET /v1/models scopes rows by provider query', async () => {
+  const providerRegistry = {
+    listProviders() {
+      return [
+        {
+          id: 'codex',
+          label: 'Codex',
+          listModels: async () => [{ value: 'gpt-5', label: 'GPT-5' }],
+          startRun() { return { abort() {} }; },
+        },
+        {
+          id: 'claude',
+          label: 'Claude',
+          listModels: async () => [{ value: 'claude-3-7-sonnet', label: 'Claude 3.7 Sonnet' }],
+          startRun() { return { abort() {} }; },
+        },
+      ];
+    },
+    getProvider(id) {
+      return this.listProviders().find((provider) => provider.id === id) || null;
+    },
+  };
+
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false, providerRegistry });
+  try {
+    const res = await fetch(`${daemon.baseUrl}/v1/models?provider=claude`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(res.status, 200);
+    const body = await res.json();
+    assert.equal(body.provider, 'claude');
+    assert.equal(Array.isArray(body.providers), true);
+    assert.equal(body.providers.some((row) => row.id === 'codex'), true);
+    assert.equal(body.providers.some((row) => row.id === 'claude'), true);
+    assert.deepEqual(body.models[0], { value: null, label: 'Default' });
+    assert.deepEqual(body.models[1], { value: 'claude-3-7-sonnet', label: 'Claude 3.7 Sonnet' });
+    assert.equal(body.models.some((row) => row.value === 'gpt-5'), false);
+  } finally {
+    await daemon.stop();
+  }
+});
+
 test('POST /v1/runs requires explicit sessionId', async () => {
   const daemon = await startChatd({ port: 0, writeChatdUrl: false });
   try {
@@ -94,6 +177,47 @@ test('POST /v1/runs requires explicit sessionId', async () => {
   }
 });
 
+test('provider allowlist rejects unsupported provider ids in models and session APIs', async () => {
+  const daemon = await startChatd({ port: 0, writeChatdUrl: false });
+  try {
+    const modelsRes = await fetch(`${daemon.baseUrl}/v1/models?provider=openai`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(modelsRes.status, 400);
+
+    const createRes = await fetch(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'bad provider', provider: 'openai' }),
+    });
+    assert.equal(createRes.status, 400);
+
+    const created = await fetch(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'good provider', provider: 'codex' }),
+    }).then((res) => res.json());
+
+    const patchRes = await fetch(`${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}`, {
+      method: 'PATCH',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ provider: 'openai' }),
+    });
+    assert.equal(patchRes.status, 400);
+  } finally {
+    await daemon.stop();
+  }
+});
+
 test('GET /v1/sessions/:id/messages rejects malformed encoded id', async () => {
   const daemon = await startChatd({ port: 0, writeChatdUrl: false });
   try {
@@ -193,8 +317,8 @@ test('POST /v1/runs uses injected run executor and persists assistant output', a
     port: 0,
     writeChatdUrl: false,
     defaultReasoningEffort: 'medium',
-    runExecutor: ({ runId, sessionId, model, reasoningEffort, onEvent, onExit }) => {
-      seenRuns.push({ runId, sessionId, model, reasoningEffort });
+    runExecutor: ({ runId, sessionId, provider, model, reasoningEffort, onEvent, onExit }) => {
+      seenRuns.push({ runId, sessionId, provider, model, reasoningEffort });
       setTimeout(() => {
         onEvent({ event: 'chat.delta', runId, sessionId, payload: { delta: 'hel' } });
       }, 10);
@@ -236,6 +360,7 @@ test('POST /v1/runs uses injected run executor and persists assistant output', a
     assert.equal(runRes.status, 202);
 
     await new Promise((resolve) => setTimeout(resolve, 60));
+    assert.equal(seenRuns.at(-1)?.provider, 'codex');
     assert.equal(seenRuns.at(-1)?.model, 'gpt-5');
     assert.equal(seenRuns.at(-1)?.reasoningEffort, 'medium');
 
@@ -250,6 +375,105 @@ test('POST /v1/runs uses injected run executor and persists assistant output', a
   }
 });
 
+test('POST /v1/runs uses selected session provider for runtime and continuity', async () => {
+  const observed = [];
+  const providerRegistry = {
+    listProviders() {
+      return [
+        {
+          id: 'codex',
+          label: 'Codex',
+          listModels: async () => [],
+          startRun: ({ runId, sessionId, onEvent, onExit }) => {
+            setTimeout(() => onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'codex' } }), 5);
+            setTimeout(() => onExit({ code: 0 }), 10);
+            return { abort() {} };
+          },
+        },
+        {
+          id: 'claude',
+          label: 'Claude',
+          listModels: async () => [],
+          startRun: ({ runId, sessionId, resumeSessionId, onEvent, onExit }) => {
+            observed.push({ runId, sessionId, resumeSessionId: resumeSessionId || null });
+            setTimeout(() => {
+              onEvent({
+                event: 'run.provider_session',
+                runId,
+                sessionId,
+                payload: { provider: 'claude', sessionId: 'claude-session-002' },
+              });
+            }, 5);
+            setTimeout(() => onEvent({ event: 'chat.final', runId, sessionId, payload: { text: 'claude' } }), 10);
+            setTimeout(() => onExit({ code: 0 }), 15);
+            return { abort() {} };
+          },
+        },
+      ];
+    },
+    getProvider(id) {
+      return this.listProviders().find((provider) => provider.id === id) || null;
+    },
+  };
+
+  const daemon = await startChatd({
+    port: 0,
+    writeChatdUrl: false,
+    providerRegistry,
+  });
+
+  try {
+    const created = await fetchWithRetry(`${daemon.baseUrl}/v1/sessions`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ title: 'Claude run', provider: 'claude' }),
+    }).then((res) => res.json());
+
+    const runOne = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'first' }),
+    });
+    assert.equal(runOne.status, 202);
+    await waitForProviderSessionId({
+      daemon,
+      sessionId: created.sessionId,
+      provider: 'claude',
+    });
+
+    const runTwo = await fetch(`${daemon.baseUrl}/v1/runs`, {
+      method: 'POST',
+      headers: {
+        'content-type': 'application/json',
+        authorization: `Bearer ${daemon.token}`,
+      },
+      body: JSON.stringify({ sessionId: created.sessionId, message: 'second' }),
+    });
+    assert.equal(runTwo.status, 202);
+    await new Promise((resolve) => setTimeout(resolve, 70));
+
+    assert.equal(observed.length >= 2, true);
+    assert.equal(observed[0].resumeSessionId, null);
+    assert.equal(observed[1].resumeSessionId, 'claude-session-002');
+
+    const sessionRes = await fetch(`${daemon.baseUrl}/v1/sessions/${encodeURIComponent(created.sessionId)}`, {
+      headers: { authorization: `Bearer ${daemon.token}` },
+    });
+    assert.equal(sessionRes.status, 200);
+    const sessionBody = await sessionRes.json();
+    assert.equal(sessionBody.provider, 'claude');
+    assert.equal(sessionBody.providerState?.claude?.sessionId, 'claude-session-002');
+  } finally {
+    await daemon.stop();
+  }
+});
+
 test('POST /v1/runs uses per-session reasoning effort when configured', async () => {
   const seenRuns = [];
   const daemon = await startChatd({
@@ -715,7 +939,11 @@ test('POST /v1/runs uses one-line AGENTS reminder on resume runs', async () => {
       body: JSON.stringify({ sessionId: created.sessionId, message: 'first' }),
     });
     assert.equal(runOneRes.status, 202);
-    await new Promise((resolve) => setTimeout(resolve, 60));
+    await waitForProviderSessionId({
+      daemon,
+      sessionId: created.sessionId,
+      provider: 'codex',
+    });
 
     const runTwoRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
       method: 'POST',
@@ -785,7 +1013,11 @@ test('POST /v1/runs reuses codex provider session id on second turn', async () =
       body: JSON.stringify({ sessionId: created.sessionId, message: 'first' }),
     });
     assert.equal(runOneRes.status, 202);
-    await new Promise((resolve) => setTimeout(resolve, 60));
+    await waitForProviderSessionId({
+      daemon,
+      sessionId: created.sessionId,
+      provider: 'codex',
+    });
 
     const runTwoRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
       method: 'POST',
@@ -872,7 +1104,11 @@ test('stale resume failures retry once as fresh run when failure signature match
       body: JSON.stringify({ sessionId: created.sessionId, message: 'seed' }),
     });
     assert.equal(seedRes.status, 202);
-    await new Promise((resolve) => setTimeout(resolve, 70));
+    await waitForProviderSessionId({
+      daemon,
+      sessionId: created.sessionId,
+      provider: 'codex',
+    });
 
     const retryRes = await fetch(`${daemon.baseUrl}/v1/runs`, {
       method: 'POST',
diff --git a/test/agent/claude-protocol-contract.test.js b/test/agent/claude-protocol-contract.test.js
new file mode 100644
index 0000000..d17ddf5
--- /dev/null
+++ b/test/agent/claude-protocol-contract.test.js
@@ -0,0 +1,50 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import { readFileSync } from 'node:fs';
+import { join } from 'node:path';
+
+function readJsonlFixture(name) {
+  const path = join(process.cwd(), 'test', 'agent', 'fixtures', name);
+  const raw = readFileSync(path, 'utf8').trim();
+  return raw
+    .split('\n')
+    .filter(Boolean)
+    .map((line) => JSON.parse(line));
+}
+
+test('claude start-run fixture locks expected JSONL event shape', () => {
+  const rows = readJsonlFixture('claude-jsonl-start-run.sample');
+  assert.equal(rows.length >= 3, true);
+  assert.equal(rows[0]?.type, 'system');
+  assert.equal(rows[0]?.subtype, 'init');
+  assert.equal(typeof rows[0]?.session_id, 'string');
+
+  const assistantRows = rows.filter((row) => row?.type === 'assistant');
+  assert.equal(assistantRows.length >= 1, true);
+  assert.equal(
+    assistantRows.every((row) => Array.isArray(row?.message?.content)),
+    true,
+  );
+
+  const resultRow = rows.find((row) => row?.type === 'result');
+  assert.equal(resultRow?.subtype, 'success');
+  assert.equal(typeof resultRow?.result, 'string');
+  assert.equal(typeof resultRow?.usage?.input_tokens, 'number');
+  assert.equal(typeof resultRow?.usage?.output_tokens, 'number');
+});
+
+test('claude resume fixture locks resume continuity fields and flag contract', () => {
+  const rows = readJsonlFixture('claude-jsonl-resume-run.sample');
+  assert.equal(rows.length >= 2, true);
+  assert.equal(rows[0]?.type, 'system');
+  assert.equal(rows[0]?.subtype, 'init');
+  assert.equal(rows[0]?.resumed, true);
+  assert.equal(typeof rows[0]?.session_id, 'string');
+
+  const resultRow = rows.find((row) => row?.type === 'result');
+  assert.equal(typeof resultRow?.session_id, 'string');
+  assert.equal(resultRow?.session_id, rows[0]?.session_id);
+
+  const expectedResumeInvocation = ['-p', '--output-format', 'stream-json', '--resume', '<providerSessionId>', '<prompt>'];
+  assert.deepEqual(expectedResumeInvocation.slice(0, 4), ['-p', '--output-format', 'stream-json', '--resume']);
+});
diff --git a/test/agent/claude-provider.test.js b/test/agent/claude-provider.test.js
new file mode 100644
index 0000000..360d2b1
--- /dev/null
+++ b/test/agent/claude-provider.test.js
@@ -0,0 +1,70 @@
+import test from 'node:test';
+import assert from 'node:assert/strict';
+import { EventEmitter } from 'node:events';
+import { PassThrough } from 'node:stream';
+import { readFileSync } from 'node:fs';
+import { join } from 'node:path';
+
+import {
+  buildClaudeExecArgs,
+  normalizeClaudeLine,
+  startClaudeRun,
+} from '../../agent/src/providers/claude-provider.js';
+
+function readFixture(name) {
+  const path = join(process.cwd(), 'test', 'agent', 'fixtures', name);
+  return readFileSync(path, 'utf8').trim().split('\n').filter(Boolean);
+}
+
+test('buildClaudeExecArgs uses stream-json print mode and --resume when provided', () => {
+  const fresh = buildClaudeExecArgs({ prompt: 'hello', model: 'claude-sonnet-4-5' });
+  assert.deepEqual(fresh, ['-p', '--output-format', 'stream-json', '--model', 'claude-sonnet-4-5', 'hello']);
+
+  const resume = buildClaudeExecArgs({
+    prompt: 'hello',
+    model: 'claude-sonnet-4-5',
+    resumeSessionId: 'claude-session-resume-001',
+  });
+  assert.deepEqual(
+    resume,
+    ['-p', '--output-format', 'stream-json', '--resume', 'claude-session-resume-001', '--model', 'claude-sonnet-4-5', 'hello'],
+  );
+});
+
+test('normalizeClaudeLine maps session continuity, deltas, final text, and usage', () => {
+  const lineEvents = readFixture('claude-jsonl-start-run.sample')
+    .flatMap((line) => normalizeClaudeLine({ runId: 'r1', sessionId: 's1', line }))
+    .filter(Boolean);
+
+  assert.equal(lineEvents.some((evt) => evt.event === 'run.provider_session' && evt.payload?.provider === 'claude'), true);
+  assert.equal(lineEvents.some((evt) => evt.event === 'chat.delta'), true);
+  assert.equal(lineEvents.some((evt) => evt.event === 'chat.final'), true);
+  assert.equal(lineEvents.some((evt) => evt.event === 'run.usage'), true);
+});
+
+test('startClaudeRun maps ENOENT command failures to run.error event', async () => {
+  const child = new EventEmitter();
+  child.stdout = new PassThrough();
+  child.stderr = new PassThrough();
+  child.kill = () => {};
+  child.pid = 4242;
+
+  const events = [];
+  let exitPayload = null;
+  startClaudeRun({
+    runId: 'run-enoent',
+    sessionId: 'session-enoent',
+    prompt: 'hello',
+    spawnImpl: () => child,
+    onEvent: (evt) => events.push(evt),
+    onExit: (payload) => {
+      exitPayload = payload;
+    },
+  });
+
+  child.emit('error', Object.assign(new Error('spawn claude ENOENT'), { code: 'ENOENT' }));
+  await new Promise((resolve) => setTimeout(resolve, 10));
+
+  assert.equal(events.some((evt) => evt.event === 'run.error' && /BF_CHATD_CLAUDE_COMMAND/.test(evt.payload?.error || '')), true);
+  assert.equal(Number.isInteger(exitPayload?.code), true);
+});
diff --git a/test/agent/fixtures/claude-jsonl-resume-run.sample b/test/agent/fixtures/claude-jsonl-resume-run.sample
new file mode 100644
index 0000000..d17b048
--- /dev/null
+++ b/test/agent/fixtures/claude-jsonl-resume-run.sample
@@ -0,0 +1,3 @@
+{"type":"system","subtype":"init","session_id":"claude-session-resume-001","resumed":true}
+{"type":"assistant","message":{"id":"msg_resume_1","type":"message","role":"assistant","content":[{"type":"text","text":"Continuing from the previous thread."}]}}
+{"type":"result","subtype":"success","is_error":false,"session_id":"claude-session-resume-001","result":"Continuing from the previous thread.","usage":{"input_tokens":210,"output_tokens":33,"cache_read_input_tokens":121}}
diff --git a/test/agent/fixtures/claude-jsonl-start-run.sample b/test/agent/fixtures/claude-jsonl-start-run.sample
new file mode 100644
index 0000000..c0ab8b6
--- /dev/null
+++ b/test/agent/fixtures/claude-jsonl-start-run.sample
@@ -0,0 +1,4 @@
+{"type":"system","subtype":"init","session_id":"claude-session-start-001","model":"claude-sonnet-4-5-20250929"}
+{"type":"assistant","message":{"id":"msg_start_1","type":"message","role":"assistant","content":[{"type":"text","text":"I will inspect the active page now."}]}}
+{"type":"assistant","message":{"id":"msg_start_2","type":"message","role":"assistant","content":[{"type":"text","text":"The page heading is BrowserForce."}]}}
+{"type":"result","subtype":"success","is_error":false,"session_id":"claude-session-start-001","result":"The page heading is BrowserForce.","usage":{"input_tokens":410,"output_tokens":57,"cache_read_input_tokens":200}}
diff --git a/test/agent/session-store.test.js b/test/agent/session-store.test.js
index 39d3df5..07c7f16 100644
--- a/test/agent/session-store.test.js
+++ b/test/agent/session-store.test.js
@@ -119,6 +119,27 @@ test('updateSession persists per-session model and title', async () => {
   assert.equal(row?.reasoningEffort, 'high');
 });
 
+test('createSession persists provider when set', async () => {
+  const created = await createSession({ title: 'Claude chat', provider: 'claude', storageRoot });
+  assert.equal(created.provider, 'claude');
+
+  const rows = await listSessions({ limit: 10, storageRoot });
+  const row = rows.find((item) => item.sessionId === created.sessionId);
+  assert.equal(row?.provider, 'claude');
+});
+
+test('updateSession rejects unsupported provider ids', async () => {
+  const created = await createSession({ title: 'Provider validation', storageRoot });
+  await assert.rejects(
+    updateSession({
+      sessionId: created.sessionId,
+      patch: { provider: 'openai' },
+      storageRoot,
+    }),
+    /provider must be one of: codex, claude/i,
+  );
+});
+
 test('updateSession supports clearing reasoning effort back to config default', async () => {
   const created = await createSession({ title: 'Before', storageRoot });
   const updated = await updateSession({
@@ -157,6 +178,30 @@ test('updateSession persists codex provider session mapping', async () => {
   assert.equal(row?.providerState?.codex?.latestUsage?.totalTokens, 128125);
 });
 
+test('updateSession persists claude provider session mapping and usage', async () => {
+  const created = await createSession({ title: 'Claude continuity', provider: 'claude', storageRoot });
+  const providerSessionId = 'claude:session/123';
+  const updated = await updateSession({
+    sessionId: created.sessionId,
+    patch: {
+      providerState: {
+        claude: {
+          sessionId: providerSessionId,
+          latestUsage: {
+            inputTokens: 77,
+            outputTokens: 31,
+          },
+        },
+      },
+    },
+    storageRoot,
+  });
+
+  assert.equal(updated?.providerState?.claude?.sessionId, providerSessionId);
+  assert.equal(updated?.providerState?.claude?.latestUsage?.inputTokens, 77);
+  assert.equal(updated?.providerState?.claude?.latestUsage?.outputTokens, 31);
+});
+
 test('listSessions fails fast on corrupted index metadata', async () => {
   writeFileSync(join(storageRoot, 'index.json'), '{this-is-not-json\n', 'utf8');
   await assert.rejects(

From 9581e020c9d3f85cea3d177c19f81d9a1d4eb596 Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 15:10:42 +0530
Subject: [PATCH 191/192] feat(agent-panel): add provider selector and
 provider-scoped hydration

---
 extension/agent-panel-state.js          |  30 ++++-
 extension/agent-panel.html              |   2 +
 extension/agent-panel.js                | 147 +++++++++++++++++++++++-
 test/agent/agent-panel-contract.test.js |   9 ++
 test/agent/session-ui-state.test.js     |  66 +++++++++++
 5 files changed, 246 insertions(+), 8 deletions(-)

diff --git a/extension/agent-panel-state.js b/extension/agent-panel-state.js
index ba86b25..3e7dce1 100644
--- a/extension/agent-panel-state.js
+++ b/extension/agent-panel-state.js
@@ -630,6 +630,29 @@ function normalizeUsagePayload(payload) {
   return Object.keys(normalized).length > 0 ? normalized : null;
 }
 
+function normalizeProviderKey(value) {
+  const normalized = String(value || '').trim().toLowerCase();
+  return normalized || null;
+}
+
+function usageFromProviderState(providerState, provider) {
+  if (!providerState || typeof providerState !== 'object') return null;
+  const keys = [];
+  const preferred = normalizeProviderKey(provider);
+  if (preferred) keys.push(preferred);
+  if (!keys.includes('codex')) keys.push('codex');
+
+  for (const key of keys) {
+    const usage = normalizeUsagePayload(providerState?.[key]?.latestUsage);
+    if (usage) return usage;
+  }
+
+  const legacyTopLevelUsage = normalizeUsagePayload(providerState.latestUsage);
+  if (legacyTopLevelUsage) return legacyTopLevelUsage;
+
+  return null;
+}
+
 function normalizeStoredStep(step) {
   return normalizeStep(step);
 }
@@ -700,7 +723,7 @@ export function reduceState(state = initialState, action = {}) {
   }
 
   if (action.type === 'session.metadata.loaded') {
-    const usage = normalizeUsagePayload(action.session?.providerState?.codex?.latestUsage);
+    const usage = usageFromProviderState(action.session?.providerState, action.session?.provider);
     if (!usage || !action.sessionId) return state;
     return {
       ...state,
@@ -934,7 +957,10 @@ export function applyEvent(state = initialState, evt = {}) {
   }
 
   if (evt.event === 'run.usage') {
-    const usage = normalizeUsagePayload(evt.payload);
+    const usagePayload = (evt?.payload?.usage && typeof evt.payload.usage === 'object')
+      ? evt.payload.usage
+      : evt.payload;
+    const usage = normalizeUsagePayload(usagePayload);
     if (!usage) return state;
     const run = state.runs[evt.runId] || { text: '', done: false, steps: [], timeline: [] };
     return {
diff --git a/extension/agent-panel.html b/extension/agent-panel.html
index 13d2f1a..ff90c96 100644
--- a/extension/agent-panel.html
+++ b/extension/agent-panel.html
@@ -67,6 +67,8 @@
     <div id="bf-popover-backdrop" class="popover-backdrop hidden"></div>
 
     <section id="bf-model-panel" class="popover-panel hidden" role="listbox" aria-label="Available models">
+      <p class="popover-label">Provider</p>
+      <ul id="bf-provider-list" class="popover-list"></ul>
       <p class="popover-label">Available Models</p>
       <ul id="bf-model-list" class="popover-list"></ul>
       <p class="popover-label">Thinking Level</p>
diff --git a/extension/agent-panel.js b/extension/agent-panel.js
index 62325fd..1bbf4af 100644
--- a/extension/agent-panel.js
+++ b/extension/agent-panel.js
@@ -26,6 +26,8 @@ const STREAM_CHUNK_INTERVAL_MS = 26;
 const state = {
   value: initialState,
   auth: null,
+  providerPresets: [{ value: 'codex', label: 'Codex' }],
+  defaultProvider: 'codex',
   modelPresets: [{ value: null, label: 'Default' }],
   defaultReasoningEffort: 'medium',
   currentRunBySession: {},
@@ -67,6 +69,7 @@ const newSessionBtn = document.getElementById('bf-new-session');
 const popoverBackdropEl = document.getElementById('bf-popover-backdrop');
 const modelPanelEl = document.getElementById('bf-model-panel');
 const sessionPanelEl = document.getElementById('bf-session-panel');
+const providerListEl = document.getElementById('bf-provider-list');
 const modelListEl = document.getElementById('bf-model-list');
 const thinkingListEl = document.getElementById('bf-thinking-list');
 const switchSessionListEl = document.getElementById('bf-switch-session-list');
@@ -398,6 +401,27 @@ function formatModelLabel(model) {
   return model && String(model).trim() ? model : 'Default';
 }
 
+function normalizeProvider(value) {
+  const normalized = String(value || '').trim().toLowerCase();
+  return normalized || null;
+}
+
+function formatProviderLabel(provider) {
+  const normalized = normalizeProvider(provider);
+  if (!normalized) return 'Provider';
+  return normalized.charAt(0).toUpperCase() + normalized.slice(1);
+}
+
+function getSessionProvider(session) {
+  const explicit = normalizeProvider(session?.provider);
+  if (explicit) return explicit;
+  const configured = normalizeProvider(state.defaultProvider);
+  if (configured) return configured;
+  const firstPreset = normalizeProvider(state.providerPresets?.[0]?.value);
+  if (firstPreset) return firstPreset;
+  return 'codex';
+}
+
 function normalizeReasoningEffort(value) {
   const normalized = String(value || '').trim().toLowerCase();
   if (normalized === 'low' || normalized === 'medium' || normalized === 'high' || normalized === 'xhigh') {
@@ -474,8 +498,35 @@ function renderSelectors() {
 function renderModelList() {
   if (!modelListEl || !thinkingListEl) return;
   const activeSession = getActiveSession();
+  const activeProvider = getSessionProvider(activeSession);
   const activeModel = activeSession?.model || null;
   const activeReasoningEffort = normalizeReasoningEffort(activeSession?.reasoningEffort);
+  const providerRows = state.providerPresets.length > 0
+    ? state.providerPresets
+    : [{ value: activeProvider, label: formatProviderLabel(activeProvider) }];
+
+  if (providerListEl) {
+    providerListEl.innerHTML = providerRows.map((preset) => {
+      const providerValue = normalizeProvider(preset.value);
+      const active = providerValue === activeProvider ? 'active' : '';
+      return `
+        <li>
+          <button type="button" data-provider="${escapeHtml(providerValue || '')}" class="popover-item ${active}">
+            <span>${escapeHtml(preset.label)}</span>
+          </button>
+        </li>
+      `;
+    }).join('');
+
+    providerListEl.querySelectorAll('button[data-provider]').forEach((button) => {
+      button.addEventListener('click', () => {
+        const provider = button.dataset.provider || null;
+        updateActiveSessionProvider(provider).catch((error) => {
+          setStatus('error', error.message || 'Unable to update provider');
+        });
+      });
+    });
+  }
 
   const rows = state.modelPresets.map((preset) => {
     const active = (preset.value || null) === activeModel ? 'active' : '';
@@ -1498,11 +1549,64 @@ function normalizeModelRows(input) {
   return rows;
 }
 
-async function loadModelPresets() {
-  const res = await api('/v1/models', { method: 'GET', headers: {} });
+function normalizeProviderRows(input) {
+  const source = Array.isArray(input) ? input : [];
+  const seen = new Set();
+  const rows = [];
+  for (const row of source) {
+    if (!row) continue;
+    const rawValue = typeof row === 'string'
+      ? row
+      : (row.value ?? row.id);
+    const value = normalizeProvider(rawValue);
+    if (!value || seen.has(value)) continue;
+    seen.add(value);
+    const label = (typeof row === 'object' && row.label && String(row.label).trim())
+      ? String(row.label).trim()
+      : formatProviderLabel(value);
+    rows.push({ value, label });
+  }
+  return rows;
+}
+
+function resolveModelCatalog(body, preferredProvider = null) {
+  const payload = (body && typeof body === 'object') ? body : {};
+  const modelsByProvider = payload.modelsByProvider && typeof payload.modelsByProvider === 'object'
+    ? payload.modelsByProvider
+    : null;
+  const providerRows = normalizeProviderRows(
+    payload.providers || payload.providerPresets || payload.availableProviders
+      || (modelsByProvider ? Object.keys(modelsByProvider) : []),
+  );
+  const defaultProvider = normalizeProvider(payload.defaultProvider || payload.provider)
+    || normalizeProvider(providerRows[0]?.value)
+    || 'codex';
+  const selectedProvider = normalizeProvider(preferredProvider)
+    || defaultProvider;
+  const scopedModels = (modelsByProvider && selectedProvider && Array.isArray(modelsByProvider[selectedProvider]))
+    ? modelsByProvider[selectedProvider]
+    : payload.models;
+  return {
+    providerRows: providerRows.length > 0
+      ? providerRows
+      : [{ value: selectedProvider, label: formatProviderLabel(selectedProvider) }],
+    defaultProvider,
+    models: normalizeModelRows(scopedModels),
+  };
+}
+
+async function loadModelPresets(provider = null) {
+  const scopedProvider = normalizeProvider(provider);
+  const path = scopedProvider
+    ? `/v1/models?provider=${encodeURIComponent(scopedProvider)}`
+    : '/v1/models';
+  const res = await api(path, { method: 'GET', headers: {} });
   await ensureOk(res, 'Failed to load models');
   const body = await readJsonOrEmpty(res);
-  state.modelPresets = normalizeModelRows(body.models);
+  const catalog = resolveModelCatalog(body, scopedProvider);
+  state.providerPresets = catalog.providerRows;
+  state.defaultProvider = catalog.defaultProvider;
+  state.modelPresets = catalog.models;
   state.defaultReasoningEffort = normalizeReasoningEffort(body.defaultReasoningEffort) || 'medium';
 }
 
@@ -1544,6 +1648,15 @@ async function selectSession(sessionId) {
   })) {
     return;
   }
+  await loadModelPresets(getSessionProvider(getActiveSession())).catch(() => {});
+  if (!shouldApplySessionSelection({
+    requestToken: selectionToken,
+    latestRequestToken: state.sessionSelectionToken,
+    requestedSessionId: sessionId,
+    activeSessionId: state.value.activeSessionId,
+  })) {
+    return;
+  }
   connectEvents(sessionId);
 }
 
@@ -1627,8 +1740,28 @@ async function updateActiveSessionModel(model) {
     throw new Error(body.error || 'Unable to update model');
   }
 
-  await loadModelPresets().catch(() => {});
   await loadSessions(sessionId);
+  await loadModelPresets(getSessionProvider(getActiveSession())).catch(() => {});
+  setPopover('none');
+  setStatus('ready', 'Ready');
+}
+
+async function updateActiveSessionProvider(provider) {
+  const sessionId = state.value.activeSessionId;
+  const nextProvider = normalizeProvider(provider);
+  if (!sessionId || !nextProvider) return;
+
+  const res = await api(`/v1/sessions/${encodeURIComponent(sessionId)}`, {
+    method: 'PATCH',
+    body: JSON.stringify({ provider: nextProvider }),
+  });
+  if (!res.ok) {
+    const body = await res.json().catch(() => ({}));
+    throw new Error(body.error || 'Unable to update provider');
+  }
+
+  await loadSessions(sessionId);
+  await loadModelPresets(getSessionProvider(getActiveSession())).catch(() => {});
   setPopover('none');
   setStatus('ready', 'Ready');
 }
@@ -1771,13 +1904,15 @@ async function initializePanel() {
   startInitialTabAttach();
   await loadAuth();
   bindTabAttachWatchers();
+  await loadSessions();
   try {
-    await loadModelPresets();
+    await loadModelPresets(getSessionProvider(getActiveSession()));
   } catch {
+    state.providerPresets = [{ value: 'codex', label: 'Codex' }];
+    state.defaultProvider = 'codex';
     state.modelPresets = [{ value: null, label: 'Default' }];
     state.defaultReasoningEffort = 'medium';
   }
-  await loadSessions();
   if (shouldStartFreshSession || !state.value.activeSessionId) {
     await createSession();
   } else {
diff --git a/test/agent/agent-panel-contract.test.js b/test/agent/agent-panel-contract.test.js
index 62362b4..4825aa4 100644
--- a/test/agent/agent-panel-contract.test.js
+++ b/test/agent/agent-panel-contract.test.js
@@ -13,6 +13,7 @@ test('agent panel has inline model and session selectors with popovers', () => {
   assert.match(html, /aria-label="New Session"/);
   assert.match(html, /id="bf-model-panel"/);
   assert.match(html, /id="bf-session-panel"/);
+  assert.match(html, /id="bf-provider-list"/);
   assert.match(html, /id="bf-model-list"/);
   assert.match(html, /id="bf-thinking-list"/);
   assert.match(html, /id="bf-switch-session-list"/);
@@ -110,3 +111,11 @@ test('startup error card action buttons have dedicated styling hooks', () => {
   assert.match(css, /\.empty-action-btn/);
   assert.match(css, /\.empty-action-btn\.secondary/);
 });
+
+test('model popover includes provider selector and session patch flow for provider updates', () => {
+  assert.match(panelJs, /const providerListEl = document\.getElementById\('bf-provider-list'\)/);
+  assert.match(panelJs, /data-provider=/);
+  assert.match(panelJs, /async function updateActiveSessionProvider\(provider\)/);
+  assert.match(panelJs, /method:\s*'PATCH'/);
+  assert.match(panelJs, /JSON\.stringify\(\{\s*provider:\s*nextProvider\s*\}\)/);
+});
diff --git a/test/agent/session-ui-state.test.js b/test/agent/session-ui-state.test.js
index c80dbdf..ce5c741 100644
--- a/test/agent/session-ui-state.test.js
+++ b/test/agent/session-ui-state.test.js
@@ -208,3 +208,69 @@ test('session.metadata.loaded hydrates persisted codex usage for reopened sessio
   assert.equal(next.latestUsageBySession.s1.modelContextWindow, 258400);
   assert.equal(next.latestUsageBySession.s1.totalTokens, 1120);
 });
+
+test('session.metadata.loaded hydrates usage from the active provider key', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {},
+    latestUsageBySession: {},
+  };
+
+  const next = reduceState(state, {
+    type: 'session.metadata.loaded',
+    sessionId: 's1',
+    session: {
+      sessionId: 's1',
+      provider: 'openai',
+      providerState: {
+        openai: {
+          latestUsage: {
+            modelContextWindow: 200000,
+            totalTokens: 3210,
+          },
+        },
+        codex: {
+          latestUsage: {
+            modelContextWindow: 258400,
+            totalTokens: 999,
+          },
+        },
+      },
+    },
+  });
+
+  assert.equal(next.latestUsageBySession.s1.modelContextWindow, 200000);
+  assert.equal(next.latestUsageBySession.s1.totalTokens, 3210);
+});
+
+test('session.metadata.loaded falls back to codex usage when active provider usage is unavailable', () => {
+  const state = {
+    activeSessionId: 's1',
+    sessions: [],
+    runs: {},
+    messagesBySession: {},
+    latestUsageBySession: {},
+  };
+
+  const next = reduceState(state, {
+    type: 'session.metadata.loaded',
+    sessionId: 's1',
+    session: {
+      sessionId: 's1',
+      provider: 'openai',
+      providerState: {
+        codex: {
+          latestUsage: {
+            modelContextWindow: 258400,
+            totalTokens: 1120,
+          },
+        },
+      },
+    },
+  });
+
+  assert.equal(next.latestUsageBySession.s1.modelContextWindow, 258400);
+  assert.equal(next.latestUsageBySession.s1.totalTokens, 1120);
+});

From 4523c321c7740dfe1155da987414e83fde2c093b Mon Sep 17 00:00:00 2001
From: Valsaraj <valsaraj03@gmail.com>
Date: Thu, 5 Mar 2026 15:10:46 +0530
Subject: [PATCH 192/192] docs(agent): document multi-provider session and run
 behavior

---
 README.md                  |  8 ++++----
 docs/BROWSERFORCE_AGENT.md | 23 +++++++++++++++--------
 2 files changed, 19 insertions(+), 12 deletions(-)

diff --git a/README.md b/README.md
index 7fa0e60..c3f2e22 100644
--- a/README.md
+++ b/README.md
@@ -397,13 +397,13 @@ Dedicated guide: [`docs/BROWSERFORCE_AGENT.md`](docs/BROWSERFORCE_AGENT.md) (set
 - Open popup -> `Open BrowserForce Agent` to open the side panel.
 - Use the session list to switch between chats; transcripts hydrate per selected `sessionId`.
 - Session identity is explicit and persisted; there is no fixed/hardcoded chat session ID.
-- BrowserForce session metadata persists Codex continuity state at `providerState.codex.sessionId`.
-  - New runs use `codex exec resume <sessionId> --json` when this mapping exists.
-  - If resume fails with an explicit invalid-session signature, chatd retries once as a fresh run.
+- BrowserForce session metadata persists provider continuity at `providerState.<providerId>.sessionId` with `session.provider` selecting the active provider (`codex` default, `claude` optional).
+  - Codex runs use `codex exec resume <sessionId> --json` when this mapping exists.
+  - If Codex resume fails with an explicit invalid-session signature, chatd retries once as a fresh run.
 - Streaming uses `fetch` + `ReadableStream` for SSE, not `EventSource`, so the panel can send `Authorization: Bearer ...` headers.
 - Side-panel status includes a context usage chip:
   - Live updates from `run.usage` SSE events when available.
-  - Hydrates from `GET /v1/sessions/:sessionId` via `providerState.codex.latestUsage`.
+  - Hydrates from `GET /v1/sessions/:sessionId` via `providerState.<providerId>.latestUsage` (with legacy Codex fallback).
   - Falls back to `Context: unavailable` when telemetry is absent.
 
 Daemon lifecycle:
diff --git a/docs/BROWSERFORCE_AGENT.md b/docs/BROWSERFORCE_AGENT.md
index 54b0862..7335b84 100644
--- a/docs/BROWSERFORCE_AGENT.md
+++ b/docs/BROWSERFORCE_AGENT.md
@@ -1,7 +1,7 @@
 # BrowserForce Agent
 
 BrowserForce Agent is the local chat daemon (`chatd`) plus the Chrome extension side-panel UI.
-It gives you resumable, multi-session chat backed by Codex, while keeping data local on loopback.
+It gives you resumable, multi-session chat backed by provider adapters (Codex by default, Claude optional), while keeping data local on loopback.
 
 ## What This Covers
 
@@ -47,10 +47,11 @@ browserforce agent stop
 
 - Session IDs are explicit and user-selectable. There is no fixed/hardcoded chat session.
 - Sessions persist under `~/.browserforce/agent/sessions/`.
-- BrowserForce stores Codex continuity under `providerState.codex.sessionId`.
-- New runs attempt `codex exec resume <sessionId> --json` when mapping exists.
-- If resume fails with an invalid-session signature, chatd retries once with a fresh run.
-- Usage telemetry from `run.usage` is persisted at `providerState.codex.latestUsage` and used to hydrate the context usage chip.
+- Session metadata stores provider identity in `session.provider` (defaults to `codex` when omitted).
+- Continuity and usage are provider-scoped under `providerState.<providerId>`.
+- Codex runs attempt `codex exec resume <sessionId> --json` when mapping exists.
+- If Codex resume fails with an invalid-session signature, chatd retries once with a fresh run.
+- Usage telemetry from `run.usage` is persisted at `providerState.<providerId>.latestUsage` and used to hydrate the context usage chip.
 
 ## API Surface
 
@@ -62,19 +63,23 @@ All `/v1/*` endpoints require `Authorization: Bearer <token>`.
 - `GET /v1/sessions`
   - List sessions.
 - `POST /v1/sessions`
-  - Create session (`title`, optional `model`, optional `reasoningEffort`).
+  - Create session (`title`, optional `provider`, optional `model`, optional `reasoningEffort`).
 - `GET /v1/sessions/:sessionId`
   - Fetch session metadata (includes `providerState` when present).
 - `PATCH /v1/sessions/:sessionId`
-  - Update session `title`, `model`, or `reasoningEffort`.
+  - Update session `title`, `provider`, `model`, or `reasoningEffort`.
+- `GET /v1/providers`
+  - Returns available provider adapters and default provider.
 - `GET /v1/sessions/:sessionId/messages?limit=200`
   - Read transcript messages.
 - `GET /v1/models`
-  - Returns available model presets and default reasoning effort.
+  - Returns provider-scoped model presets.
+  - Optional query: `?provider=codex|claude`.
 - `GET /v1/events?sessionId=<id>`
   - SSE stream (`chat.delta`, `chat.final`, `run.provider_session`, `run.usage`, etc.).
 - `POST /v1/runs`
   - Start run for `{ sessionId, message, browserContext? }`.
+  - `reasoningEffort` settings are currently applied to Codex runs. Claude runs ignore `reasoningEffort`.
 - `POST /v1/runs/:runId/abort` or `DELETE /v1/runs/:runId/abort`
   - Abort active run.
 
@@ -115,6 +120,8 @@ Optional external config:
   - `agent start` syncs a managed BrowserForce `AGENTS.md` into this directory (unless a custom unmanaged `AGENTS.md` is already present).
 - `BF_CHATD_CODEX_COMMAND`
   - Codex binary/command used by chatd (default `codex`).
+- `BF_CHATD_CLAUDE_COMMAND`
+  - Claude binary/command used by chatd when provider is `claude` (default `claude`).
 - `BF_CHATD_MODEL_LIST_TIMEOUT_MS`
   - Timeout when querying model catalog from Codex app-server.
 - `BF_CHATD_DEFAULT_MODEL`