feat: add Jetson Orin Nano support by realkim93 · Pull Request #405 · NVIDIA/NemoClaw

realkim93 · 2026-03-19T12:29:35Z

Closes #404
Also fixes #539, relates to #65, #300, #511, #1076

Summary

Adds end-to-end Jetson Orin Nano support: from GPU detection through to a working OpenClaw agent with local Ollama inference.

GPU detection: nvidia-smi reports [N/A] for memory.total on Jetson (unified memory). Added fallback via GPU name + /proc/device-tree/model, reading system RAM with free(1).
iptables-legacy fix: Tegra kernel lacks nft_chain_filter modules → k3s panic. Auto-rebuilds gateway image with iptables-legacy (idempotent via Docker label).
Default model: nemotron-3-nano:4b for Jetson (30B default exceeds 8GB).
Local inference routing: OpenShell 0.0.10 does not register inference.local in CoreDNS ([Bug] inference.local returns HTTP 403 inside sandbox when using Ollama local inference on DGX Spark #314, Local Ollama inference routing fails from sandbox #385, Sandbox cannot reach local Ollama instance on host due to enforced internal proxy (403 Forbidden) #417). Local providers (Ollama, vLLM) now route directly via host.openshell.internal instead.
Network policy: Added ollama_local endpoint to sandbox baseline policy.
Port conflict fix: Destroy stale gateways before port checks.
setup-jetson command: Host-level prep (NVIDIA runtime, kernel modules, Node.js version check).
Local provider sandbox config: Added ollama-local and vllm-local cases to getSandboxInferenceConfig() so local providers get direct endpoint URLs instead of falling through to the default inference.local path.

Verified working

$ openclaw agent --agent main --message 'Say hello in Korean'
안녕하세요! (Hello in Korean)

Inference path: OpenClaw (sandbox) → host.openshell.internal:11434 → Ollama → nemotron-3-nano:4b

Test plan

# tests 70
# pass 68
# fail 0
# skipped 2  (discrete NVIDIA + Apple — expected on Jetson)

GPU detection: "NVIDIA Jetson Orin Nano ... Super", 7619 MB
Gateway: iptables-legacy patched, k3s starts without panic
Sandbox: Phase Ready
Inference: openclaw agent responds via nemotron-3-nano:4b
Re-run onboard: no port conflict errors
Cloud inference (nvidia-nim): unaffected (still routes via inference.local)
Onboarding lifecycle: healthy gateway preserved on rerun, stale gateway cleaned up, foreign gateway left for port-check surfacing
Local provider routing: ollama-local and vllm-local route through host.openshell.internal
Node.js version check in setup-jetson.sh (>=22.16.0)

Tested on: Jetson Orin Nano Super (8GB, JetPack 6.x, kernel 5.15.148-tegra)

Co-Authored-By: Claude Opus 4.6 (1M context) noreply@anthropic.com

coderabbitai · 2026-03-19T12:29:49Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Detect Jetson unified-memory GPUs, prefer a smaller Jetson-specific Ollama model, patch the OpenShell gateway image for iptables-legacy on Jetson, add a host setup script and CLI command for Jetson preparation, and update local Ollama routing and tests for Jetson behavior.

Changes

Cohort / File(s)	Summary
GPU detection & model selection `bin/lib/nim.js`, `bin/lib/local-inference.js`	`detectGpu()` now recognizes Jetson unified‑memory devices (`jetson: true`, `nimCapable: false`, memory from system RAM). Added `DEFAULT_OLLAMA_MODEL_JETSON = "nemotron-3-nano:4b"`. `getOllamaModelOptions(runCapture, gpu)` and `getDefaultOllamaModel(runCapture, gpu)` signatures updated to accept `gpu` and prefer Jetson-specific model/fallbacks when `gpu.jetson`.
Onboard workflow & gateway patching `bin/lib/onboard.js`	`promptOllamaModel()` forwards `gpu`. `preflight()` frees gateway resources, treats Jetson as unified‑memory (no NIM), and performs gateway cleanup. Added `getGatewayImageTag()` and `patchGatewayImageForJetson()` to rebuild/tag the gateway image using `iptables-legacy` when needed; `startGateway(gpu)` invokes patching on Jetson. Writes sandbox `~/.nemoclaw/config.json` for openclaw sandbox.
CLI & host setup script `bin/nemoclaw.js`, `scripts/setup-jetson.sh`	New `nemoclaw setup-jetson` command. Added `scripts/setup-jetson.sh` to detect Jetson, add non‑root user to `docker` group, ensure `nvidia-container-runtime` is configured and set as Docker default-runtime, load/persist kernel modules, restart Docker with retries, and print next steps.
Local inference routing & config `bin/lib/inference-config.js`, `bin/lib/local-inference.js`	Exported host gateway URL and `getLocalProviderBaseUrl(...)`. `inference-config` now uses provider-specific direct endpoints (`OLLAMA_DIRECT_URL` / `VLLM_DIRECT_URL`) for `ollama-local`/`vllm-local` instead of the shared `INFERENCE_ROUTE_URL`.
Sandbox network policy `nemoclaw-blueprint/policies/openclaw-sandbox.yaml`	Added `network_policies.ollama_local` allowing sandbox access to `host.openshell.internal:11434` for `/usr/local/bin/openclaw`.
Tests `test/local-inference.test.js`, `test/nim.test.js`, `test/inference-config.test.js`	Added Jetson model tests and `DEFAULT_OLLAMA_MODEL_JETSON` import; `nim` tests probe GPU once and conditionally run Jetson/discrete/Apple checks; `ollama-local` provider test simplified to assert endpoint and model fields.
New file `scripts/setup-jetson.sh`	New executable script (host-side Jetson prep) added.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant CLI as "nemoclaw setup-jetson"
    participant Setup as "scripts/setup-jetson.sh"
    participant Docker as "Docker Daemon"
    User->>CLI: run setup-jetson
    CLI->>Setup: sudo -E bash setup-jetson.sh
    Setup->>Setup: validate OS/arch/root and detect Jetson
    Setup->>Docker: ensure nvidia-container-runtime in daemon.json
    Setup->>Docker: restart docker (systemctl/service)
    Docker-->>Setup: docker recovers
    Setup->>User: print completion & next steps

sequenceDiagram
    participant User
    participant Onboard as "nemoclaw onboard"
    participant Preflight as preflight()
    participant GPUDetect as nim.detectGpu()
    participant ModelSelect as local-inference
    participant GatewayPatch as patchGatewayImageForJetson()
    participant StartGW as startGateway()
    User->>Onboard: run onboard
    Onboard->>GPUDetect: detectGpu()
    GPUDetect-->>Onboard: { jetson: true, nimCapable: false, totalMemoryMB, name }
    Onboard->>Preflight: preflight(gpu)
    Preflight->>ModelSelect: getDefaultOllamaModel(runCapture, gpu)
    ModelSelect-->>Preflight: return nemotron-3-nano:4b (or first available)
    Preflight->>StartGW: startGateway(gpu)
    StartGW->>GatewayPatch: patchGatewayImageForJetson(gpu)
    GatewayPatch-->>StartGW: patched/tagged image
    StartGW->>StartGW: start gateway container
    StartGW-->>Onboard: gateway ready

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

Poem

🐇 I sniffed a Jetson in the night,
Unified memory, cozy and light.
A smaller model snug in my pack,
Gateway patched—no k3s attack,
Hop! The warren's working right.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 40.91% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The PR title 'feat: add Jetson Orin Nano support' directly and clearly summarizes the main objective of the changeset, which adds comprehensive end-to-end Jetson Orin Nano support.
Linked Issues check	✅ Passed	The PR successfully addresses all coding requirements from issue `#404`: GPU detection fallback for Jetson devices, gateway image patching for iptables-legacy, Jetson-specific default Ollama model, local inference routing fix, and network policy updates.
Out of Scope Changes check	✅ Passed	All changes are directly related to issue `#404` requirements: GPU detection, gateway patching, model selection, inference routing, port conflict handling, and host setup commands. No unrelated changes detected.
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

🧹 Nitpick comments (2)

scripts/setup-jetson.sh (1)
179-194: Consider extending Docker restart timeout.

The restart polling loop allows 20 seconds (10 × 2s) for Docker to recover. On resource-constrained Jetson devices, Docker might take longer to restart, especially if it needs to pull the NVIDIA runtime shim.

Consider extending to 30 seconds for more headroom on slower devices, though 20s is likely sufficient for most cases.
♻️ Optional: Increase restart timeout
-  for i in 1 2 3 4 5 6 7 8 9 10; do
+  for i in $(seq 1 15); do
     if docker info > /dev/null 2>&1; then
       break
     fi
-    [ "$i" -eq 10 ] && fail "Docker didn't come back after restart. Check 'systemctl status docker'."
+    [ "$i" -eq 15 ] && fail "Docker didn't come back after restart. Check 'systemctl status docker'."
     sleep 2
   done
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@scripts/setup-jetson.sh` around lines 179 - 194, The restart polling
currently loops 10 times with a 2s sleep (≈20s total); increase the timeout to
~30s by changing the loop from 1..10 to 1..15 and update the failure check to
compare against 15 (keep the same docker info check and logging around
NEEDS_RESTART, systemctl restart docker, service docker restart/dockerd). This
ensures the existing restart logic and messages remain but gives Docker more
time on slow Jetson devices.
bin/lib/onboard.js (1)
351-406: Well-designed idempotent image patching.

The implementation is solid:

Uses Docker label (io.nemoclaw.jetson-patched) for idempotency check

Falls back to symlinks if update-alternatives fails

Fails explicitly if iptables-legacy is unavailable (important for debugging)

Cleans up temp directory after build

One minor note: os is required locally on line 381, but this is fine since it's a core module and keeps the import close to usage. However, os is already imported at line 8 of this file, so this local require is redundant.
♻️ Optional: Remove redundant local require
-  const os = require("os");
-  const tmpDir = fs.mkdtempSync(path.join(os.tmpdir(), "nemoclaw-jetson-"));
+  const tmpDir = fs.mkdtempSync(path.join(require("os").tmpdir(), "nemoclaw-jetson-"));
Or simply use the already-imported os from line 8. But since os is not currently imported at the top of this file (only fs and path from core modules), the local require is actually necessary. Disregard this suggestion.
Actually, looking more carefully at the file - line 8 shows const fs = require("fs"); and line 9 shows const path = require("path");, but os is not imported at the top. So the local require is necessary.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@bin/lib/onboard.js` around lines 351 - 406, The local require("os") inside
patchGatewayImageForJetson is redundant only if a top-level const os =
require("os") already exists; check the file imports and if os is already
imported at top remove the local const os = require("os") from
patchGatewayImageForJetson and use the top-level os, otherwise leave the local
require in place to avoid breaking the function.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@bin/lib/onboard.js`:
- Around line 351-406: The local require("os") inside patchGatewayImageForJetson
is redundant only if a top-level const os = require("os") already exists; check
the file imports and if os is already imported at top remove the local const os
= require("os") from patchGatewayImageForJetson and use the top-level os,
otherwise leave the local require in place to avoid breaking the function.

In `@scripts/setup-jetson.sh`:
- Around line 179-194: The restart polling currently loops 10 times with a 2s
sleep (≈20s total); increase the timeout to ~30s by changing the loop from 1..10
to 1..15 and update the failure check to compare against 15 (keep the same
docker info check and logging around NEEDS_RESTART, systemctl restart docker,
service docker restart/dockerd). This ensures the existing restart logic and
messages remain but gives Docker more time on slow Jetson devices.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 4b79d258-4025-40eb-a63d-c6475c23cd92

📥 Commits

Reviewing files that changed from the base of the PR and between 3ba517d and e311944.

📒 Files selected for processing (7)

bin/lib/local-inference.js
bin/lib/nim.js
bin/lib/onboard.js
bin/nemoclaw.js
scripts/setup-jetson.sh
test/local-inference.test.js
test/nim.test.js

wscurran · 2026-03-19T22:45:22Z

I've taken a look at the changes and see that this PR adds support for the Jetson Orin Nano, which includes implementing a fallback for GPU detection and resolving an issue with iptables-legacy that was causing a k3s panic.

realkim93 · 2026-03-20T01:25:25Z

Thanks for reviewing! This was tested on my Jetson Orin Nano Super (8GB, JetPack 6.x, kernel 5.15.148-tegra) — the gateway starts cleanly and the sandbox reaches Ready state with Ollama + nemotron-3-nano:4b. Let me know if there's anything I should adjust or if you'd like me to run additional tests on the device.

realkim93 · 2026-03-20T13:18:26Z

Added a commit to fix local Ollama inference routing inside the sandbox.

Problem: OpenShell 0.0.10 does not register inference.local in CoreDNS, so DNS resolution fails inside the sandbox. This is the same issue reported in #314, #385, #417 — OpenClaw inside the sandbox cannot reach the host Ollama.

Fix: For ollama-local provider, route directly through host.openshell.internal:11434 (Docker host-gateway alias) instead of inference.local. Also added ollama_local network policy endpoint to openclaw-sandbox.yaml.

All 35 tests pass (33 pass, 2 skipped).

coderabbitai

🧹 Nitpick comments (1)

test/inference-config.test.js (1)
33-40: Consider: tighten the URL assertion for completeness.

The regex /host\.openshell\.internal:11434/ validates the host and port but doesn't verify the protocol (http://) or path (/v1). A malformed URL like host.openshell.internal:11434 (missing protocol) would still pass.

If full validation is desired, consider a more specific match:
📝 Optional: stricter URL assertion
-    assert.match(cfg.endpointUrl, /host\.openshell\.internal:11434/);
+    assert.equal(cfg.endpointUrl, "http://host.openshell.internal:11434/v1");
Alternatively, restore full deep equality to catch regressions in ncpPartner, profile, or credentialEnv if those properties are expected to remain stable.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@test/inference-config.test.js` around lines 33 - 40, Tighten the URL
assertion in the test that calls getProviderSelectionConfig("ollama-local") by
asserting the full endpoint URL (including protocol and path) rather than just
matching host:port; update the check on cfg.endpointUrl to require
"http://host.openshell.internal:11434/v1" (or the canonical expected URL) and/or
replace the shallow property assertions with a deep equality check of the
returned config object to catch regressions in properties like ncpPartner,
profile, and credentialEnv while still keeping the expected values for
endpointType, model (DEFAULT_OLLAMA_MODEL), provider, and providerLabel.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@test/inference-config.test.js`:
- Around line 33-40: Tighten the URL assertion in the test that calls
getProviderSelectionConfig("ollama-local") by asserting the full endpoint URL
(including protocol and path) rather than just matching host:port; update the
check on cfg.endpointUrl to require "http://host.openshell.internal:11434/v1"
(or the canonical expected URL) and/or replace the shallow property assertions
with a deep equality check of the returned config object to catch regressions in
properties like ncpPartner, profile, and credentialEnv while still keeping the
expected values for endpointType, model (DEFAULT_OLLAMA_MODEL), provider, and
providerLabel.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: e2affc8e-3a9b-49c3-8c5e-e87f67d07302

📥 Commits

Reviewing files that changed from the base of the PR and between e311944 and 313c356.

📒 Files selected for processing (3)

bin/lib/inference-config.js
nemoclaw-blueprint/policies/openclaw-sandbox.yaml
test/inference-config.test.js

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (1)

bin/lib/onboard.js (1)

381-404: Clean up the temporary build context in a finally.

docker build throws here on failure, so Line 404 never runs and nemoclaw-jetson-* directories accumulate under the temp dir on repeated failures. A try/finally with fs.rmSync(..., { recursive: true, force: true }) makes the cleanup deterministic and avoids the extra shell-out.

♻️ Suggested change

   const os = require("os");
   const tmpDir = fs.mkdtempSync(path.join(os.tmpdir(), "nemoclaw-jetson-"));
   const dockerfile = path.join(tmpDir, "Dockerfile");
-  fs.writeFileSync(
-    dockerfile,
-    [
-      `FROM ${image}`,
-      `RUN if command -v update-alternatives >/dev/null 2>&1 && \\`,
-      `      update-alternatives --set iptables /usr/sbin/iptables-legacy 2>/dev/null && \\`,
-      `      update-alternatives --set ip6tables /usr/sbin/ip6tables-legacy 2>/dev/null; then \\`,
-      `      :; \\`,
-      `    elif [ -f /usr/sbin/iptables-legacy ] && [ -f /usr/sbin/ip6tables-legacy ]; then \\`,
-      `      ln -sf /usr/sbin/iptables-legacy /usr/sbin/iptables; \\`,
-      `      ln -sf /usr/sbin/ip6tables-legacy /usr/sbin/ip6tables; \\`,
-      `    else \\`,
-      `      echo "iptables-legacy not available in base image" >&2; exit 1; \\`,
-      `    fi`,
-      `LABEL io.nemoclaw.jetson-patched="true"`,
-      "",
-    ].join("\n")
-  );
-
-  run(`docker build --quiet -t "${image}" "${tmpDir}"`, { ignoreError: false });
-  run(`rm -rf "${tmpDir}"`, { ignoreError: true });
+  try {
+    fs.writeFileSync(
+      dockerfile,
+      [
+        `FROM ${image}`,
+        `RUN if command -v update-alternatives >/dev/null 2>&1 && \\`,
+        `      update-alternatives --set iptables /usr/sbin/iptables-legacy 2>/dev/null && \\`,
+        `      update-alternatives --set ip6tables /usr/sbin/ip6tables-legacy 2>/dev/null; then \\`,
+        `      :; \\`,
+        `    elif [ -f /usr/sbin/iptables-legacy ] && [ -f /usr/sbin/ip6tables-legacy ]; then \\`,
+        `      ln -sf /usr/sbin/iptables-legacy /usr/sbin/iptables; \\`,
+        `      ln -sf /usr/sbin/ip6tables-legacy /usr/sbin/ip6tables; \\`,
+        `    else \\`,
+        `      echo "iptables-legacy not available in base image" >&2; exit 1; \\`,
+        `    fi`,
+        `LABEL io.nemoclaw.jetson-patched="true"`,
+        "",
+      ].join("\n")
+    );
+
+    run(`docker build --quiet -t "${image}" "${tmpDir}"`, { ignoreError: false });
+  } finally {
+    fs.rmSync(tmpDir, { recursive: true, force: true });
+  }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@bin/lib/onboard.js` around lines 381 - 404, The temp build context created
via fs.mkdtempSync (tmpDir) and populated (dockerfile via fs.writeFileSync) can
leak when run(`docker build...`) throws; wrap the build and any operations that
may throw in a try/finally around the run call(s) so cleanup always happens, and
in the finally use fs.rmSync(tmpDir, { recursive: true, force: true }) (instead
of run(`rm -rf ...`)) to deterministically remove the nemoclaw-jetson-*
directory; adjust ordering so tmpDir is removed after the build attempt and keep
existing ignoreError semantics removed in favor of the synchronous rmSync call.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@bin/lib/onboard.js`:
- Around line 146-148: The sandbox config builder incorrectly maps non-default
local Ollama models to "nvidia-nim" by relying on DEFAULT_OLLAMA_MODEL; update
buildSandboxConfigSyncScript to determine provider type from the selected
provider or endpoint metadata (e.g., inspect the selected provider object or
endpoint.provider/type field returned by promptOllamaModel or
getOllamaModelOptions) instead of comparing model names to DEFAULT_OLLAMA_MODEL,
and apply the same metadata-driven logic to the other mapping block that mirrors
this behavior (the later duplicate mapping). Locate references to
DEFAULT_OLLAMA_MODEL and replace the conditional mapping with logic that reads
provider.type/providerName or endpoint metadata and sets the sandbox provider to
that derived value.
- Around line 279-285: Move the "kill $(lsof -ti :18789 -c openclaw) ..."
cleanup so it runs after the sandbox recreate decision in createSandbox()
instead of before it: currently the port-18789 shutdown is executed
unconditionally near the top of onboard.js (adjacent to the existing run(...)
for gateway destroy), which can stop a valid dashboard forward before
createSandbox() chooses the existing-sandbox path; update the control flow to
keep the existing early cleanup for the nemoclaw gateway but defer the openclaw
port-18789 kill until after createSandbox() determines whether to recreate
(i.e., run the kill only in the branch that actually recreates/restarts the
sandbox/dashboard, not in the pre-check path).

---

Nitpick comments:
In `@bin/lib/onboard.js`:
- Around line 381-404: The temp build context created via fs.mkdtempSync
(tmpDir) and populated (dockerfile via fs.writeFileSync) can leak when
run(`docker build...`) throws; wrap the build and any operations that may throw
in a try/finally around the run call(s) so cleanup always happens, and in the
finally use fs.rmSync(tmpDir, { recursive: true, force: true }) (instead of
run(`rm -rf ...`)) to deterministically remove the nemoclaw-jetson-* directory;
adjust ordering so tmpDir is removed after the build attempt and keep existing
ignoreError semantics removed in favor of the synchronous rmSync call.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 2fc76141-d389-4c2e-88ba-76308543f6a0

📥 Commits

Reviewing files that changed from the base of the PR and between 78d320c and 980e3ab.

📒 Files selected for processing (3)

bin/lib/inference-config.js
bin/lib/onboard.js
nemoclaw-blueprint/policies/openclaw-sandbox.yaml

🚧 Files skipped from review as they are similar to previous changes (2)

nemoclaw-blueprint/policies/openclaw-sandbox.yaml
bin/lib/inference-config.js

bin/lib/onboard.js

coderabbitai

♻️ Duplicate comments (2)

bin/lib/onboard.js (2)

685-700: ⚠️ Potential issue | 🟠 Major

Jetson default model selection still conflicts with sandbox provider serialization.

Good call passing gpu here, but this now more reliably picks a Jetson-specific Ollama default, which still collides with the model-name heuristic in buildSandboxConfigSyncScript() (local Ollama can be written as nvidia-nim).

Suggested fix at the root cause (provider derivation)

 function buildSandboxConfigSyncScript(selectionConfig) {
-  const providerType =
-    selectionConfig.profile === "inference-local"
-      ? selectionConfig.model === DEFAULT_OLLAMA_MODEL
-        ? "ollama-local"
-        : "nvidia-nim"
-      : selectionConfig.endpointType === "vllm"
-        ? "vllm-local"
-        : "nvidia-nim";
+  const localKind = String(
+    selectionConfig.provider || selectionConfig.endpointType || ""
+  ).toLowerCase();
+  const providerType =
+    selectionConfig.profile === "inference-local"
+      ? localKind.includes("ollama")
+        ? "ollama-local"
+        : localKind.includes("vllm")
+          ? "vllm-local"
+          : "nvidia-nim"
+      : "nvidia-nim";

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@bin/lib/onboard.js` around lines 685 - 700, The chosen Jetson-specific
default model (via getDefaultOllamaModel / promptOllamaModel) can still conflict
with the provider-detection heuristic in buildSandboxConfigSyncScript; fix this
by normalizing model names or deriving provider from the explicit provider
variable instead of the model string: update buildSandboxConfigSyncScript to use
the provider variable (provider === "ollama-local") or add a normalization
helper (e.g., normalizeOllamaModelForSerialization) that maps Jetson-specific
names like "nvidia-nim" to the canonical form used by sandbox serialization, and
ensure getDefaultOllamaModel and promptOllamaModel return/consult that
normalized value so provider serialization remains consistent with the selected
model.

279-285: ⚠️ Potential issue | 🟠 Major

Defer port 18789 process cleanup until recreate/create is confirmed.

Line 284 still unconditionally kills the dashboard-forward process before createSandbox() decides whether to recreate. If the user keeps an existing sandbox, onboarding can exit with the dashboard no longer forwarded.

Proposed control-flow fix

-  run("kill $(lsof -ti :18789 -c openclaw) 2>/dev/null || true", { ignoreError: true });
   sleep(2);

 async function createSandbox(gpu) {
   step(3, 7, "Creating sandbox");
@@
   if (existing) {
@@
       if (recreate.toLowerCase() !== "y") {
         console.log("  Keeping existing sandbox.");
         return sandboxName;
       }
@@
   }
+
+  // Only clean stale dashboard forwards/processes when we are actually creating/recreating.
+  run("kill $(lsof -ti :18789 -c openclaw) 2>/dev/null || true", { ignoreError: true });

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@bin/lib/onboard.js` around lines 279 - 285, The unconditional kill of the
dashboard-forward process (the run invocation that executes "kill $(lsof -ti
:18789 -c openclaw) ...") should be deferred until after createSandbox()
determines a recreate/create is required; change control flow so createSandbox()
(or the caller that decides recreate) runs first and only when it indicates a
new sandbox will be created/recreated do we execute the run("kill $(lsof -ti
:18789 -c openclaw) 2>/dev/null || true", { ignoreError: true }) cleanup (and
keep sleep(2) paired with that cleanup if still needed).

🧹 Nitpick comments (1)

bin/lib/onboard.js (1)

812-824: Remove unreachable duplicate config-write block.

buildSandboxConfigSyncScript() already writes ~/.nemoclaw/config.json and exits; the appended nemoClawConfigScript after ${script} never executes.

Cleanup diff

-    // Also write ~/.nemoclaw/config.json inside the sandbox so the NemoClaw
-    // plugin displays the correct endpoint/model in its banner instead of
-    // falling back to the cloud defaults.
-    const nemoClawConfigScript = `
-mkdir -p ~/.nemoclaw
-cat > ~/.nemoclaw/config.json <<'EOF_NEMOCLAW_CFG'
-${JSON.stringify(sandboxConfig, null, 2)}
-EOF_NEMOCLAW_CFG
-`;
     run(`cat <<'EOF_NEMOCLAW_SYNC' | openshell sandbox connect "${sandboxName}"
 ${script}
-${nemoClawConfigScript}
 exit
 EOF_NEMOCLAW_SYNC`, { stdio: ["ignore", "ignore", "inherit"] });

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@bin/lib/onboard.js` around lines 812 - 824, The appended nemoClawConfigScript
block is redundant because buildSandboxConfigSyncScript() already writes
~/.nemoclaw/config.json and exits, so remove the duplicate: delete the
nemoClawConfigScript constant and stop injecting ${nemoClawConfigScript} into
the run(...) payload (the run call that uses openshell sandbox connect
"${sandboxName}"), leaving only the previously built ${script} so that the
remote script exits as intended; reference buildSandboxConfigSyncScript(),
nemoClawConfigScript, and the run(...) invocation to locate the code to change.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In `@bin/lib/onboard.js`:
- Around line 685-700: The chosen Jetson-specific default model (via
getDefaultOllamaModel / promptOllamaModel) can still conflict with the
provider-detection heuristic in buildSandboxConfigSyncScript; fix this by
normalizing model names or deriving provider from the explicit provider variable
instead of the model string: update buildSandboxConfigSyncScript to use the
provider variable (provider === "ollama-local") or add a normalization helper
(e.g., normalizeOllamaModelForSerialization) that maps Jetson-specific names
like "nvidia-nim" to the canonical form used by sandbox serialization, and
ensure getDefaultOllamaModel and promptOllamaModel return/consult that
normalized value so provider serialization remains consistent with the selected
model.
- Around line 279-285: The unconditional kill of the dashboard-forward process
(the run invocation that executes "kill $(lsof -ti :18789 -c openclaw) ...")
should be deferred until after createSandbox() determines a recreate/create is
required; change control flow so createSandbox() (or the caller that decides
recreate) runs first and only when it indicates a new sandbox will be
created/recreated do we execute the run("kill $(lsof -ti :18789 -c openclaw)
2>/dev/null || true", { ignoreError: true }) cleanup (and keep sleep(2) paired
with that cleanup if still needed).

---

Nitpick comments:
In `@bin/lib/onboard.js`:
- Around line 812-824: The appended nemoClawConfigScript block is redundant
because buildSandboxConfigSyncScript() already writes ~/.nemoclaw/config.json
and exits, so remove the duplicate: delete the nemoClawConfigScript constant and
stop injecting ${nemoClawConfigScript} into the run(...) payload (the run call
that uses openshell sandbox connect "${sandboxName}"), leaving only the
previously built ${script} so that the remote script exits as intended;
reference buildSandboxConfigSyncScript(), nemoClawConfigScript, and the run(...)
invocation to locate the code to change.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: f6b98de2-873f-4e85-a20b-901f36fef869

📥 Commits

Reviewing files that changed from the base of the PR and between 980e3ab and dc3c12c.

📒 Files selected for processing (1)

bin/lib/onboard.js

ericksoa · 2026-03-21T05:17:10Z

Hey @realkim93 — we independently hit the same issue (#539) and arrived at the same iptables-legacy image-patching approach, so we're confident the core fix here is sound.

We did a full review of the diff and everything looks clean. One thing we noticed (also flagged by CodeRabbit): the port-18789 cleanup in preflight() runs before createSandbox() decides whether to recreate, which could leave a healthy sandbox without its dashboard forward on a no-op rerun. Could you take a look at that timing issue?

Once that's addressed we're happy to approve from our side. Nice work getting this tested on real hardware.

realkim93 · 2026-03-23T01:39:51Z

Thanks @ericksoa and @wscurran for the thoughtful review. Really appreciate you taking the time to go through the diff carefully.

@ericksoa Thank you for raising the port-18789 timing issue. I've pushed a fix that defers the dashboard-forward cleanup into createSandbox(), so it only runs when we're actually recreating or creating a new sandbox. The port-18789 availability check in preflight is also removed since that port is now fully managed inside createSandbox(). Also addressed the provider-type mapping that CodeRabbit flagged: it now derives from provider metadata instead of comparing model names, which should handle the Jetson default model correctly.

Other cleanup in this commit:

patchGatewayImageForJetson tmpDir wrapped in try/finally
Removed unreachable duplicate config-write block in setupOpenclaw
Extended Docker restart timeout in setup-jetson.sh for slower Jetson devices

Let me know if there's anything else that needs attention!

ACCGAGTT · 2026-03-25T23:46:44Z

fyi none of these work currently with nemoclaw 0.0.1.0 and openshell 0.0.0.16
keeps pulling the wrong images , can someone make sure your containers are tagged correctly if these fixes were merged and pushed

been banging my head with my Jetson Orin NAno super, same as realkim93 system

realkim93 · 2026-03-29T12:52:00Z

@ACCGAGTT Thanks for reporting this — just to clarify, this PR has not been merged yet, so the fixes here are not included in the current NemoClaw 0.0.1.0 / OpenShell 0.0.0.16 release. That's why you're seeing wrong images being pulled on your Jetson Orin Nano Super.

I'm working on rebasing this branch against the latest main (there have been several recent merges that introduced conflicts). Once this PR is merged and a new release is cut, the Jetson support should be available.

I'll update this thread once the rebase is done — happy to have another Jetson Orin Nano Super user to help verify once it lands!

realkim93 · 2026-03-29T12:52:10Z

Hi @ericksoa @kjw3 @cv — gentle ping on this PR. I've addressed all the review feedback from March 21st:

✅ Port-18789 cleanup deferred into createSandbox() (timing issue fixed)
✅ Provider-type mapping now derives from metadata, not model name comparison
✅ tmpDir safety, duplicate code removal, Docker timeout for Jetson

The branch currently has merge conflicts against main — mostly from #1037 and other recent merges to onboard.js, nim.js, and nemoclaw.js. I'll rebase and resolve these shortly.

There's also a Jetson user (@ACCGAGTT) waiting on this — would appreciate a re-review once the rebase is up. Thank you!

ACCGAGTT · 2026-03-29T13:17:02Z

@ACCGAGTT Thanks for reporting this — just to clarify, this PR has not been merged yet, so the fixes here are not included in the current NemoClaw 0.0.1.0 / OpenShell 0.0.0.16 release. That's why you're seeing wrong images being pulled on your Jetson Orin Nano Super.

I'm working on rebasing this branch against the latest main (there have been several recent merges that introduced conflicts). Once this PR is merged and a new release is cut, the Jetson support should be available.

I'll update this thread once the rebase is done — happy to have another Jetson Orin Nano Super user to help verify once it lands!

Thank you appreciate the update.

Add GPU detection, iptables-legacy fix, and nemotron-3-nano:4b default for Jetson Orin Nano Super (8GB, JetPack 6.x). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…htening - Guard runCapture().trim() against null in patchGatewayImageForJetson - Apply same inference.local bypass to vllm-local (same DNS bug affects both local providers, not just Ollama) - Use getLocalProviderBaseUrl() as single source of truth for direct URLs - Add TODO to remove direct URLs when OpenShell fixes inference.local - Remove overly broad /usr/local/bin/node from ollama_local network policy (openclaw binary alone is sufficient) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

… and cleanup safety - Defer port-18789 kill to createSandbox() after recreate decision so no-op reruns don't break a healthy dashboard forward - Derive provider type from selectionConfig.provider metadata instead of comparing model names to DEFAULT_OLLAMA_MODEL (fixes Jetson misclassification) - Wrap patchGatewayImageForJetson tmpDir in try/finally with fs.rmSync - Remove unreachable duplicate nemoClawConfigScript in setupOpenclaw - Extend Docker restart timeout to 30s for slower Jetson devices Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

The previous commit deferred the port-18789 kill to createSandbox(), but left the port availability check in preflight. This caused a hard exit when re-running onboard with an existing dashboard forward still active. Port 18789 is now fully managed inside createSandbox(). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- inference-config.test: use getLocalProviderBaseUrl() for ollama-local endpoint URL (host-gateway bypass for OpenShell 0.0.10 DNS issue) - local-inference.test: convert assert → expect (vitest) for jetson tests Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- inference-config.test.js: use INFERENCE_ROUTE_URL for ollama-local (PR NVIDIA#1037 fixed inference.local routing, host-gateway bypass removed) - local-inference.test.js: getOllamaModelOptions no longer takes gpu param; Jetson fallback moved to getBootstrapOllamaModelOptions Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Extract detectJetson() and getUnifiedMemoryMB() helper functions to bring detectGpu() cyclomatic complexity under the lint threshold (20). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

cv · 2026-03-30T01:20:56Z

Thanks for pushing this — the Jetson-specific pieces look valuable, especially the unified-memory GPU detection, the smaller Ollama default, and the dedicated setup-jetson flow.

That said, I’m not comfortable approving this as-is because I think it introduces a potentially serious onboarding regression in bin/lib/onboard.js.

Main blocker: `onboard` becomes destructive on reruns

In preflight(), this PR now unconditionally destroys the nemoclaw gateway before port checks:

run("openshell gateway destroy -g nemoclaw 2>/dev/null || true", { ignoreError: true });

My concern is that this defeats the current “safe rerun / reuse healthy state” behavior. Today, onboarding tries to distinguish between:

stale/broken NemoClaw state that should be cleaned up, vs
a healthy named gateway/sandbox that should be reused

With this change, even a healthy existing NemoClaw gateway gets torn down before we’ve decided whether we actually need to recreate anything.

Why that feels dangerous

This changes nemoclaw onboard from a mostly idempotent re-entry path into something that can be unexpectedly destructive. Practical risks:

rerunning onboarding can tear down a healthy existing session
live gateway state may be lost even when the user just wanted to re-run or inspect setup
healthy sandboxes that could have been reused now get forced through a more disruptive path

Given how sensitive onboarding is, that feels like too much blast radius for a Jetson support PR.

Secondary concern: port 18789 handling

I understand the motivation for removing 18789 from preflight to avoid the re-run regression, but I think the current version swings too far the other way.

Right now the PR:

removes 18789 from required port checks in preflight()
defers cleanup until createSandbox()

That avoids breaking a healthy dashboard forward on no-op reruns, which is good. But it also means a real conflict on 18789 from some unrelated process may no longer be surfaced early and clearly. Since later forward setup is fairly tolerant, onboarding can appear to succeed while the dashboard is actually broken.

So I think this change trades a visible failure for a potentially silent one.

Testing concern

The new tests around:

Jetson GPU detection
Jetson Ollama default model selection

look good.

But I don’t see coverage for the riskiest part of this PR, which is the onboarding lifecycle behavior:

healthy gateway reuse
rerun idempotency
port-conflict behavior for the dashboard forward

Those are exactly the areas where I’m most worried about regressions.

What I’d suggest

I’d be happy with this direction if we tighten the lifecycle changes:

Do not unconditionally destroy the named NemoClaw gateway in preflight()
- only clean up stale / unnamed / broken state
- preserve healthy gateway reuse
Keep explicit 18789 conflict handling
- but special-case the healthy NemoClaw dashboard forward
- don’t fully drop the preflight visibility for that port
Optionally, consider splitting this into:
- Jetson support itself (GPU detection, model selection, setup script)
- onboarding lifecycle changes

The Jetson-specific work looks useful, but the onboarding behavior change is a blocker for me right now.

Address review feedback from cv and CodeRabbit: 1. Remove unconditional gateway destroy in preflight() — the existing getGatewayReuseState() logic already handles stale/unnamed cleanup while preserving healthy gateways. The unconditional destroy broke safe re-run behavior and could tear down a running session. 2. Restore port 18789 (dashboard) to requiredPorts — the existing healthy-gateway skip logic already handles the re-run case correctly. Removing it entirely masked conflicts from unrelated processes. 3. Add ollama-local and vllm-local cases to getSandboxInferenceConfig() so that Jetson's default model (nemotron-3-nano:4b) gets the correct direct endpoint URL instead of falling through to the nvidia-nim default path. 4. Add tests for ollama-local and vllm-local sandbox inference config to prevent future regressions in provider mapping.

realkim93 · 2026-03-30T02:54:16Z

Thank you @cv for taking the time to review this so carefully. Your feedback is very helpful — you are right on all three points, and I appreciate you explaining the reasoning behind the existing design.

I clearly did not study the preflight() and getGatewayReuseState() flow thoroughly enough before making changes to it. That was my mistake, and I have learned a lot from your review about how the onboarding lifecycle is designed to work.

I have pushed a fix that addresses the issues you raised:

1. Removed unconditional gateway destroy in `preflight()`

Reverted to the existing getGatewayReuseState() cleanup flow, which correctly distinguishes stale/unnamed state from healthy gateways. My previous change bypassed that logic entirely, which could tear down a running session on rerun.

2. Restored port 18789 to `requiredPorts`

The healthy-gateway skip logic was already handling the re-run case correctly. Removing 18789 from the check just masked real conflicts from unrelated processes, as you pointed out.

3. Fixed provider mapping for local providers

Added explicit ollama-local and vllm-local cases to getSandboxInferenceConfig() so the Jetson default model gets the correct direct endpoint URL instead of falling through to the default path. Added tests for both.

I also want to be transparent: my earlier commit (9601a71) claimed in the message to fix this provider mapping, but reviewing the actual diff, that change was not included. I apologize for the confusion — it is now properly addressed in this commit.

Scope of this fix

createSandbox() port 18789 cleanup remains in the recreate-only path (after the early return for ready sandboxes).
All Jetson-specific code (nim.js, local-inference.js, setup-jetson.sh, gateway image patch) is unchanged.

All 82 relevant tests pass (onboard + nim + local-inference + inference-config). Rebase against latest main is also done — no conflicts.

Thank you again for the guidance. Please let me know if there is anything else I should address.

realkim93 · 2026-03-30T03:00:09Z

Rebase against latest main is done — no conflicts. All CI checks should re-run shortly.

realkim93 · 2026-03-30T16:11:26Z

Latest changes (`1f59809`)

1. Onboarding lifecycle tests

Healthy gateway preservation on idempotent reruns
Stale gateway cleanup vs foreign gateway conflict surfacing
ollama-local / vllm-local provider routing through host.openshell.internal

2. Local provider sandbox config fix
Added ollama-local and vllm-local cases to getSandboxInferenceConfig() — local providers now get direct endpoint URLs instead of falling through to the default inference.local path.

3. Node.js version check in setup-jetson.sh
JetPack doesn't ship Node.js. Added >=22.16.0 check (matching package.json engines) with clear error messaging.

Verified on actual Jetson hardware

All changes tested on Jetson Orin Nano Super (8GB, JetPack 6.x, kernel 5.15.148-tegra):

$ npx vitest run test/onboard.test.js test/nim.test.js test/local-inference.test.js

 ✓ test/local-inference.test.js (23 tests)  27ms
 ✓ test/nim.test.js (10 tests | 2 skipped)  250ms
 ✓ test/onboard.test.js (38 tests)  5186ms

 Test Files  3 passed (3)
      Tests  69 passed | 2 skipped (71)

$ npx vitest run  # full suite
 Test Files  37 passed | 1 skipped (39)
      Tests  622 passed | 4 skipped (628)

The 2 install-preflight failures in the full suite are pre-existing and unrelated to this PR (npm path detection on the Jetson's Node.js installation).

Merge origin/main into feat/jetson-orin-nano-support to resolve conflicts from recent changes (NVIDIA#1208, NVIDIA#1200, NVIDIA#836, NVIDIA#1221, NVIDIA#1223). Jetson detection now leverages main's UNIFIED_MEMORY_GPU_TAGS with added jetson flag and /proc/device-tree fallback. All 116 tests pass.

- Move setup-jetson case into correct switch position (after setup-spark) - Apply Prettier formatting to all modified files - All 747 tests pass

- Fix setupJetson runtime crash: wire nemoclaw setup-jetson to shell script (matching setup-spark pattern) instead of non-existent local-inference export - Add Xavier to isJetson regex for consistency with UNIFIED_MEMORY_GPU_TAGS - Remove redundant os require in patchGatewayImageForJetson (already imported at file top) - Use shellQuote() for docker commands in patchGatewayImageForJetson - Export getGatewayImageTag and patchGatewayImageForJetson for testability - Add setup-jetson to CLI help text - Add tests: Xavier jetson flag, patchGatewayImageForJetson structure - Suppress pre-existing detectGpu complexity lint (added by Jetson branches) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…atewayImageForJetson Replace regex-based source code inspection test with three mock-based behavioral tests that verify actual function behavior: - Idempotency: skips docker build when image already has jetson-patched label - Build path: invokes docker build with shellQuote'd image tag when unpatched - Cleanup: temp directory removed via finally block even on build failure

…docs Add two behavioral tests that directly validate cv's blocker NVIDIA#1 fix: - Healthy gateway is preserved (no destroy/forward-stop) on rerun - Stale vs healthy vs active-unnamed states trigger correct cleanup Also add setup-jetson entry to docs/reference/commands.md. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Add Jetson row to docs/get-started/quickstart.md platform table - Remove setup-jetson.test.js (source-text inspection, same pattern flagged during patchGatewayImageForJetson review; setup-spark also has no such test) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…NemoClaw into feat/jetson-orin-nano-support

realkim93 · 2026-04-02T11:42:59Z

Hi @cv — thank you again for the thorough review on March 30. I want to start by saying that your write-up genuinely changed how I think about onboarding changes. Before your review, I didn't fully appreciate the distinction between "stale state that should be cleaned up" and "healthy state that should be reused" — and I didn't realize how much care had gone into making nemoclaw onboard a safe re-entry path. That's something I'll carry forward.

Thank you also to @ericksoa for independently confirming the port-18789 timing issue and for validating the iptables-legacy approach from #539.

I appreciate that you saw value in the Jetson-specific pieces — the unified-memory GPU detection, the smaller Ollama default, and the dedicated setup-jetson flow. Those parts are unchanged from before. What follows is how I've addressed your concerns about the onboarding lifecycle, which I agree should not have been part of a Jetson PR without much more care.

On the gateway destroy blocker — I reverted to the existing getGatewayReuseState() flow. Preflight now queries status, named gateway info, and active gateway info, then classifies the state. Only "stale" and "active-unnamed" trigger cleanup. A "healthy" gateway is left untouched.

On port 18789 — your feedback helped me see that my change traded a visible failure for a silent one, which is a worse outcome. Port 18789 is back in requiredPorts with a healthy-gateway exemption, and the kill cleanup is deferred into the createSandbox() recreate path so it can't break a healthy dashboard forward on a no-op rerun.

On test coverage — I added five behavioral tests (subprocess-isolated, following the existing onboard.test.js patterns): the two preflight lifecycle tests mentioned above, plus three for patchGatewayImageForJetson covering idempotency (skip when already patched), the build path (shellQuote verification), and tmpDir cleanup when docker build fails.

Other things I fixed while going through the code more carefully (something I should have done before the first submission):

nemoclaw setup-jetson would have crashed on every call — it was importing setupJetson from local-inference.js which doesn't export that function. Wired it to scripts/setup-jetson.sh instead, same as setup-spark.
isJetson only matched orin|thor but UNIFIED_MEMORY_GPU_TAGS has "Xavier". Added xavier to the regex.
Applied shellQuote() to the docker commands in patchGatewayImageForJetson, removed a redundant const os = require("os").
Added setup-jetson to CLI help text, docs/reference/commands.md, and the quickstart compatibility table.

One note on scope — this PR also adds ollama-local and vllm-local cases to getSandboxInferenceConfig(). This is a pre-existing bug where local providers fall through to the default case and get the wrong inference.local URL. I found it during Jetson testing since local Ollama is the primary inference path on Jetson. I'm happy to split it into a separate PR if you'd prefer.

The Jetson-specific code you've already seen (local-inference.js, setup-jetson.sh, openclaw-sandbox.yaml) is unchanged from before.

All 122 relevant tests pass. I'm grateful for your patience with this PR and for the time you've invested in the review. You also suggested splitting this into Jetson support and lifecycle changes — I kept them together since the lifecycle fixes ended up small, but I'm happy to split if you think that's cleaner. Please let me know if there's anything else I should fix.

Port Jetson changes (isJetson detection, Xavier support, DEFAULT_OLLAMA_MODEL_JETSON, device-tree fallback) to the new TypeScript sources introduced by main's CJS→TS migration. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Accept main's CJS shims for bin/lib/local-inference.js and bin/lib/nim.js, and port Jetson changes (isJetson detection, Xavier support, DEFAULT_OLLAMA_MODEL_JETSON, device-tree fallback) to the new TypeScript sources in src/lib/. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

coderabbitai bot reviewed Mar 19, 2026

View reviewed changes

wscurran added enhancement New feature or request Platform: AGX Thor/Orin Support for Jetson AGX Thor and Orin labels Mar 19, 2026

wscurran added the priority: medium Issue that should be addressed in upcoming releases label Mar 19, 2026

coderabbitai bot reviewed Mar 20, 2026

View reviewed changes

bin/lib/onboard.js Outdated Show resolved Hide resolved

bin/lib/onboard.js Outdated Show resolved Hide resolved

coderabbitai bot reviewed Mar 20, 2026

View reviewed changes

realkim93 force-pushed the feat/jetson-orin-nano-support branch from b18e53a to 77f2d6f Compare March 29, 2026 16:17

realkim93 and others added 8 commits March 30, 2026 09:15

feat: add Jetson Orin Nano support

902d839

Add GPU detection, iptables-legacy fix, and nemotron-3-nano:4b default for Jetson Orin Nano Super (8GB, JetPack 6.x). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

refactor: extract Jetson detection to reduce detectGpu complexity

92e51f1

Extract detectJetson() and getUnifiedMemoryMB() helper functions to bring detectGpu() cyclomatic complexity under the lint threshold (20). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chore: apply shfmt formatting to setup-jetson.sh

c4d41dd

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

realkim93 force-pushed the feat/jetson-orin-nano-support branch from 77f2d6f to c4d41dd Compare March 30, 2026 00:15

Merge branch 'main' into feat/jetson-orin-nano-support

2fad75c

realkim93 force-pushed the feat/jetson-orin-nano-support branch from 79af0ff to 33f0ead Compare March 30, 2026 02:57

realkim93 force-pushed the feat/jetson-orin-nano-support branch from 33f0ead to 9d933d6 Compare March 30, 2026 09:51

realkim93 force-pushed the feat/jetson-orin-nano-support branch 4 times, most recently from 3ba47a9 to 9d933d6 Compare April 1, 2026 04:15

realkim93 force-pushed the feat/jetson-orin-nano-support branch from e034a56 to df08f88 Compare April 1, 2026 04:47

realkim93 and others added 6 commits April 1, 2026 19:36

fix: correct setup-jetson placement and apply Prettier formatting

fc8c790

- Move setup-jetson case into correct switch position (after setup-spark) - Apply Prettier formatting to all modified files - All 747 tests pass

Merge remote-tracking branch 'origin/main' into pr-405-restore

73ca60d

Merge branch 'main' into feat/jetson-orin-nano-support

2115aa4

realkim93 force-pushed the feat/jetson-orin-nano-support branch 2 times, most recently from f4a7c48 to 7b31aa5 Compare April 1, 2026 17:08

realkim93 and others added 3 commits April 2, 2026 09:06

Merge branch 'main' into feat/jetson-orin-nano-support

4542d9f

Merge branch 'feat/jetson-orin-nano-support' of github.com:realkim93/…

b7536f1

…NemoClaw into feat/jetson-orin-nano-support

realkim93 and others added 2 commits April 2, 2026 20:51

Conversation

realkim93 commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Verified working

Test plan

Uh oh!

coderabbitai bot commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

wscurran commented Mar 19, 2026

Uh oh!

realkim93 commented Mar 20, 2026

Uh oh!

realkim93 commented Mar 20, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

ericksoa commented Mar 21, 2026

Uh oh!

realkim93 commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ACCGAGTT commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

realkim93 commented Mar 29, 2026

Uh oh!

realkim93 commented Mar 29, 2026

Uh oh!

ACCGAGTT commented Mar 29, 2026

Uh oh!

cv commented Mar 30, 2026

Main blocker: onboard becomes destructive on reruns

Why that feels dangerous

Secondary concern: port 18789 handling

Testing concern

What I’d suggest

Uh oh!

realkim93 commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

1. Removed unconditional gateway destroy in preflight()

2. Restored port 18789 to requiredPorts

3. Fixed provider mapping for local providers

Scope of this fix

Uh oh!

realkim93 commented Mar 30, 2026

Uh oh!

realkim93 commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Latest changes (1f59809)

Verified on actual Jetson hardware

Uh oh!

realkim93 commented Apr 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

realkim93 commented Mar 19, 2026 •

edited

Loading

coderabbitai bot commented Mar 19, 2026 •

edited

Loading

realkim93 commented Mar 23, 2026 •

edited

Loading

ACCGAGTT commented Mar 25, 2026 •

edited

Loading

Main blocker: `onboard` becomes destructive on reruns

realkim93 commented Mar 30, 2026 •

edited

Loading

1. Removed unconditional gateway destroy in `preflight()`

2. Restored port 18789 to `requiredPorts`

realkim93 commented Mar 30, 2026 •

edited

Loading

Latest changes (`1f59809`)